Oncall Specialist

Dhaka, Dhaka Division, Bangladesh | System Operations | Full-time

Apply

 

We are seeking skilled and dedicated Oncall Specialists to join our team. The ideal candidate will be proficient in troubleshooting issues within their scope, communicating with other teams, escalating incidents when necessary, documenting incidents, and continuously improving processes. This role is crucial in maintaining the seamless operation and dependability of Therap's critical systems, which employ industry-leading technologies from Oracle, VMware, F5, Fortigate, Cisco, and NetApp.

 

Responsibilities:

  • Monitor system infrastructure using various monitoring tools to track system performance, server health, network traffic, and application status on a 24x7x365 basis.

  • Perform initial troubleshooting and diagnostics to identify and resolve issues within their scope, utilizing knowledge of system administration, network engineering, and application support.

  • Collaborate and communicate with other teams during incident resolution, escalate when necessary, and provide comprehensive documentation of the issue.

  • Continuously review and update monitoring configurations, alert thresholds, and escalation procedures to adapt to changes in the infrastructure and application landscape.

Requirements:

  • Basic understanding of Linux operating systems and command-line interface is required.
  • Bachelor's degree in Computer Science, Information Technology, or a related field.

  • Willingness to work in shifts, ensuring 24x7x365 coverage (nights, weekends, and holidays) and be available for Oncall duties as required.

  • Familiarity with monitoring and diagnostic tools, such as log analyzers, performance monitoring applications, and network analysis tools.

  • Troubleshooting and problem-solving skills, strong communication and interpersonal skills, with the ability to collaborate effectively with cross-functional teams.

By joining the Oncall Operations Team as a specialist, you will play a significant role in ensuring the continued smooth operation of our critical systems, contributing to the overall success of our organization.