Expoint - all jobs in one place

IBM Site Reliability Engineer Hybrid 
Costa Rica 
924695533

Today

Your Role and Responsibilities
  • Troubleshoot, monitor, and support critical production systems.
  • Perform root cause analysis and manage incidents to ensure timely resolution.
  • Provision and deploy environments in a cloud infrastructure (preferably IBM Cloud).
  • Handle initial intake for Salesforce-related customer cases, ensuring SLA commitments are met.
  • Provide on-call support in a rotation shared with global teams (including Poland and Costa Rica) to minimize MTTR (Mean Time to Recovery).
  • Manage workloads and resources to maintain commitments and prevent SLA breaches.


Required Technical and Professional Expertise

  • Strong working knowledge of Kubernetes and cloud infrastructure, preferably IBM Cloud.
  • Expertise in administration, configuration, and management of MS SQL Server 2022.
  • Proven experience providing on-call support for critical production systems, with a focus on root cause analysis (RCA).
  • Expertise in automation platforms such as AWX.
  • Proficiency in scripting languages like Python and related tools.
  • Strong problem-solving skills and attention to detail.
  • Fluent English required.


Preferred Technical and Professional Expertise

  • Familiarity with Salesforce infrastructure and case management processes.
  • Experience with monitoring tools and incident management platforms.
  • Ability to work efficiently in a global, distributed team environment.