Expoint - all jobs in one place

מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר

Limitless High-tech career opportunities - Expoint

IBM Lead Site Reliability Engineer 
Egypt, Cairo, Cairo 
13590888

Today

In this role, you’ll work in one of our IBM Consulting Client Innovation Centers (Delivery Centers), where we deliver deep technical and industry expertise to a wide range of public and private sector clients around the world.​ Our delivery centers offer our clients locally based skills and technical expertise to drive innovation and adoption of new technology.


  • Lead Observability Efforts:Design, set up and maintain comprehensive observability solutions (logs, traces, metrics and dashboarding/visualization) across nonprod and prod environments
  • Platform Resiliency:Ensure the resilience of our platform by proactively monitoring, detecting, and resolving potential issues before they impact services.
  • Cost optimization: Implement strategies to optimize costs in our Azure cloud environment, identifying areas for improvement.
  • Cross-Team Collaboration: Work closely with development and platform engineering teams to align on goals and ensure smooth, scalable operations.
  • Automation and Tools Development: Develop and implement internal tools and bots to automate processes and improve efficiency across the organization.


Required Technical and Professional Expertise

  • : +5 years in a DevOps/SRE role, ideally in Azure Cloud environments and cloud-native architectures such as microservices.
  • Observability Expertise:Hands-on experience with observability tools such as Azure Monitor, OpenTelemetry, Prometheus, Jaeger, Kiali, and Grafana stack.
  • Containerization: Strong knowledge of container orchestration tools like Kubernetes, Helm, and Docker.
  • Scripting & Automation: Proficiency in scripting languages (e.g., PowerShell, Python, Go, Bash) for automating tasks and workflows.
  • Problem Solving: Proven ability to independently debug and resolve complex issues.

Preferred Technical and Professional Expertise

  • : Experience with Terraform, Terragrunt, and other IaC tools.
  • Service Mesh Knowledge: Familiarity with Istio and related service mesh technologies.
  • Azure Cloud Services: Working knowledge of Azure services such as FuncApps, Container Apps, Logic Apps, Data Factory, Event Hubs, and Event Grid, with a strong understanding of best practices.
  • CI/CD Pipelines: Experience configuring and maintaining CI/CD pipelines, preferably with Azure DevOps.