Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

IBM Site Reliability Engineer Lead 
Egypt, Cairo, Cairo 
37671220

06.01.2025

In this role, you’ll work in one of our IBM Consulting Client Innovation Centers (Delivery Centers), where we deliver deep technical and industry expertise to a wide range of public and private sector clients around the world.​ Our delivery centers offer our clients locally based skills and technical expertise to drive innovation and adoption of new technology.


• Lead and manage the Site Reliability Engineering (SRE) team to ensure high availability, reliability, and performance of systems on the Azure platform.
• Design and implement strategies for monitoring, alerting, and incident response.
• Collaborate with development and operations teams to establish SRE best practices and optimize system reliability.
• Automate infrastructure provisioning, configuration, and deployment using Infrastructure as Code (IaC).
• Drive root cause analysis and implement solutions for production issues.
• Develop and maintain SLOs, SLIs, and SLAs to improve service reliability and performance; implement and oversee CI/CD pipelines for streamlined software delivery.


Required Technical and Professional Expertise

  • +10 years in a DevOps/SRE role, ideally in Azure Cloud environments and cloud-native architectures such as microservices.
  • Expertise in Azure cloud services , including AKS, Azure DevOps, Azure Monitor, and App Services; strong knowledge of Kubernetes and containerization technologies (Docker).
  • Proficiency in scripting and programming languages (e.g., Python, Bash, PowerShell).
  • Hands-on experience with Infrastructure as Code (IaC) tools such as Terraform or ARM templates.
  • Solid understanding of DevOps practices and CI/CD pipelines.
  • Experience with logging, monitoring, and observability tools (e.g., Prometheus, Grafana, Splunk, ELK); strong troubleshooting and performance tuning skills in cloud-based environments.


Preferred Technical and Professional Expertise
• Experience withmicroservices architectureand distributed systems;
– k
nowledge ofsecurity best practicesin cloud environments.
– familiarity withAzure Policyand governance practices.
– expertise in database performance tuning (e.g., Azure SQL, Cosmos DB).
• Leadership experience in managing cross-functional teams.
Certifications such as,, or
CKA (Certified Kubernetes Administrator).

Exposure to AI/ML workloads on Azure; experience with hybrid or multi-cloud environments; knowledge of serverless architecture (e.g., Azure Functions).