Design, implement, and manage robust observability solutions using tools such as Dynatrace, Grafana, Prometheus, and Site24x7.
Develop, maintain, and improve monitoring, alerting, and incident response processes to ensure system reliability and minimize downtime.
Collaborate with development teams to enhance application performance, scalability, and reliability through proactive monitoring insights.
Manage and optimize cloud infrastructure across AWS and Azure using Infrastructure as Code (IaC) tools like Terraform and Bicep.
Develop and maintain CI/CD pipelines to automate software deployments and infrastructure updates.
Write and maintain automation scripts in Bash, Shell, and Python to support operational efficiency.
Identify performance bottlenecks, implement improvements, and support post-incident reviews to drive continuous improvement.
Work closely with security teams to ensure compliance, security best practices, and data protection in cloud environments.
Maintain comprehensive documentation for observability configurations, automation processes, and cloud infrastructure standards.
What you bring
Proven experience as an SRE, DevOps Engineer, or similar role in cloud environments.
Expertise in observability tools such as Dynatrace, Grafana, Prometheus, and Site24x7 for performance monitoring and alerting.
Strong proficiency in AWS and Azure cloud services.
Hands-on experience with Terraform, CloudFormation, or Bicep for Infrastructure as Code (IaC).
Proficient in CI/CD tools such as Jenkins, GitLab CI, or Azure DevOps.
Solid scripting skills in Bash, Shell, and Python for automation tasks.
Strong troubleshooting skills with a focus on performance tuning and incident management.
Experience in securing cloud environments and implementing compliance best practices.
Preferred Qualifications
Any certifications such as AWS Certified DevOps Engineer – Professional, AWS Certified Solutions Architect – Professional, Microsoft Certified: Azure DevOps Engineer Expert, or Microsoft Certified: Azure Solutions Architect Expert.
The position can be filled as a part time position.
What We Offer
An opportunity to shape the future of Sovereign Cloud initiatives in a dynamic and collaborative environment.
A role that combines cutting-edge cloud technologies with innovative automation and observability solutions.
Flexible working arrangements and competitive compensation.
Opportunities for career growth within our expanding SAT team structure.