In this role, you'll lead the management of AWS/GCP infrastructure, oversee the administration of Kubernetes clusters, and mentor a team of developers in Infrastructure as Code practices. You'll ensure reliability through advanced monitoring and refine CI/CD pipelines.
You'll also drive innovation by integrating cutting-edge technologies into our development lifecycle.
Responsibilities:
- Infrastructure Management: Lead the management and maintenance of our AWS/GCP infrastructure, including large-scale Kubernetes clusters.
- Developer Enablement: Mentor and guide developers in using IaC technologies like Terraform for efficient environment creation and management.
- Monitoring and Observability: Implement and oversee advanced monitoring, alerting, and observability solutions using tools such as Datadog, Splunk, Prometheus, and Grafana.
- CI/CD Pipeline Management: Refine and optimize our complex CI/CD pipelines, working with systems such as Jenkins and Argo CD.
- Innovation and Development: Drive innovation by researching and integrating new technologies to enhance our development lifecycle, environments, and production systems.
- Team Leadership: Provide technical leadership and mentorship to junior DevOps engineers, fostering a collaborative and growth-oriented environment.
Requirements:
- 4+ years of experience in high-scale production environments
- Advanced proficiency in high-level programming languages (preferably Python/Go)
- Extensive production experience with Kubernetes and Helm
- Proven experience with public cloud providers (preferably AWS/GCP)
- Expertise in CI/CD methodologies, GitOps, and Infrastructure as Code (IaC)
- Excellent problem-solving and debugging skills
- Strong leadership, communication, and documentation skills