Primary Role & Responsibilities:
• You are an DevOps Engineer with real interest and experience in building end to end automations for different SaaS team e.g. Server/Platform/Network/Storage and solve operational problems.
• You are comfortable writing code to automate API-driven tasks at scale. Python preferred.
• Architect and implement automations to auto-remediate/self-heal issues in production.
• You will participate in SRE software engineering, writing code for the continuing reduction of human intervention in operational tasks and automation of processes.
• Monitor the application ecosystem, jumping on bridges and resolving the issues.
• Having a good understanding of core DevOps and SRE practices and technologies.
Skills & Qualifications:
• Overall 4+ years of experience with DevOps and SRE practices, technologies, and industry standards to make production reliable and resilient.
• Having experience of core DevOps and SRE technologies like:
o Ansible
o Docker
o Kubernetes, Helm
o Jenkins
o Terraform
o IaaC via Terraform
• Good understanding of application logs and Kubernetes events, application, and infrastructure metrics (Prometheus/Grafana/FluentD). Good to Have
• Experience with Public Cloud like AWS, GCP, OCI etc is a great plus.
• Good understanding of Linux systems and Bash scripting.
• Ability to explain technical concepts to multiple audiences.