Primary Role & Responsibilities:
• You are an SRE Engineer with real interest and experience in troubleshooting BMC Helix Product issues, Databases, containers/Kubernetes, cloud technologies etc, and a proven interest and experience in using software engineering to solve operational problems.
• Architect and implement automation to auto-remediate/self-heal issues in production.
• You will participate in SRE software engineering, writing code for the continuing reduction of human intervention in operational tasks and automation of processes.
Skills & Qualifications:
• Overall 4+ years of experience with DevOps and SRE practices, technologies, and industry standards to make production reliable and resilient.
• Having experience of core DevOps and SRE technologies like:
o Python
o Ansible
o Docker
o Kubernetes, Helm
o Jenkins
o Terraform
o IaaC via Terraform
• Good understanding of application logs and Kubernetes events, application, and infrastructure metrics (Prometheus/Grafana).
• Should be willing to work in shift
Good to Have
• Experience with Public Cloud like AWS, GCP, OCI etc is a great plus.
• Good understanding of Linux systems and Bash scripting.
• Ability to explain technical concepts to multiple audiences.
• Having a good understanding of core ITSM and SRE practices and technologies.