- We are looking for a customer-obsessed Site Reliability Engineer with extensive experience implementing Service Level Objective (SLO) monitoring solutions for top Azure customers.
- Experience: At least 6 years of experience driving platform reliability and customer satisfaction through proactive engagement, technical resolution, and cross-functional collaboration. Skilled in observability, automation, and translating operational insights into meaningful customer outcomes.
- 3+ years of experience designing observability and monitoring solutions in Azure (or AWS/GCP); SLO/SLI implementation experience is a plus.
- 3+ years of experience in an external client-facing or customer-handling role.
- Degree: Bachelor’s or Master’s degree in computer engineering (or equivalent).
- Customer Obsession: Passion for customers and focus on delivering the right customer experience.
- Growth Mindset: Openness and ability to learn new skills and technologies in a fast-paced environment.
- Excellent Communication: Must be able to empathize with customers and convey confidence, and to explain highly technical issues to varied audiences. Able to prioritize customers’ needs and advocate for them through the proper channels. Takes ownership and works towards a resolution.
- Technical Skills:
- Proven expertise in implementing and managing Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for cloud customers.
- Proven experience in designing and implementing monitoring solutions for customers.
- Extensive experience with monitoring tools and platforms.
- Advanced certifications in SRE or related fields.
- Experience with observability, SRE, OpenTelemetry, Prometheus, Grafana, Dynatrace, Datadog, Azure Monitor, AI, and ML.