Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

Microsoft Site Reliability Engineer II 
India, Telangana, Hyderabad 
505958884

16.07.2024
Qualifications
  • 3+ years of experience in designing Observability and monitoring solutions in Azure(or AWS/GCP), SLO/SLI Implementation is a plus.

    3+ years of experience in an external client facing role or customer handling.

  • Bachelor's Degree in Computer Science, Information Technology, or related field
  • Master's Degree in Computer Science, Information Technology, or related field.
Responsibilities
  • Collaborate with customers to jointly define and establish SLOs and SLIs that align with their business goals and expectations.
  • Instrument code to measure SLOs , develop solutions to detect SLO breaches
  • Develop automated solutions and troubleshooting guides to remediate or mitigate SLO breaches.
  • Collaborate closely with service engineering teams to develop solutions for corelating customer-defined SLOs with relevant platform SLOs, signals to effectively pinpoint, address, and resolve customer-impacting issues.
  • Ensure customer-centric SLOs are consistently exceeded through cross-functional collaboration.
  • Analyze SLO data for trends, improvements, and reliability risks, proposing remediation plans.
  • Proactively engage customers on SLO performance, addressing concerns and offering insights.
  • Lead optimization efforts for system performance, scalability, and efficiency to exceed SLOs.
  • Develop and maintain documentation related to customer-specific SLOs, SLIs, and monitoring processes.