Proven experience as SRE, with a focus on operations management and demonstrate expertise in managing large-scale production outages and leading incident response.Experience in strategizing and achieving operational excellence in global distributed systems.Deep understanding of production monitoring systems, log analysis, and troubleshooting, support dashboards and proficiency in scripting languages and automation tools.Envision and build automation tools to deliver infrastructure services reliably and in a repeatable fashion. Utilize AI & ML models to gain Operations Excellence in application support.Be a problem-solver who is self-directed and capable of exhibiting deftness to handle multiple simultaneous competing priorities and deliver solutions in a timely manner.Provide guidance to improve the stability, efficiency and scalability of systems. Strong troubleshooting ability will be used daily.Determine future needs for capacity by closely reviewing upcoming application features and load.