Key Responsibilities:
- Implement and improve our infrastructure and application monitoring and observability capabilities to improve reliability.
- Engage with application engineering teams to improve service operability, reliability, and on-call efficiency, and to drive incident management and post-mortem analysis.
- Drive production readiness and improve key areas such as capacity planning, configuration management, and observability.
- Design and improve the architecture of new and existing systems based on the principles of reliability and high availability, with extensive logging and observability.
- Develop expertise in Apple Infrastructure and best practices, and bring that expertise to Ad Platforms to run world-class distributed systems.
- Create tooling and automation to improve the operations and operability of our infrastructure and applications.