As a Site Reliability Engineer you will be responsible for providing the platform for critically important ad-tech systems to maintain constant uptime, scale seamlessly, and allow for new applications and services to flourish. Key Responsibilities: - Implement and improve our infrastructure and application monitoring and observability capabilities that results in improving our reliability. - Engage with application engineering teams to improve service operability and reliability, on-call efficiencies, drive incident management, and post-mortem analysis. - Drive production readiness, and improve key areas like capacity planning, configuration management, and observability - Design and improve architectures of new and existing systems based on the principles of reliability and high availability with extensive logging and observability. - Develop expertise in Apple Infrastructure and best practices and bring that to Ad Platforms to run a world class distributed systems. - Create tooling and automation to improve the operations and operability of our infrastructure and applications.