As a Site Reliability Engineer, you will be a pivotal contributor to the reliability and scalability of the backend services that underpin machine learning models, safeguarding against abuse and fraud by providing and maintaining state of the art cloud-based infrastructure services and automation tools. You will collaborate with engineering and machine learning teams to translate requirements into resilient infrastructure designs, and subsequently deploy those systems employing modern system reliability engineering practices. Whether it involves constructing automation to manage extensive service deployments, implementing observability frameworks to proactively identify and resolve issues, or driving performance improvements across distributed systems you will be at the forefront of operational perfection. You will assume ownership of end-to-end service health, participating in on-call rotations and leading technical incident response when vital.