We are looking for an SRE with experience building and supporting machine learning (ML) infrastructure. You will apply SRE best practices to ensure the availability, reliability, and performance of our ML systems and services. You will engage regularly with our development partners and product teams so that the ML services are well aligned with business needs.

Responsibilities will include:

- Support and maintain ML services by measuring and monitoring availability, latency, and overall system health
- Deploy and support existing and new ML models and infrastructure
- Provide insights to partner stakeholders through log and telemetry analysis
- Maintain documentation and automate manual processes where possible