What you'll be doing
- Design and optimize large-scale data platforms and machine learning infrastructure systems for efficiency, reliability, and cost-effectiveness.
- Lead improvements in infrastructure tooling by researching, prototyping, and implementing solutions that enhance operational excellence and development agility.
- Implement next-generation ML Ops capabilities in our model training, deployment, and serving pipelines.
- Champion infrastructure optimization projects across the company, driving significant cost savings while maintaining or improving service performance.
- Engage in on-call rotations, utilizing your deep understanding of infrastructure to identify and resolve complex issues quickly, minimize impact, and prevent future occurrences through automation and system enhancement.
What we're looking for
- Experience in infrastructure and back-end engineering, with a track record of building robust distributed systems.
- Daily hands-on experience with model training, deployment, experimentation, and serving tools for machine learning.
- Deep understanding of cloud infrastructure on GCP, AWS, or Azure, and proficiency in Go, Java, or Python.
- Experience with the latest data and machine learning technologies such as Google's Vertex AI, Ray, and SageMaker.
- Experience building Kubernetes infrastructure stacks; experience in the ML or Data domain and its associated technologies is advantageous.
You might also have
- An 'in-it-together' mentality that appreciates engineering as a collective effort.
- Hands-on experience with tools such as Vertex AI, Protobuf, Kafka, Flink, BigQuery, Druid, Feast, Kubeflow, Ray, or KServe.
- Experience with Terraform, ArgoCD, GitHub Actions, and similar tools used for CI/CD.
Additional information
- Relocation support is not available for this position.
- International relocation support is not available for this position.
- Work visa/immigration sponsorship is available for this position.
Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
Gross pay salary: $111,000–$211,300 USD