In this role, you will be building infrastructure to support product-focused machine learning projects. You will build systems that leverage machine learning to index terabytes of data for projects in domains like image generation, LLMs, computer vision, natural language processing, human-computer interaction and text recognition. You will define and build out systems for analysis of failure modes of algorithms built upon this data, and for reporting overall benchmarking results for model comparisons. The technology you build will play a major role in defining important datasets, ingestion of annotated/inferred data into our systems, and making the data available to machine learning scientists in a seamless manner. This role requires a diverse set of skills, from tackling low-level distributed computing challenges at bare metal, to contributing to internal client user experiences by building stable interfaces, and everything in between.