Share
Key job responsibilities
- Work closely with applied scientists to scale pre-training of machine learning models on GPUs
- Optimize training workflows using highly distributed training techniques and frameworks
- Work closely with applied scientists and engineers to support inference use cases- Work in an agile environment to deliver high quality software for production use cases.A day in the life
As a member of the Delivery Foundation Model team, you’ll spend your day on the following:
- Develop and implement novel foundation model architectures, working hands-on with data and our extensive training and evaluation infrastructure
- Guide and support fellow scientists in solving complex technical challenges, from trajectory planning to efficient multi-task learning
- Guide and support fellow engineers in building scalable and reusable infra to support model training, evaluation, and inference
- Lead focused technical initiatives from conception through deployment, ensuring successful integration with production systems- Conduct experiments and prototype new ideas
- 5+ years of non-internship professional software development experience
- 5+ years of programming with at least one software programming language experience
- 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience as a mentor, tech lead or leading an engineering team
- 5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalent
- Excellent knowledge of theory and practice of ML
- Experience with CUDA
- Experience with Pytorch
- Experience with distributed training
These jobs might be a good fit