Finding the best job has never been easier
Share
The ideal candidate is clearly passionate about new opportunities and has a demonstrable track record of success in delivering new features and products. A commitment to team work, hustle, and strong communication skills (to both business and technical partners) are absolute requirements. Creating reliable, scalable, and high performance products requires exceptional technical expertise, a sound understanding of the fundamentals of Computer Science, and practical experience building large-scale distributed systems. This person has thrived and succeeded in delivering high quality technology products/services in a hyper-growth environment where priorities shift fast.Key job responsibilities
- Responsible for pre-training multimodal LLMs
- Work closely with Applied scientists to scale pre-training of machine learning models on GPUs while optimizing the training workflows using highly distributed training techniques and frameworks (Like FSDP, NVIDIA NeMo, Megatron Core etc)- Will work in an Agile/Scrum environment to deliver high quality software against aggressive schedules.
- 5+ years of non-internship professional software development experience
- 5+ years of programming with at least one software programming language experience
- 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience as a mentor, tech lead or leading an engineering team
- 2+ years of expertise in Machine Learning and model training
- 5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalent
- Expertise in training Generative AI vision models
These jobs might be a good fit