The point where experts and best companies meet
Share
The ML Apps team works side by side with chip architects, compiler engineers and runtime engineers to create , build and tune distributed training solutions with Trn1. Experience training these large models using Python is a must. FSDP, Deepspeed and other distributed training libraries are central to this and extending all of this for the Neuron based system is key.Key job responsibilitiesWork/Life Balance
Mentorship & Career Growth
- BASIC QUALIFICATIONS
- - 3+ years of non-internship professional software development experience
- - 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- - Experience programming with at least one software programming language
- - 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- - 2+ years of Machine Learning expertise with prior work on key ML frameworks (Pytorch, MxNet etc) and experience with distributed training
These jobs might be a good fit