Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

Amazon Software Development Engineer II AI/ML AWS Neuron Distributed Training 
United States, California, Cupertino 
403173735

14.08.2024
DESCRIPTION

The ML Apps team works side by side with chip architects, compiler engineers and runtime engineers to create , build and tune distributed training solutions with Trn1. Experience training these large models using Python is a must. FSDP, Deepspeed and other distributed training libraries are central to this and extending all of this for the Neuron based system is key.Key job responsibilitiesWork/Life Balance
Mentorship & Career Growth

BASIC QUALIFICATIONS

- BASIC QUALIFICATIONS
- - 3+ years of non-internship professional software development experience
- - 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- - Experience programming with at least one software programming language


PREFERRED QUALIFICATIONS

- - 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- - 2+ years of Machine Learning expertise with prior work on key ML frameworks (Pytorch, MxNet etc) and experience with distributed training