Expoint - all jobs in one place

Amazon Software Engineer II - AI/ML, AWS Neuron Distributed Training
United States, California, Cupertino 
370831681

05.08.2024
DESCRIPTION

AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine learning accelerators.
The ML Apps team works side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed training solutions with Trn1. Experience training these large models using Python is a must. FSDP, DeepSpeed and other distributed training libraries are central to this, and extending all of this for the Neuron-based system is key.

Key job responsibilities

Work/Life Balance

Mentorship & Career Growth

BASIC QUALIFICATIONS

- 3+ years of non-internship professional software development experience
- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience programming with at least one software programming language


PREFERRED QUALIFICATIONS

- Bachelor's degree in computer science or equivalent
- Machine learning knowledge, including frameworks and end-to-end model training