Expoint - all jobs in one place

מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר

Limitless High-tech career opportunities - Expoint

Amazon Sr Software Development Manager AWS Neuron 
United States, California, Cupertino 
991627564

10.06.2024
DESCRIPTION

AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine
Responsible for the full development life cycle of our integrations and extensions for inference and training support in Pytorch, XLA, JAX as well as distributed training libraries like FSDP, DDP and others.
Characterization, enablement and development of existing and future massive-scale ML models like GPT3 as well as BERT, ViT, LLama, Stable Diffusion and more
Lead the way to ensure support for key ML functionality in a combined chip / software platform
Key job responsibilities
Responsible for the full development life cycle of our integrations and extensions for training support in Pytorch, XLA, Tensorflow as well as distributed training libraries like FSDP, DDP and others.
Characterization, enablement and development of existing and future massive-scale ML models like GPT3 as well as BERT, ViT, Stable Diffusion and more
Lead the way to ensure support for key ML functionality in a combined chip / software platform
Ensure the right thing is being built and delivered to customersA day in the life
You will work with the executive leadership and other senior management and technical leaders to define product directions and deliver them to customers. We build massive-scale distributed training and inference solutions. This organization builds the full stack of software, servers and chips to accelerate at the highest scale.
Work/Life Balance
Mentorship & Career Growth


BASIC QUALIFICATIONS

- 10+ years of engineering experience
- 5+ years of engineering team management experience
- 10+ years of planning, designing, developing and delivering consumer software experience
- Experience partnering with product or program management teams
- Experience managing multiple concurrent programs, projects and development teams in an Agile environment