המקום בו המומחים והחברות הטובות ביותר נפגשים
AWS Neuron is the complete software stack for the AWS Inferentia and Trainium (Neuron) cloud-scale machine learning accelerators.As a Sr. SDM of Software Development for the Machine Learning Distributed Training, Core Technologies and Infra org, you will be responsible for leading a strong teams of software engineers and managers to help design and deploy a software that enables ML workloads work seamlessly on these new products.Key job responsibilities
Responsible for the full development life cycle of our integrations and extensions for training support in Pytorch, XLA, JAX as well as distributed training libraries like FSDP and others.
In charge of. characterization, enablement and development of existing and future massive-scale ML models like Claude 3, GPT4 as well as ViT, Llava, Stable Diffusion3 and more.
Lead the way to ensure support for key ML functionality in a combined chip / software platform
A day in the life
You will work with the executive leadership and other senior management and technical leaders to define product directions and deliver them to customers. We build massive-scale distributed training and inference solutions. This organization builds the full stack of software, servers and chips to accelerate at the highest scale.
Work/Life Balance
Mentorship & Career Growth
- 10+ years of engineering experience
- 5+ years of engineering team management experience
- 10+ years of planning, designing, developing and delivering consumer software experience
- Experience partnering with product and program management teams
- Experience managing multiple concurrent programs, projects and development teams in an Agile environment
- Experience designing and developing large scale, high-traffic applications
- 5+ Years in Industry Experience in Machine/Deep Learning software/framework and/or Infra.
משרות נוספות שיכולות לעניין אותך