Expoint - all jobs in one place


Amazon Software Development Engineer - AI Hyperpod Engines 
United States, California 
41577744

Yesterday
DESCRIPTION

Amazon AI is looking for a software engineer to join the HyperPod Engines team, which is building a resilient platform for deep learning training. Amazon SageMaker HyperPod scales and accelerates generative AI model development across thousands of AI accelerators. As part of the HyperPod Engines team, you will develop training frameworks and communication libraries. You will work on training frameworks like PyTorch, NeMo, and Megatron, and on collective communication libraries like NCCL. You will develop software to train and fine-tune large language models like Llama.

As part of the team, you will work in a fast-paced, cross-disciplinary group of engineers and researchers who are leaders in the field. You will take on challenging problems, distill real requirements, and then deliver solutions that either leverage existing academic and industrial research or draw on your own out-of-the-box, pragmatic thinking. In addition to coming up with novel solutions and prototypes, you will deliver these to production in customer-facing products.

Key job responsibilities
As a Software Development Engineer on the HyperPod Engines team, you will be responsible for a major portion of existing and/or new software. Responsibilities include, but are not limited to, creating and modifying a large or significant set of components, a mid-size application, or a service. You will develop model training optimizations such as context parallelism, pipeline parallelism, and tensor parallelism.

About the team
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.

BASIC QUALIFICATIONS

- 3+ years of non-internship professional software development experience
- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience programming with at least one software programming language


PREFERRED QUALIFICATIONS

- 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalent
- Experience in machine learning, data mining, information retrieval, statistics or natural language processing
- Experience with large language models, model training, fine-tuning, and training library development; familiarity with model optimizations using model-parallel and data-parallel techniques