The point where experts and best companies meet

Limitless High-tech career opportunities - Expoint

Amazon Sr Software Engineer - AI/ML AWS Neuron Inference Multimodal
United States, Washington, Seattle
877474833

04.09.2024

DESCRIPTION

The ML Inference team works side by side with chip architects, compiler engineers and runtime engineers to create, build and optimize distributed inference solutions with Trainium/Inferentia instances. Experience with training and optimizing inference on these large models using Python/C++ is a must. Model parallelization, quantization, memory optimization - vLLM, DeepSpeed and other distributed inference libraries can be central to this and extending all of them for the Neuron based system is the key.Key job responsibilities
You will help leading the efforts to build and achieve the best distributed training and inference performance of PyTorch, JAX, TensorFlow with XLA and other advanced frameworks on Neuron stacks. You will help optimizing these models to ensure the highest performance and maximize the efficiency of them running on the custom AWS Trainium and Inferentia silicon and the Trn1, Inf1/2 servers. Strong software development (Python and C++) and Machine Learning knowledge (Multimodal, Computer Vision, Speech) are both critical to this role.
Work/Life BalanceMentorship & Career Growth

BASIC QUALIFICATIONS

- Bachelor's degree in computer science or equivalent
- 5+ years of non-internship professional software development experience
- 5+ years of programming with at least one software programming language experience
- 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- 5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Experience as a mentor, tech lead or leading an engineering team
- Experience in machine learning, data mining, information retrieval, statistics or natural language processing

PREFERRED QUALIFICATIONS

- Master's degree in computer science or equivalent
- Experience in computer architecture
- Previous software engineering expertise with Pytorch/Jax/Tensorflow, Distributed libraries and Frameworks, End-to-end Model Training.

These jobs might be a good fit

Amazon Software Engineer - AI/ML AWS Neuron Inference Multimodal United States, Washington, Seattle

Amazon Software Development Engineer - AI/ML AWS Neuron Multimodal ... United States, Washington, Seattle

Amazon Software Engineer - AI/ML AWS Neuron Distributed Training Mu... United States, Washington, Seattle

Professional CV Builder tool from Expoint.

Get to the top of the "yes list" with a standout CV!

CREATE CV

Amazon Sr Software Engineer - AI/ML AWS Neuron Inference Multimodal United States, Washington, Seattle 877474833

Amazon Software Engineer - AI/ML AWS Neuron Inference Multimodal United States, Washington, Seattle

Amazon Software Development Engineer - AI/ML AWS Neuron Multimodal ... United States, Washington, Seattle

Amazon Software Engineer - AI/ML AWS Neuron Distributed Training Mu... United States, Washington, Seattle

Amazon Software Engineer - AI/ML AWS Neuron Distributed Training Mu... United States, Washington, Seattle

Amazon Sr Software Engineer - AI/ML AWS Neuron Inference Multimodal
United States, Washington, Seattle
877474833