Expoint – all jobs in one place
Finding the best job has never been easier
Limitless High-tech career opportunities - Expoint

Amazon Senior Software Development Engineer AI/ML AWS Neuron Model Inference 
United States, Washington, Seattle 
969560148

14.10.2025
Description

You can learn more about NeuronKey job responsibilities
In this role, you will:
* would with state of the art LLMs, Open source and internal LLM families, large scale performance and benchmark evaluations etc.,* develop and performance tune a wide variety of LLM model families, including 500B+ large language models like the Llama family, DeepSeek and beyond.* work side by side with performance, compiler and runtime engineers to create, build and tune distributed inference solutions with Trainium and Inferentia.* build infrastructure to systematically analyze and onboard multiple models with diverse architecture.* collaborate with performance team to enable and evaluate optimizations such as fusion, sharding, tiling, and scheduling etc.,* conduct comprehensive testing, including unit and end-to-end model testing with continuous deployment and releases through pipelines.* work directly with customers to enable and optimize their ML models on AWS accelerators* collaborate across teams to develop innovative optimization techniques* Build online/offline inference serving with vLLM, SGLang, TensorRT or similar platforms in production environments.
A day in the life
You will also build high-impact solutions to deliver to our large customer base and participate in design discussions, code review, and communicate with internal and external stakeholders. You will work cross-functionally to help drive business decisions with your technical input. You will work in a startup-like development environment, where you’re always working on the most important initiative.

Basic Qualifications

- 5+ years of non-internship professional software development experience
- 5+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Fundamentals of Machine learning and LLMs, their architecture, training and inference lifecycles along with work experience on some optimizations for improving the model execution.
- Experience programming with at least one software programming language


Preferred Qualifications

- 5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Masters degree in computer science or equivalent