Expoint - all jobs in one place

מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר

Limitless High-tech career opportunities - Expoint

Amazon Software Development Engineer Frontier AI & Robotics 
United States, California, San Francisco 
785122619

25.02.2025
DESCRIPTION

In this role, you'll balance deep technical optimization work with strategic input on model architecture decisions, ensuring our innovative robotics models are designed with performance in mind from the ground up. You'll leverage NVIDIA's acceleration stack and other compilation techniques to tackle ambitious performance targets, working at the intersection of large language models and real-world robotics applications.
Key job responsibilities
- Drive inference optimization strategies for large-scale foundation models using TensorRT, CUDA, and other NVIDIA tools- Design and implement efficient compilation pipelines for complex transformer architectures
- Develop comprehensive benchmarking frameworks to measure and optimize model performance
- Build robust monitoring solutions to ensure reliable model serving at scale
- Explore and evaluate emerging optimization techniques including ONNX Runtime and other ML compilers
- Maintain high engineering standards through proper testing, documentation, and code review practicesA day in the life
- Optimize transformer blocks using custom CUDA kernels and TensorRT optimization techniques
- Partner with scientists to analyze model architectures and propose efficiency improvements
- Implement and benchmark various optimization strategies for large-scale models
- Debug performance bottlenecks using NVIDIA profiling tools- Design and maintain performance monitoring systems for production deployment
- Prototype new acceleration approaches using emerging compilation frameworks
1. Medical, Dental, and Vision Coverage
2. Maternity and Parental Leave Options
3. Paid Time Off (PTO)
4. 401(k) Plan

BASIC QUALIFICATIONS

- Bachelor's degree in computer science or equivalent
- 5+ years of non-internship professional software development experience
- 5+ years of programming with at least one software programming language experience
- 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience as a mentor, tech lead or leading an engineering team
- Strong expertise in Python, C++ and CUDA programming
- Experience with TensorRT or similar ML optimization frameworks
- Track record of optimizing ML models for production


PREFERRED QUALIFICATIONS


Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.