Expoint – all jobs in one place
Finding the best job has never been easier

AGI Sensory Inference Software Development Engineering jobs at Amazon in Pittsburgh, United States

Discover your perfect match with Expoint. Search for AGI Sensory Inference Software Development Engineering job opportunities in Pittsburgh, United States, and join the network of leading companies in the high-tech industry, like Amazon. Sign up now and find your dream job with Expoint.
4 jobs found
25.04.2025

Amazon: AGI Sensory Inference Software Development Engineering (United States, Pennsylvania, Pittsburgh)

DESCRIPTION

Key job responsibilities
• Develop high-performance inference software for a diverse set of neural models, typically in C/C++
• Design, prototype, and evaluate new inference engines and optimization techniques
• Participate in deep-dive analysis and profiling of production code
• Optimize inference performance across various platforms (on-device, cloud-based CPU, GPU, proprietary ASICs)
• Collaborate closely with research scientists to bring next-generation neural models to life
• Partner with internal and external hardware teams to maximize platform utilization
• Work in an Agile environment to deliver high-quality software
• Hold a high bar for technical excellence within the team and across the organization

BASIC QUALIFICATIONS

- 3+ years of non-internship professional software development experience
- 2+ years of non-internship experience in the design or architecture (design patterns, reliability, and scaling) of new and existing systems
- Experience programming in at least one programming language
- Bachelor's degree in Computer Science, Computer Engineering, or a related field
- Strong C/C++ programming skills
- Solid understanding of deep learning architectures (CNNs, RNNs, Transformers, etc.)


PREFERRED QUALIFICATIONS

- 3+ years of experience with the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
- Experience with inference frameworks such as PyTorch, TensorFlow, ONNX Runtime, TensorRT, llama.cpp, etc.
- Proficiency in performance optimization for CPU, GPU, or AI hardware
- Proficiency in kernel programming for accelerated hardware using programming models such as (but not limited to) CUDA, OpenMP, OpenCL, Vulkan, and Metal
- Experience with latency-sensitive optimizations and real-time inference
- Understanding of resource constraints on mobile/edge hardware
- Knowledge of model compression techniques (quantization, pruning, distillation, etc.)
- Experience with LLM efficiency techniques like speculative decoding and long context
- Strong communication skills and ability to work in a collaborative environment
- Passion for solving complex problems and driving innovation in AI technology

18.04.2025

Amazon: Applied Scientist AGI - Neural Efficiency Science (United States, Pennsylvania, Pittsburgh)

DESCRIPTION

- Pioneer new approaches to foundation models
- Publish and present research at top-tier conferences and journals
- Work with state-of-the-art LLMs and multi-modal foundation models
- Access to substantial computational resources for research

Key job responsibilities
- Research and develop novel techniques for efficient runtime inference (low latency, high throughput)
- Design and evaluate efficient foundation model architectures
- Create new methods for improving training efficiency
- Conduct experimental studies to validate efficiency improvements
- Write high-quality Python code to implement research ideas
- Author technical documentation and research papers
- Present findings to technical and non-technical stakeholders
A day in the life
Your day might start with a team stand-up to discuss ongoing projects and brainstorm solutions to technical challenges. You'll spend time implementing and testing new efficiency optimization techniques in Python, analyzing performance metrics, and iterating on approaches. You'll collaborate with team members to review code and research results, participate in technical discussions about architecture designs, and engage with other AGI teams to understand their efficiency needs. You might end your day analyzing experimental results or writing up findings for a research paper. Throughout the week, you'll have opportunities to present your work to stakeholders and contribute to the team's research roadmap.


BASIC QUALIFICATIONS

- 3+ years of experience building models for business applications
- PhD, or Master's degree and 4+ years of experience in CS, CE, ML, or a related field
- Experience programming in Java, C++, Python or related language
- Experience in any of the following areas: algorithms and data structures, parsing, numerical optimization, data mining, parallel and distributed computing, high-performance computing


Find your dream job in the high-tech industry with Expoint. With our platform, you can easily search for AGI Sensory Inference Software Development Engineering opportunities at Amazon in Pittsburgh, United States. Whether you're seeking a new challenge or looking to work with a specific organization in a specific role, Expoint makes it easy to find your perfect job match. Connect with top companies in your desired area and advance your career in the high-tech field. Sign up today and take the next step in your career journey with Expoint.