Expoint – all jobs in one place
מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר
Limitless High-tech career opportunities - Expoint

Amazon Principal Applied Scientist Hardware Silicon Systems Group 
India, Karnataka 
227660183

14.10.2025
Description

As Principal Applied Scientist you will blend expertise at the intersection of ML and hardware optimization for model training, build cutting-edge architectures for vision, language, and multi-modal tasks. Role requires a specialist in hardware-aware quantization, with hands-on experience in model compression techniques like pruning and distillation. You will be responsible for computer architecture, ML accelerator designs, efficient inference algorithms and low-precision arithmetic.
Key job responsibilities
As a Principal Applied Scientist, you will:
• Own the technical architecture and optimization strategy for ML models deployed across Amazon's device ecosystem, from existing to yet-to-be-shipped products.
• Develop novel model architectures optimized for our custom silicon, establishing new methodologies for model compression and quantization.
• Create an evaluation framework for model efficiency and implement multimodal optimization techniques that work across vision, language, and audio tasks.
• Define technical standards for model deployment and drive research initiatives in model efficiency to guide future silicon designs.
• Spend the majority of your time doing deep technical work - developing novel ML architectures, writing critical optimization code, and creating proof-of-concept implementations that demonstrate breakthrough efficiency gains.
• Influence architecture decisions impacting future silicon generations, establish standards for model optimization, and mentor others in advanced ML techniques.

Basic Qualifications

Basic Qualifications:
• Masters degree in Computer Science, Electrical Engineering, or a related technical field
• 8+ years of experience in machine learning, with a focus on model architecture design, optimization, and deployment
• Expertise in developing and deploying deep learning models for real-world applications, including vision, language, and multi modal tasks
• Strong background in computer architecture, hardware acceleration, and efficient inference algorithms
• Hands-on experience with model compression techniques such as pruning, quantization, and distillation
• Proficiency with deep learning frameworks like TensorFlow, PyTorch, or ONNX


Preferred Qualifications

• PhD in Computer Science, Electrical Engineering, or a related technical field
• 10+ years of experience in machine learning, with a track record of developing novel model architectures and optimization techniques
• Proven expertise in co-designing ML models and hardware accelerators for efficient on-device inference
• In-depth understanding of the latest advancements in model compression, including techniques like knowledge distillation, network pruning, and hardware-aware quantization
• Experience working on resource-constrained embedded systems and deploying ML models on edge devices
• Demonstrated ability to influence technical strategy and mentor cross-functional teams
• Strong communication skills and the ability to effectively present complex technical concepts to both technical and non-technical stakeholders