Expoint - all jobs in one place

The point where experts and best companies meet

Limitless High-tech career opportunities - Expoint

Amazon Principal Applied Scientist Hardware Silicon Systems Group 
United States, California, Sunnyvale 
261165213

Yesterday
DESCRIPTION

Key job responsibilities
As a Principal Applied Scientist, you will:
• Own the technical architecture and optimization strategy for ML models deployed across Amazon's device ecosystem, from existing to yet-to-be-shipped products.
• Develop novel model architectures optimized for our custom silicon, establishing new methodologies for model compression and quantization.
• Create an evaluation framework for model efficiency and implement multimodal optimization techniques that work across vision, language, and audio tasks.
• Define technical standards for model deployment and drive research initiatives in model efficiency to guide future silicon designs.
• Spend the majority of your time doing deep technical work - developing novel ML architectures, writing critical optimization code, and creating proof-of-concept implementations that demonstrate breakthrough efficiency gains.
• Influence architecture decisions impacting future silicon generations, establish standards for model optimization, and mentor others in advanced ML techniques.

BASIC QUALIFICATIONS

This role requires a blend of expertise at the intersection of ML and hardware optimization. You must be an expert in model training, with deep knowledge of cutting-edge architectures for vision, language, and multimodal tasks. Crucially, you need to be a specialist in hardware-aware quantization, with hands-on experience in model compression techniques like pruning and distillation. A strong background in computer architecture and familiarity with ML accelerator designs is essential, as is expertise in efficient inference algorithms and low-precision arithmetic.
Basic Qualifications:
• Advanced degree (PhD preferred) in Computer Science, Electrical Engineering, or a related technical field
• 8+ years of experience in machine learning, with a focus on model architecture design, optimization, and deployment
• Expertise in developing and deploying deep learning models for real-world applications, including vision, language, and multimodal tasks
• Strong background in computer architecture, hardware acceleration, and efficient inference algorithms
• Hands-on experience with model compression techniques such as pruning, quantization, and distillation
• Proficiency with deep learning frameworks like TensorFlow, PyTorch, or ONNX


PREFERRED QUALIFICATIONS

• PhD in Computer Science, Electrical Engineering, or a related technical field
• 10+ years of experience in machine learning, with a track record of developing novel model architectures and optimization techniques
• Proven expertise in co-designing ML models and hardware accelerators for efficient on-device inference
• In-depth understanding of the latest advancements in model compression, including techniques like knowledge distillation, network pruning, and hardware-aware quantization
• Experience working on resource-constrained embedded systems and deploying ML models on edge devices
• Demonstrated ability to influence technical strategy and mentor cross-functional teams
• Strong communication skills and the ability to effectively present complex technical concepts to both technical and non-technical stakeholders