Job Description
Responsibilities
Research and implement techniques for model compression, quantization, and optimization
Conduct experiments to evaluate the impact of optimization methods on model accuracy, latency, and throughput
Collaborate with researchers and engineers to integrate optimizations into real-world machine learning workflows
Document findings and contribute to technical reports, blog posts, or research publications
Requirements
Currently pursuing a Ph.D. in Computer Science, Electrical Engineering, Machine Learning, or a related field
Strong programming skills in Python, with experience in deep learning frameworks such as PyTorch or TensorFlow
Familiarity with AI model optimization techniques such as quantization (e.g., INT4, FP8), pruning, and knowledge distillation
Strong analytical and problem-solving skills
Excellent communication skills and ability to work in a team-oriented research environment
Background in efficient inference techniques for large-scale language models or computer vision models
Prior experience contributing to open-source ML frameworks or research publications
Why work with us
Hands-on experience with state-of-the-art AI optimization research
Mentorship from leading experts in machine learning and model efficiency
Opportunity to contribute to research papers, patents, or open-source projects