Key job responsibilities
What will you do?
- Quantize, prune, distill, finetune Gen AI models to optimize for edge platforms
- Use first principles of Information Theory, Scientific Computing, Deep Learning Theory, Non Equilibrium Thermodynamics
- Train custom Gen AI models that beat SOTA and paves path for developing production models
- 3+ years of building machine learning models for business application experience
- PhD, or Master's degree and 4+ years of applied research experience
- Experience programming in Java, C++, Python or related language
- Experience with neural deep learning methods and machine learning
- Prior experience in productionizing ML models and managing their full lifecycle from development to deployment
- Understanding and preferably hands-on experience with recent methods for inference optimization, including Mixture-of-Experts (MoE), Diffusion Models for Language Generation, etc.
משרות נוספות שיכולות לעניין אותך