Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

Nvidia Senior Machine Learning Engineer Quantized Training 
United States, Texas 
90067676

01.09.2024

NVIDIA is seeking machine learning engineers to support next-generation recipes for mixed-precision training. In this role you will (1) distill LLM research literature into its core, (2) translate literature into experiments at scale, (3) create insights to support or refute the efficacy of a technique, and (4) generate reproducible training recipes.

What you'll be doing:

  • Review state-of-the-art literature in quantized training

  • Build robust, reproducible, and portable training recipes

  • Provide engineering support to customers using HW and SW approaches

  • Collaborate closely with hardware, software, and research teams to assess and adopt deep learning algorithmic advancements in quantization

  • Work with production SW teams to realize recipes in production workflows

What we need to see:

  • Experience with PyTorch or similar frameworks such as jax/xla/etc

  • Proficient in the math of machine learning

  • Familiarity with FP8 for training

  • Published research or significant contributions to the field of AI, particularly in algorithm development for hardware-software co-design

  • PhD, M.S. degree or equivalent experience in Computer Science or a related field

  • 5+ YoE working in ML / AI

  • Strong written and oral communication skills

  • Strong programming skills and ability to debug ML systems

Ways to stand out from the crowd:

  • Experience in LLM training, fine-tuning and optimization (quantization, sparsity)

  • Familiarity with MX formats for training

  • Experience with Transformer Engine, Megatron-LM, or NeMo

GPU computing is the most productive and pervasive platform for deep learning and AI. It begins with the most advanced GPUs and the systems and software we build on top of them. We integrate and optimize every deep learning framework. We work with the major systems companies and every major cloud service provider to make GPUs available in data centers and in the cloud. We craft computers and software to bring AI to edge devices, such as self-driving cars and autonomous robots. AI has the potential to spur a wave of social progress unmatched since the industrial revolution.

You will also be eligible for equity and .