Share
What you'll be doing:
Innovate and develop new machine learning compiler and systems technologies
Design, implement, and optimize compilers for high impact AI workloads
Building efficient just-in-time domain specific compiler and runtime for high impact workloads in generative AI
Co-design learning system solutions with current and future ML compiler and algorithm technologies.
Collaborate closely with other engineering teams at NVIDIA to build high impact solutions for machine learning acceleration
What we need to see:
Bachelor's degree in Computer Science, Electrical Engineering, or related field (or equivalent experience); MS or PhD are preferred
4+ years (academic/ industry) experience with ML/DL systems development preferable for compilers
Strong experience in developing or using deep learning frameworks (e.g. PyTorch, JAX, TensorFlow, ONNX etc)
Strong python and C/C++ programming skills
Expertise in AI frameworks such as PyTorch, TensorFlow, and ONNX
The Crowd:
Expertise in machine compilers (e.g. Apache TVM, MLIR)
Expertise in domain specific compiler and library solutions for LLM inference and training (e.g. FlashInfer, Flash Attention)
Strong experience in GPU performance optimizations
Strong experience machine learning systems research and productization
Open source project ownership or contributions
You will also be eligible for equity and .
These jobs might be a good fit