Expoint – all jobs in one place
The point where experts and best companies meet
Limitless High-tech career opportunities - Expoint

Nvidia Senior Deep Learning Compiler Engineer - CUDA 
China, Shanghai 
499624821

Today
China, Shanghai
China, Beijing
time type
Full time
posted on
Posted 5 Days Ago
job requisition id

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people.

What you'll be doing:
  • Design and implement the DSL and the core compiler of tile-aware GPU programming model for emerging GPU architectures

  • Continuously innovate and iterate on the core architecture of the compiler to consistently optimize performance

  • Investigation of next-generation GPU architectures and provide solutions in the DSL and compiler stack

  • Performance analysis on emerging AI/LLM workloads and integrate with AI/ML frameworks

What we need to see:
  • Masters or PhD or equivalent experience in relevant discipline (CE, CS&E, CS, AI)

  • 4 + years of relevant work experience

  • Excellent C/C++ programming and software engineering skills, ACM background is a plus

  • Good fundamental knowledges on computer architecture

  • Strong ability in abstracting problems and the methodology in resolving problems

  • Strong compiler backgrounds including MLIR/TVM/Triton/LLVM is desired

  • Good knowledge of GPU architecture and fast kernel programming skills is a plus

  • Knowledge of LLM algorithms or a certain HPC domain is a plus

  • Knowledge of multi-GPU distributed communication is a plus

  • Excellent oral communication in English is a plus