Expoint – all jobs in one place

Nvidia Deep Learning Performance Software Engineer 
China, Shanghai 
842888733

Locations: China, Shanghai; China, Beijing
Time type: Full time
Posted on: Posted 20 Days Ago

What you’ll be doing:

  • Develop deep learning compilers

  • Develop highly optimized deep learning kernels

  • Drive end-to-end performance optimization

  • Conduct performance optimization, analysis, and tuning


What we need to see:

  • Master's or PhD, or equivalent experience, in a relevant discipline (CE, CS&E, CS, AI)

  • Software Agile skills are helpful

  • Excellent C/C++ programming and software design skills

  • Python experience a plus

  • MLIR experience a plus

  • AI agent experience a plus

  • Performance modeling, profiling, debugging, and code optimization, or architectural knowledge of CPUs and GPUs

  • GPU programming experience (CUDA or OpenCL) desired

  • 3 years of relevant work experience