Expoint – all jobs in one place
מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר
Limitless High-tech career opportunities - Expoint

Nvidia Deep Learning Performance Software Engineer 
China, Shanghai 
842888733

Today
China, Shanghai
China, Beijing
time type
Full time
posted on
Posted 20 Days Ago
job requisition id

What you’ll be doing:

  • Develop deep learning compiler

  • Develop highly optimized deep learning kernels

  • End-to-end performance optimization

  • Do performance optimization, analysis, and tuning


What we need to see:

  • Masters or PhD or equivalent experience in relevant discipline (CE, CS&E, CS, AI)

  • SW Agile skills helpful

  • Excellent C/C++ programming and software design skills

  • Python experience a plus

  • MLIR experience a plus

  • AI agent experience a plus

  • Performance modelling, profiling, debug, and code optimization or architectural knowledge of CPU and GPU

  • GPU programming experience (CUDA or OpenCL) desired

  • 3 years of relevant work experience