Expoint – all jobs in one place
Finding the best job has never been easier
Limitless High-tech career opportunities - Expoint

Nvidia Deep Learning Performance Architect 
China, Shanghai 
891061966

Today
China, Shanghai
China, Beijing
time type
Full time
posted on
Posted 21 Days Ago
job requisition id

What you’ll be doing:

  • Develop highly optimized deep learning kernels for inference

  • Do performance optimization, analysis, and tuning

  • Work with cross-collaborative teams across automotive, image understanding, and speech understanding to develop innovative solutions

  • Occasionally travel to conferences and customers for technical consultation and training

What we need to see:

  • Masters or PhD or equivalent experience in relevant discipline (CE, CS&E, CS, AI)

  • SW Agile skills helpful

  • Excellent C/C++ programming and software design skills

  • Python experience a plus

  • Performance modelling, profiling, debug, and code optimization or architectural knowledge of CPU and GPU

  • GPU programming experience (CUDA or OpenCL) desired

  • 3+ years of relevant work experience