Expoint - all jobs in one place

NVIDIA Deep Learning Performance Architect
Shanghai, China
400107882

04.04.2024

What you’ll be doing:

  • Develop highly optimized deep learning kernels for inference

  • Carry out performance analysis, optimization, and tuning

  • Collaborate with cross-functional teams across automotive, image understanding, and speech understanding to develop innovative solutions

  • Occasionally travel to conferences and customers for technical consultation and training

What we need to see:

  • Master's or PhD, or equivalent experience, in a relevant discipline (CE, CS&E, CS, AI)

  • Agile software development skills helpful

  • Excellent C/C++ programming and software design skills

  • Python experience a plus

  • Experience with performance modeling, profiling, debugging, and code optimization, or architectural knowledge of CPUs and GPUs

  • GPU programming experience (CUDA or OpenCL) desired

  • 5+ years of relevant work experience