What you’ll be doing:
Develop deep learning compiler
Develop highly optimized deep learning kernels
End-to-end performance optimization
Do performance optimization, analysis, and tuning
What we need to see:
Masters or PhD or equivalent experience in relevant discipline (CE, CS&E, CS, AI)
SW Agile skills helpful
Excellent C/C++ programming and software design skills
Python experience a plus
MLIR experience a plus
AI agent experience a plus
Performance modelling, profiling, debug, and code optimization or architectural knowledge of CPU and GPU
GPU programming experience (CUDA or OpenCL) desired
3 years of relevant work experience
משרות נוספות שיכולות לעניין אותך