The point where experts and best companies meet
Share
What You’ll Be Doing:
Develop highly efficient and low cost AI models and algorithms for computer vision and video AI
Optimize the performance, latency and power consumption of AI models on low power processors for deep learning acceleration
Deploying deep learning models and optimize the inference stack for real-time performance
Deliver the benefits of NVIDIA’s latest hardware and platform software innovations to the Deep Learning
Closely collaborate with different deep learning software and hardware teams across NVIDIA to influence roadmaps and deliver solutions
What We Need To See:
Strong experience of building and optimizing AI model architectures for fast inference such as model pruning, distillation, post-quantization and quantization aware training
Experience with analyzing and fine-tuning deep learning pipeline performance
Experience with building real-time AI models for laptop and cloud use cases
5+ years of relevant engineering or research background in deep learning and/or computer vision
Hands-on development skills using deep learning libraries and frameworks such asPyTorch/TensorFlow/ONNX,TensorRT/Triton/WinMLand other neural processing SDKs
Collaboration ability to define project scope and roadmap together with the team while independently drive development effort with strong self-motivation
MS, or PhD in Computer Science, Computer Engineering, or closely related field or equivalent experience
Ways to stand out from the crowd:
Experience with AI inference accelerating hardware and building/optimizing models on them
Background with performance and latency analysis, profiling and tuning of AI workloads
Experience with CUDA programming, as well as a real passion for optimizing AI system performance
Experience of building platforms for computer vision such as real-time tracking of human face, gaze and body, as well as avatar animation and modeling.
You will also be eligible for equity and .
These jobs might be a good fit