What you’ll be doing:
Develop and maintain NVIDIA’s L2/L3/L4 autonomous driving solutions, and optimize their latency and throughput
Devise acceleration strategies and patterns to improve software architecture and its efficiency on our computers with multiple heterogeneous hardware engines while meeting or exceeding product goals
Develop highly efficient product code in C++, making use of the algorithmic parallelism offered by GPGPU programming (CUDA) and ARM NEON, while following quality and safety standards such as those defined by MISRA
Diagnose and fix performance issues reported on our target platform, both on-road and in simulation
What we need to see:
BS/MS or higher in computer science or a related engineering field
Excellent C and C++ programming skills
5+ years of relevant proven experience
Solid understanding of programming and debugging techniques, especially for parallel architectures
Good understanding of System SW / Operating Systems and Computer architecture
Experience with performance analysis, optimizations and benchmarking
Excellent analytical skills and strong written and verbal communication skills
Ways to stand out from the crowd:
Understanding of Embedded architectures and Real-time operating systems & scheduling
Strong mathematical fundamentals, including linear algebra and numerical methods
Experience implementing algorithms in Robotics, Computer Vision and/or Machine Learning
Software development experience with CUDA/GPGPU or any data parallel architectures