What you will be doing:
In this role, you will research and develop techniques to accelerate top CSP workloads on NVIDIA’s computing platform, including advanced CPUs, GPUs, and interconnects.
Work directly with key customers to perform in-depth analysis and optimization of complex workloads to ensure the best possible performance on current and next-generation hardware.
Collaborate with libraries, tools, system software architecture, hardware, and research teams at NVIDIA to influence the design of next-generation programming models, software, and architectures.
What we need to see:
A Master's degree in Computer Science, Computer Engineering, or a related computationally focused science (or equivalent experience).
10+ years of relevant work or research experience.
Programming proficiency in C/C++ with a deep understanding of software design, programming techniques, and algorithms.
A background that includes parallel programming, ideally CUDA C/C++.
Hands-on experience with low-level performance optimization.
In-depth expertise with CPU and GPU architecture fundamentals.
Strong mathematical fundamentals, including linear algebra and numerical methods.
Good communication, organization, and prioritization skills, with a logical approach to problem solving.
You will also be eligible for equity and .