The point where experts and best companies meet
Share
In this role, you will research and develop techniques to accelerate large scale applications running on NVIDIA’s family of advanced CPU platforms.
Work directly with other technical experts in their fields (industry and academia) to perform in-depth analysis and optimization of complex database and data analytics workloads to ensure the best possible performance on modern hardware architecture focused on CPU performance.
Publish and present discovered optimization techniques in developer blogs or relevant conferences to engage and educate the Developer community.
Influence the design of next-generation hardware architectures, software, and programming models in collaboration with research, hardware, system software, libraries, and tools teams at NVIDIA
A Masters or PhD in Computer Science, Computer Engineering, or related computationally focused science degree (or equivalent experience).
At least 6+ years of relevant work or research experience.
Expert knowledge of modern CPU architectures (ARM, x86) and system/OS
In-depth expertise with CPU architecture fundamentals, especially memory subsystem (cache DRAM, storage.)
Hands-on experience with low-level parallel programming, vectorization, CPU intrinsics and concurrent data structures.
Programming fluency in modern C/C++ with a deep understanding of algorithms, concurrency, and other optimization techniques.
Good communication and organization skills, with a logical approach to problem solving, and prioritization skills.
Experience optimizing the performance of distributed database systems and frameworks (e.g. production database or Spark).
Background with compression, storage systems, networking, and distributed computer architectures.
Knowledge of GPU architectures
You will also be eligible for equity and .
These jobs might be a good fit