Share
What you'll be doing:
Develop AI performance tools for large scale AI systems providing real time insight into applications performance and system bottlenecks.
Conduct in-depth hardware-software performance studies
Define performance and efficiency evaluation methodologies
Automate performance data analysis and visualization to convert profiling data into actionable optimizations
Support deep learning software engineers and GPU architects in their performance analysis efforts
Work with various teams at NVIDIA to incorporate and influence the latest technologies for GPU performance analysis
What we need to see:
Minimum of 8+ years of experience insoftware infrastructure and tools
BS or higher degree in computer science or similar (or equivalent experience)
Adept programming skills in multiple languages including C++ and Python
Solid foundation in operating systems and computer architecture
Outstanding ability to understand users, prioritize among many contending requests, and build consensus
Passion for “it just works” automation, eliminating repetitive tasks, and enabling team members
Ways to stand out from the crowd:
Experience in working with the large scale AI cluster
Experience with CUDA and GPU computing systems
Hands-on experience with deep learning frameworks (TensorFlow, PyTorch, JAX/XLA etc.)
Deep understanding of the software performance analysis and optimization process
You will also be eligible for equity and .
These jobs might be a good fit