What you will be doing:
Work with the latest in research, industry and closely with our most strategic partners and researchers to surface areas that stretch our hardware and software in unique ways.
Working closely with internal teams to identify the key gaps. Be abreast of the latest in AI systems infra - from kernel level optimizations to data center scale deployments.
Will include quick prototyping, architecting and crafting new features.
Working with engineering, research teams across all of NVIDIA to ensure flawless transition of concepts to the NVIDIA stack.
Conceptualize a solution across multiple facets - data center designs, networking, different model architectures, NVIDIA stack and deployment scenarios.
What we need to see:
Understanding of the latest in Deep Neural Networks, Large Language models, Multimodal and Scaling techniques.
10+ years proven experience in Deep Learning systems and infra and NVIDIA GPUs.
Excellent C/C++, Python programming and software design skills, including debugging, performance analysis, and test design.
Strong foundation in CPU and/or GPU architecture. Knowledge of high-performance computing and distributed programming.
Strong communication and interpersonal skills along with the ability to work in a dynamic and distributed team.
Doctoral degree in Computer Science, Computer Engineering, related field (or equivalent experience)
Ability to envision beyond what's possible right now.
Ways to stand out from a crowd:
Experience architecting or developing large-scale deep learning distributed systems
Knowledge of CPU and GPU architecture
GPU programming (CUDA)
You will also be eligible for equity and .
משרות נוספות שיכולות לעניין אותך