

computing for more than 25 years.a unique legacy of innovationfueled by great technology—and amazing people. Today,
You will define how AI models are deployed and scaled in production using the NVIDIA Spectrum-X Networking Platform, influencing decisions from inter-node communication and
Be Doing:
Lead research and development of end-to-end networking solutions for distributed AI training and inference at scale, with a focus on job completion time, failure resiliency, telemetry, scheduling, andplacement.
Analyze current deployments, develop prototypes, and recommend architectural improvements.
Stay abreast of the latest research; become the team’s authority in emerging networking techniques and technologies.
Design, simulate, and validate new systems using novel, scalable network simulator NSX.
Develop and test prototypes on large-scale GPU clusters (e.g., Israel-1).
Collaborate across hardware, firmware, and software teams to translate ideas into real networking product features.
Publish patents and present research at leading conferences.
What We Need to See:
M.Sc. or PhD (preferred) in Computer Science, Electrical/Computer Engineering, or related field—or B.Sc. with research experience andpublications.
5+ years of relevant experience.
Deep expertise in networking and communication internals (NCCL, RDMA, congestion control, routing).
Strong software engineering skills in C++ and/or Python.
Excellent system-level design and problem-solving abilities.
Outstanding communication and collaboration skills across technical domains.
Ways to Stand Out from the Crowd:
Proven passion for solving sophisticated technical problems and delivering impactful solutions.
Record of publications in top-tier conferences.
Experience in designing and building large-scale AI training clusters.
Post-PhD research experience
Practical understanding of deep learning systems, GPU acceleration, and AI model execution flows.
משרות נוספות שיכולות לעניין אותך

What you’ll be doing:
Crafting and developing enterprise-grade systems with a strong focus on scalability, reliability, and performance.
Building and optimizing microservices-based architectures using Kubernetes and cloud-native technologies.
Collaborating closely with backend engineers, product managers, and other partners to deliver impactful solutions.
Writing clean, maintainable, and testable code in Go, contributing to our CI/CD pipelines.
Conducting code and build reviews to uphold high-quality standards and mentor team members.
Leading the development and implementation of advanced identity management systems that secure NVIDIA’s innovative AI and GPU cloud.
Developing scalable multi-tenant solutions that allow our diverse clientele to harness the power of NVIDIA’s platforms securely and efficiently.
Collaborating with multi-functional teams to integrate identity and access management features seamlessly into our products, from cloud services to edge computing devices.
What we need to see:
B.Sc. in Computer Science or a related field (or equivalent experience).
5+ years of experience
Experience in backend software development, including system design and architecture.
Proficiency in at least one backend programming language (Go preferred).
Strong knowledge in microservices architecture, RESTful APIs, and relational databases.
Proficient knowledge of security guidelines and experience applying them in large-scale systems.
Expertise in implementing OAuth, OIDC, SAML, and other modern authentication protocols - Advantage
Ways to stand out from the crowd:
Expertise in Kubernetes internals and advanced cloud-native technologies.
Experience working in Linux environments with knowledge of networking, security, and virtualization.
Contributions to open-source projects or active participation in tech communities.
Agile approach and familiarity with standard methodologies.
משרות נוספות שיכולות לעניין אותך

What you’ll be doing:
Enhance NVIDIA's GPU Networking offerings for accelerating AI workloads, such as NVIDIA Dynamo or NVIDIA NIXL.
Develop and evaluate new technologies, innovations relevant for scientific, Deep Learning, and data-intensive workloads.
Create proof-of-concept to evaluate and drive such new technologies.
Work on impactful projects involving state-of-the-art high-performance computing software and hardware.
Designing and implementing services, runtime systems, and applications over SDK
Partner and collaborate with other forward-thinking team members and external researchers
What we need to see:
Hold a B.Sc. or M.Sc. or Ph.D. in Computer Science, Electrical or Computer Engineering from a leading university.
0-2 years of industry experience (or equivalent) in system programming or related fields.
Background in algorithm design, system programming, and computer architecture.
Strong programming and software development skills.
A teammate with a can-do attitude, high energy and excellent interpersonal skills.
Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.
Ways to stand out from the crowd:
Proven research track record.
Experience and passion for system architecture,CPU/GPU/Memory/Storage/Networking.
Stellar communication skills.
Knowledge in Deep Learning frameworks and AI communication libraries (NCCL, UCX, MPI and equivalents).
משרות נוספות שיכולות לעניין אותך

What you'll be doing:
Developing a brand new digital twin powered by CUDA technology for advanced research on data centers.
Collaborating with a team of extraordinary engineers and researchers to develop and implement innovative solutions.
Partnering with diverse teams to ensure smooth integration and deployment of network solutions.
Continuously exploring new technologies and methodologies to improve our network capabilities.
What we need to see:
Bachelor’s degree in Computer Science, Electrical Engineering, or a related field.
5+ years of experience in computer science, network engineering or related fields.
Excellent problem-solving skills.
Outstanding collaboration and communication skills.
Ways to Stand Out From the Crowd:
Hands-on experience developing CUDA applications
Extensive knowledge of network protocols
Proficient knowledge of extensive network simulations and AI datacenter ecosystems.
משרות נוספות שיכולות לעניין אותך

What you’ll be doing:
Develop NVIDIA's GPU Networking offerings for accelerating AI workloads, such as NVIDIA Dynamo or NVIDIA NIXL
Develop novel HW architecture models and SDKs for them
Simulations ranging from specific components to complete data center environments
Designing and implementing services, runtime systems, and applications over SDK
Evaluate and optimize application performance
Partner and collaborate with other forward-thinking team members and external researchers
Participate and speak at conferences and events
Work with intelligent networking machines powered by AI systems that can learn, reason and interact with other network components
What we need to see:
Student for BSc/MSc/PhD in Electrical Engineering, Computer, Science/Engineering,Math/Physics/Statisticsor a related field
Knowledge in networking, operating systems, accelerator programming, systems and AI training and inference
Track record of research excellence
Good communications skills
Please include your internship availability in your application
משרות נוספות שיכולות לעניין אותך

The role of a Senior Software Engineer in the Platform Group is to design and develop scalable, high-performance systems that support the next generation of AI workloads. You will collaborate with experts across domains, tackle complex challenges, and drive innovations that empower our users to push the limits of AI capabilities.
What you’ll be doing:
Designing and developing enterprise-grade systems with a strong focus on scalability, reliability, and performance.
Building and optimizing microservices-based architectures using Kubernetes and cloud-native technologies.
Collaborating closely with backend engineers, product managers, and other collaborators to deliver impactful solutions.
Writing clean, maintainable, and testable code in Go
Conducting code and design reviews to uphold high-quality standards and mentor team members.
What we need to see:
B.Sc. in Computer Science or a related field.
5+ years of experience in backend software development, including system design and architecture.
Proficiency in at least one backend programming language (We write in Go).
Strong understanding of microservices architecture, RESTful APIs, and relational databases.
Deep familiarity with Kubernetes and the cloud-native ecosystem.
Demonstrated ability to tackle complex technical challenges and deliver high-quality solutions.
Ways to stand out from the crowd:
Expertise in Kubernetes internals and advanced cloud-native technologies.
Hands-on experience with HPC or AI/ML platforms.
Familiarity with AI inference workloads and performance optimization.
Proficiency in Linux, with knowledge in networking, security, storage, and virtualization.
משרות נוספות שיכולות לעניין אותך

What you'll be doing:
Design and execute performance benchmarks using industry-standard tools (e.g., MLPerf, UCX, NVIDIA Collective Communications Library - NCCL and CloudAI) andcustomer-representativeAI workloads on our state-of-the-art GPU clusters.
Translate your benchmark data and technical insights into compelling, high-impact marketing assets and performance-driven sales enablement materials
Collaborate closely with Product Management, ASIC and Software architecture and Sales teams, provide feedback on product features, and ensure our performance results are technically accurate and impactful
What we need to see:
B.Sc in Computer Science or Software Engineering or equivalent experience
5+ years of experience benchmarking and analyzing high‑performance networking solutions, including RDMA, MPI, and large‑scale collective communication frameworks.
Hands‑on expertise in testing and benchmarking deep learning workloads on NVIDIA GPUs with CUDA, TensorFlow, and PyTorch, focused on validating and demonstrating distributed training and inference performance over NCCL, RoCE, and RDMA.
Shown proficiency in Performance Analysis methodologies and techniques.
Understanding of Ethernet and high-performance networking.
Programming experience with Python, Bash and C languages.
Experience with distributed job orchestration (Slurm, Kubernetes).
Experience with Linux OS distros.
Fast and self-learning capabilities with strong analytical and problem-solving skills.
In-depth knowledge and experience with AI workloads and benchmarking for large-scale distributed training/inference systems.
Ways to stand out from the crowd:
Strong Performance Analysis skills and methodologies using modern tools.
Deep knowledge in AI/Data Center Ethernet networks protocols and best-practices (Clos fabrics, BGP, VXLAN, etc.).
Hands-on experience with automation, CI/CD pipelines and DevOps practices.
Expertise in AI fabrics telemetry including metrics capturing and analysis as well as telemetry tools such as Prometheus and Grafana.
In-depth System knowledge and understanding (Intel / AMD / ARM CPUs, NVIDIA GPUs, NIC, Memory, PCI)
NVIDIA has reshaped computer graphics, PC gaming, and accelerated computing for over 25 years. We have an outstanding legacy of innovation that’s powered by great technology—and outstanding people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world.
משרות נוספות שיכולות לעניין אותך

computing for more than 25 years.a unique legacy of innovationfueled by great technology—and amazing people. Today,
You will define how AI models are deployed and scaled in production using the NVIDIA Spectrum-X Networking Platform, influencing decisions from inter-node communication and
Be Doing:
Lead research and development of end-to-end networking solutions for distributed AI training and inference at scale, with a focus on job completion time, failure resiliency, telemetry, scheduling, andplacement.
Analyze current deployments, develop prototypes, and recommend architectural improvements.
Stay abreast of the latest research; become the team’s authority in emerging networking techniques and technologies.
Design, simulate, and validate new systems using novel, scalable network simulator NSX.
Develop and test prototypes on large-scale GPU clusters (e.g., Israel-1).
Collaborate across hardware, firmware, and software teams to translate ideas into real networking product features.
Publish patents and present research at leading conferences.
What We Need to See:
M.Sc. or PhD (preferred) in Computer Science, Electrical/Computer Engineering, or related field—or B.Sc. with research experience andpublications.
5+ years of relevant experience.
Deep expertise in networking and communication internals (NCCL, RDMA, congestion control, routing).
Strong software engineering skills in C++ and/or Python.
Excellent system-level design and problem-solving abilities.
Outstanding communication and collaboration skills across technical domains.
Ways to Stand Out from the Crowd:
Proven passion for solving sophisticated technical problems and delivering impactful solutions.
Record of publications in top-tier conferences.
Experience in designing and building large-scale AI training clusters.
Post-PhD research experience
Practical understanding of deep learning systems, GPU acceleration, and AI model execution flows.
משרות נוספות שיכולות לעניין אותך