

computing for more than 25 years.a unique legacy of innovationfueled by great technology—and amazing people. Today,
You will define how AI models are deployed and scaled in production using the NVIDIA Spectrum-X Networking Platform, influencing decisions from inter-node communication and
Be Doing:
Lead research and development of end-to-end networking solutions for distributed AI training and inference at scale, with a focus on job completion time, failure resiliency, telemetry, scheduling, andplacement.
Analyze current deployments, develop prototypes, and recommend architectural improvements.
Stay abreast of the latest research; become the team’s authority in emerging networking techniques and technologies.
Design, simulate, and validate new systems using novel, scalable network simulator NSX.
Develop and test prototypes on large-scale GPU clusters (e.g., Israel-1).
Collaborate across hardware, firmware, and software teams to translate ideas into real networking product features.
Publish patents and present research at leading conferences.
What We Need to See:
M.Sc. or PhD (preferred) in Computer Science, Electrical/Computer Engineering, or related field—or B.Sc. with research experience andpublications.
5+ years of relevant experience.
Deep expertise in networking and communication internals (NCCL, RDMA, congestion control, routing).
Strong software engineering skills in C++ and/or Python.
Excellent system-level design and problem-solving abilities.
Outstanding communication and collaboration skills across technical domains.
Ways to Stand Out from the Crowd:
Proven passion for solving sophisticated technical problems and delivering impactful solutions.
Record of publications in top-tier conferences.
Experience in designing and building large-scale AI training clusters.
Post-PhD research experience
Practical understanding of deep learning systems, GPU acceleration, and AI model execution flows.
משרות נוספות שיכולות לעניין אותך

What You’ll Be Doing:
In the position of Senior QA Software Engineer, you will have a key role in assuring our products meet high-quality standards! Your tasks will consist of:
Collaborating on projects involving ITU-T standards like G.8273.2 Class-C/D Boundary Clocks.
Applying your deep knowledge of communication network standards to validate and improve product quality.
Testing and verifying clock synchronization protocols, including Synchronous Ethernet (SyncE) and Precision Time Protocol (PTP).
Defining and implementing test topologies and setups to ensure comprehensive product coverage.
Participating in requirements, build, and feature reviews, providing QA insights and feedback.
Engaging with customers, presenting solutions, assisting in debugging on customer environments, and supporting integration efforts.
Staying up-to-date with emerging networking standards, features, and technologies to continually expand QA coverage.
Implementing and composing manual and automated firmware and system-level tests in a Linux environment.
What We Need to See:
B.Sc. in Computer Science, Software Engineering, or equivalent.
5+ years of practical experience working with telecommunication network protocols.
Strong networking and system-level testing background.
Solid programming skills (preferably in Python, C, or C++).
Excellent analytical, troubleshooting, and problem-solving skills.
High proficiency in English (spoken and written).
Self-motivated, proactive, and able to work independently while contributing to team goals.
Ways to Stand Out from the Crowd:
Proven experience as a Telecommunication QA Engineer and Python scripting and automation skills.
Familiarity with ITU-T standards and synchronization protocols, such as SyncE and PTP or equivalent experience.
Familiarity with SSM (Synchronous Status Message) or ESMC (Ethernet Synchronization Message Channel) testing.
Experience in noise generation and network security standards testing.
Demonstrated success working directly with customers in integration or debugging scenarios.

What You'll Be Doing
Experimental Root Cause Analysis: Design and build targeted experiments from the ground up to replicate complex hardware behaviors observed in AI data centers. You will then analyze the results to hunt down the root cause of these behaviors, whether in silicon, firmware, or software.
Hands-On Lab Investigation: Spend your time in the lab, working directly with the most advanced networking ASICs and systems to profile performance and characterize behavior under stress.
Test Automation and Development: Write and debug advanced automation scripts (Python) to programmatically control traffic generators (e.g., IXIA) and manipulate the test environment to expose corner-case issues.
Lab Environment Management: Maintain and support our lab environment, including equipment setup and racking, procurement, inventory, and coordinating maintenance and upgrades to support ongoing investigations .
What We Need to See
B.S.c in Engineering/Computer Science or equivalent experience with a strong foundation in hardware-software interaction
5+ years of deep hands-on lab experience focused on hardware validation, testing, and performance-tuning.
Strong proficiency in Python for test automation, hardware diagnostics, and data analysis.
Familiarity with basic networking concepts (Ethernet, Routing) and large-scale network design.
Proven ability to collaborate effectively with multi-functional teams, including hardware, software, and architecture groups.
Curiosity and a problem-solving approach, driven to understand how things work at the fundamental level.
Ways to Stand Out From the Crowd
Expertise in Ethernet protocols, L2/L3 routing, and large-scale data center network topologies.
Proficient in scripting tools for traffic generation (e.g., IXIA, Spirent) to compose intricate traffic scenarios, rather than simply running pre-existing scripts.
Expertise in validating and stress-testing network systems at the component level (e.g., NICs, Switches), with a focus on hardware diagnostics beyond standard protocol testing.
Familiarity with the unique network architectures and operational challenges of large-scale AI, HPC, or hyperscale data center environments (e.g., RDMA/RoCE, congestion control, high-radix fabrics).
Hands-on experience in configuring and managing datacenter network equipment.

What you’ll be doing:
Develop first tier features, with groundbreaking multi-protocol networking technology.
Lead features from planning through design and development, until delivery to the customer.
Work closely with other development teams, arch and verification to ensure features delivery on time with high quality.
Gain deep understanding of NVIDIA products and technologies.
What we need to see:
B.Sc. degree or equivalent experience in Engineering/Computer Science/related field.
At least 5 years experience in development positions in the industry.
C programming experience - must, Python programming experience- an advantage
High technical understanding and learning skills – specification, design, programming, integration and debugging abilities
Self-motivated, ability to work with little definition and supervision while multi-tasking and prioritizing across a number of projects and initiatives
Experience with testing methodologies, some tasks will include developing sophisticated fully automated testing environment
Excellent English communication and leading skills
Ways to stand out from the crowd:
Experience in a Ethernet switching product development, Routing / Bridging protocols knowledge
Experience in a multi-functional team and collaborate with teams in oversea sites.
Linux networking knowledge, TCP/IP stack



What you'll be doing:
Developing a brand new digital twin powered by CUDA technology for advanced research on data centers.
Collaborating with a team of extraordinary engineers and researchers to develop and implement innovative solutions.
Partnering with diverse teams to ensure smooth integration and deployment of network solutions.
Continuously exploring new technologies and methodologies to improve our network capabilities.
What we need to see:
Bachelor’s degree in Computer Science, Electrical Engineering, or a related field.
5+ years of experience in computer science, network engineering or related fields.
Excellent problem-solving skills.
Outstanding collaboration and communication skills.
Ways to Stand Out From the Crowd:
Hands-on experience developing CUDA applications
Extensive knowledge of network protocols
Proficient knowledge of extensive network simulations and AI datacenter ecosystems.

computing for more than 25 years.a unique legacy of innovationfueled by great technology—and amazing people. Today,
You will define how AI models are deployed and scaled in production using the NVIDIA Spectrum-X Networking Platform, influencing decisions from inter-node communication and
Be Doing:
Lead research and development of end-to-end networking solutions for distributed AI training and inference at scale, with a focus on job completion time, failure resiliency, telemetry, scheduling, andplacement.
Analyze current deployments, develop prototypes, and recommend architectural improvements.
Stay abreast of the latest research; become the team’s authority in emerging networking techniques and technologies.
Design, simulate, and validate new systems using novel, scalable network simulator NSX.
Develop and test prototypes on large-scale GPU clusters (e.g., Israel-1).
Collaborate across hardware, firmware, and software teams to translate ideas into real networking product features.
Publish patents and present research at leading conferences.
What We Need to See:
M.Sc. or PhD (preferred) in Computer Science, Electrical/Computer Engineering, or related field—or B.Sc. with research experience andpublications.
5+ years of relevant experience.
Deep expertise in networking and communication internals (NCCL, RDMA, congestion control, routing).
Strong software engineering skills in C++ and/or Python.
Excellent system-level design and problem-solving abilities.
Outstanding communication and collaboration skills across technical domains.
Ways to Stand Out from the Crowd:
Proven passion for solving sophisticated technical problems and delivering impactful solutions.
Record of publications in top-tier conferences.
Experience in designing and building large-scale AI training clusters.
Post-PhD research experience
Practical understanding of deep learning systems, GPU acceleration, and AI model execution flows.
משרות נוספות שיכולות לעניין אותך