What you will be doing:
Perform network simulations of communication patterns prevalent in AI applications, such as those generated by NCCL.
Design and implement new techniques and protocols to accelerate communication performance.
Explore innovative HW and SW solutions for our next-generation platforms as part of the programmable RoCE architecture.
Build proofs-of-concept, conduct experiments, and perform quantitative modeling to evaluate and drive new innovations.
Use simulation to explore the performance of AI applications on large GPU clusters.
What we need to see:
M.S./Ph.D. degree in CS/CE or equivalent experience.
5+ years of relevant experience.
Excellent C/C++ programming and debugging skills.
Experience with network simulations.
Deep understanding of RDMA.
Proven grasp of the fundamentals of compute, network architecture, and operating systems.
Strong experience with Linux.
Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.
Ways to stand out from the crowd:
Expertise in related technology and passion for what you do.
Experience with NCCL collectives, AI communication patterns, and parallelization techniques.
Strong collaborative and interpersonal skills and a proven track record of effectively guiding and influencing within a dynamic and multi-functional environment.
You will also be eligible for equity.