The point where experts and best companies meet
Share
What you’ll be doing:
Define the Infiniband and NVL system architecture end-to-end, by internal requirements and customers requirements through all product life cycles (post/pre silicon, to deployments).
research of various solutions to enable the nextlarge-scale-high-performancecomputing clusters. The position spans over various layers from algorithms, software, firmware, and HW.
Collaborate with cross-functional teams, including other architecture teams, logic design, system software, firmware, and research teams, to ensure the successful execution of the project.
What we need to see:
B.Sc, M.Sc, or Ph.D degree in Computer Science, Computer Engineer, or Electrical Engineer or equivalent experience.
5+ years of industry or research experience in computer networks.
Excellent understanding of large-scale networks behavior and the effect of distributed computing workloads effect on the network.
Experience in development of simulation environments.
Possess strong managerial, problem solving and critical thinking skills.
Ability to work and operate in a highly dynamic environment.
Partner with multiple groups in the organization.
Ways to stand out of the crowd:
Good knowledge in network protocols - such as InfiniBand, IP, TCP and RoCE and network topologies.
Good knowledge in Python, C++.
Familiarity with HPC environments, routing algorithms, Omnet++ and NS3 simulation environments.
Experience with AI workloads such as LLM and DLRM and familiarity with communication libraries like NCCL
These jobs might be a good fit