Expoint – all jobs in one place
המקום בו המומחים והחברות הטובות ביותר נפגשים
Limitless High-tech career opportunities - Expoint

Nvidia Senior Solutions Architect Infiniband Networking Ethernet - NVIS 
India, Karnataka, Bengaluru 
150473023

Yesterday
India, Bengaluru
time type
Full time
posted on
Posted 30+ Days Ago
job requisition id

What you'll be doing:

  • Primary responsibilities will include building AI/HPC infrastructure for new and existing customers.

  • Support operational and reliability aspects of large-scale AI clusters, focusing on performance at scale, real-time monitoring, logging, and alerting.

  • Engage in and improve the whole lifecycle of services—from inception and design through deployment, operation, and refinement.

  • Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.

  • Provide feedback to internal teams such as opening bugs, documenting workarounds, and suggesting improvements.

What we need to see:

  • BS/MS/PhD or equivalent experience in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or related fields.

  • At least 5+ years of professional experience in networking fundamentals, TCP/IP stack, and data center architecture

  • Proficiency in configuring, testing, validating, and resolving issues in LAN and InfiniBand networks, especially in medium to large-scale HPC/AI environments.

  • Advanced knowledge of EVPN, BGP, OSPF, VXLAN protocols.

  • Hands-on experience with network switch/router platforms like Cumulus Linux, SONiC, IOS, JunosOS, and EOS.

  • Extensive experience delivering automated network provisioning solutions using tools like Ansible, Salt, and Python.

  • Ability to develop CI/CD pipelines for network operations.

  • Strong focus on customer needs and satisfaction.

  • Self-motivated with leadership skills to work collaboratively with customers and internal teams.

  • Strong written, verbal, and listening skills in English are essential.

Ways to stand out from the crowd:

  • Familiarity with cloud networks (AWS, GCP, Azure) is a plus.

  • Linux or Networking Certifications.

  • Experience with High-performance computing architectures. Understanding of how job schedulers (Slurm, PBS) work.

  • luster management technologies knowledge (bonus credit for BCM (Base Command Manager).)

  • Experience with GPU (Graphics Processing Unit) focused hardware/software.