Expoint - all jobs in one place

Nvidia Senior System Software Engineer, Application Cluster
United States, Texas 
502003507

24.06.2024

What you'll be doing:

This role will provide leadership in the design and implementation of clusters that run demanding graphics applications, GPU simulations, and other computationally costly workloads. As a member of the software development team, you will identify architectural changes and recommend innovative approaches for our application clusters. As an expert, you will develop solutions to the strategic challenges we encounter, including simulation execution; GPU configuration; CI/CD; networking and storage design for large-scale, high-performance workloads; effective resource utilization in a heterogeneous compute environment; and capacity modeling and growth planning across our global computing environment.

What we need to see:

  • Bachelor’s degree in Computer Science, Computer Engineering, or a related field, or equivalent experience.

  • Minimum 5 years of experience designing and operating large-scale compute infrastructure.

  • Experience developing software in Microsoft Windows.

  • Strong Python programming capability.

  • Experience analyzing and tuning performance for a variety of application workloads.

Ways to stand out from the crowd:

  • Worked with a GPU cluster in a research lab or in a public cloud (AWS, Azure, Google Cloud).

  • Experience with job schedulers such as Nomad or Slurm.

  • Hands-on experience with cluster configuration management tools like Ansible.

  • Knowledge of modern software deployment systems like Kubernetes.

  • Understanding of fast, distributed storage systems and Linux or Windows file systems.

  • Proficiency programming against relational databases such as MySQL.

You will also be eligible for equity.