Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

Nvidia AI K8s Infrastructure Generalist 
United States, Texas 
66987032

01.12.2024

What you'll be doing:

  • As part of Maglev AI infrastructure and SRE team you will propose and craft new ways to improve availability of our Cloud Native AI Platform by automating critical processes on the multiple distributed GPU clusters

  • The solutions you propose and build will impact directly efficiency of the NVIDIA Autonomous Vehicles Perception development team!

What we need to see:

  • BS or MS in the CS/CE/EE or equivalent experience

  • At least 6+ years of the k8s experience on-prem and in the cloud

  • At least 4 years building automation software APIs for the large scale computing clusters and data platforms

  • You can help our team to develop a better stack of Go and Python automation APIs

  • Complete understanding of the Kubernetes and Cloud Native Architecture and working experience with k8s clusters

  • Expertise at problem solving and complexity analysis of the distributed systems

  • Proficiency with Linux environment

  • Excellent written and verbal interpersonal skills

  • You'll be a fun and motivated teammate who enjoys a challenge and celebrates success

Ways to stand out from the crowd:

  • Previous experience with building sophisticated tooling and SRE automation on the large 100+ nodes GPU/CPU clusters

  • DevSecOps experience with good understanding of the cloud security concepts

  • Deep knowledge of the networking layers and fundamentals

For two decades, we have pioneered visual computing, the art and science of computer graphics. With our invention of the GPU - the engine of modern visual computing - the field has expanded to encompass video games, movie production, product design, medical diagnosis and scientific research. Today, we stand at the beginning of the new AI computing era, ignited by a new computing model, GPU deep learning. This new model - where deep neural network is trained to recognize patterns from extensive amounts of data - has shown to be deeply effective at solving some of the most adventurous problems in everyday life.

You will also be eligible for equity and .