NVIDIA Senior Software Engineer, AI Infrastructure
United States, California 
264044154

Yesterday
US, CA, Santa Clara
Time type: Full time
Posted on: 26 days ago
Job requisition ID:

The platform enables users to rapidly build, train, and deploy large-scale AI models across leading cloud providers like Oracle, Azure, and Google Cloud, eliminating the complexity of managing their own infrastructure. Key features include pre-trained and fine-tunable models, serverless GPU inference, and a unified interface for multi-cloud management.

What you'll be doing:

You will build RESTful cloud services and virtualization frameworks that together form our NVIDIA DGX Cloud Reference Architecture. These services must meet requirements for high security and maximum performance to support extensive AI workloads.

  • Design, build, and implement scalable cloud-based systems for PaaS/IaaS.

  • Work closely with other teams on new products or on features and improvements to existing products.

  • Drive performance tuning and automation.

  • Support, maintain, and document software functionality.

What we need to see:

  • Expertise in Kubernetes (K8s) & KubeVirt.

  • Expertise in virtualization technologies such as Firecracker, KVM, OpenStack, Nutanix AHV & Red Hat OpenShift.

  • Extensive experience with Golang and building RESTful web services.

  • Demonstrated understanding of cloud design in the areas of virtualization, global infrastructure, distributed systems, and security.

  • Experience with Docker and Containers.

  • Background with Infrastructure as Code.

  • Experience with AWS (Fargate, EC2, IAM, ECR, EKS, Route 53, etc.).

  • Experience with Continuous Integration and Continuous Delivery.

  • BS or MS in Computer Science or equivalent experience, with 12+ years of hands-on software engineering.

  • Excellent interpersonal and written communication skills required.

Ways to stand out from the crowd:

  • Experience with Postgres.

  • Exposure to Helm Charts & Terraform.

  • A track record of solving complex problems with elegant solutions.

  • Prior experience with Rust & Python, as well as demonstrated delivery of complex projects in previous roles.

  • Experience with load testing frameworks and with secrets management.

You will also be eligible for equity and benefits.