Nvidia Senior Manager DGX Cloud Automation Engineering 
United States, Texas 
573686785

Yesterday
US, CA, Santa Clara
US, CA, Remote
Time type: Full time
Posted on: 6 Days Ago
job requisition id

What You’ll Be Doing:

You will play a crucial role in the success of the DGX Cloud platform by helping to build our development, test, release, and deployment processes; creating world-class tools for performance and quality measurement and regression management; and maintaining a high standard of excellence in our automation tools for CI/CD, deployment, and release engineering.

  • Provide strategic direction for designing and implementing scalable cloud-based systems for PaaS/IaaS.

  • Coordinate collaboration with Product, QA, and Development teams to deliver new features and improvements.

  • Drive improvements in release engineering for both on-premises and cloud deployments.

  • Implement procedures for software quality, security, and performance across the team.

  • Mentor and grow engineering talent, encouraging innovation and accountability.

What We Need to See:

  • Demonstrated experience leading software engineering teams across different time zones.

  • Deep understanding of cloud architecture, virtualization, global infrastructure, and security principles.

  • Expertise in Kubernetes (K8s) and containerization technologies.

  • Familiarity with Infrastructure as Code and major cloud service providers (AWS, Azure, GCP).

  • Strong background in CI/CD strategy and release engineering.

  • Exceptional leadership, communication, and stakeholder management skills.

  • BS/MS in Computer Science or a related field (or equivalent experience).

  • 12+ years of overall software engineering experience, including 5+ years in leadership roles.

Ways to Stand Out from the Crowd:

  • Hands-on background with virtualization technologies (Firecracker, KVM, OpenStack, Nutanix AHV, RedHat OpenShift).

  • Hands-on experience with software that brings up datacenters, where a BMC connection is all that is needed to get a cluster up and ready.

  • Prior experience with Go and Python development.

  • Proven success in scaling teams and delivering cloud platforms.

  • Expertise in load testing frameworks, secrets management, and security compliance.

You will also be eligible for equity and .