

What you will be doing:
Drive strategic CSP partnerships: collaborate with key hyperscale CSPs to align project schedules, priorities, and technical roadmaps for next-generation data center platforms.
Manage complex technical collaborations proactively, identifying and resolving critical issues before they impact customer deployments.
Orchestrate internal stakeholder alignment, ensuring CSP priorities are reflected across NVIDIA's engineering, product, and business teams.
Create comprehensive customer program visibility through executive dashboards, status reports, and metrics tracking that provide real-time insights into CSP project health, risks, and milestone achievement.
Lead large-scale deployment programs managing multi-rack, hyperscale infrastructure rollouts with complex technical dependencies, timeline coordination, and resource alignment across multiple internal teams.
What we need to see:
Technical Expertise: Solid understanding of system software design, OS fundamentals, Linux kernel development, and hardware/software interfaces. Experience in GPU-based data center server architectures.
Program Management: Proven ability to lead software development for rack-scale systems and data center servers, including complex hardware/software integration projects.
Industry Collaboration: Experience partnering with hyperscalers to drive technical outcomes, manage dependencies, handle escalations, and communicate effectively at the executive level.
Communication & Leadership: Exceptional ability to translate technical concepts for business stakeholders and align diverse teams toward common goals.
BS or MS in Computer Engineering, Computer Science, or a related field, or equivalent experience.
8+ years of technical program management experience in HPC or data center server software development.
Ways to stand out from the crowd:
Prior hands-on technology development experience in out-of-band manageability and observability solutions, system software/Linux kernel driver development, CUDA programming, or LLM/AI framework development.
You will also be eligible for equity and .

As a senior manager in our global IT PMO team, you will be accountable for critical infrastructure programs supporting Compute platforms and IT Automation. As a leader of these initiatives, you will drive the operating model, scaling approach, and playbooks for delivering programs at scale.
What you'll be doing:
Lead, mentor, and develop a diverse team of Technical Program Managers, fostering professional growth and a culture of accountability and innovation.
Serve as a force multiplier across the organization by enabling effective coordination of cross-functional initiatives and managing complex interdependencies.
Promote collaboration in a fast-paced, dynamic environment while guiding teams through uncertainty with clarity, technical insight, and a results-oriented mindset.
Establish and maintain best-in-class program management practices to optimize delivery efficiency, mitigate risk, and ensure consistent execution excellence.
Drive continuous improvement by implementing data-driven feedback mechanisms and leveraging metrics to identify and act on opportunities for optimization.
Own and manage the Infrastructure program portfolio, ensuring alignment with organizational goals, strategic priorities, and resource capacity.
Lead quarterly portfolio planning sessions to align stakeholders on dependencies, risks, and prioritization of initiatives. Deliver clear, data-driven updates to senior stakeholders, tracking progress against key performance indicators and strategic objectives.
Evaluate and prioritize new program requests based on strategic value, business impact, and available capacity. Monitor and assess portfolio performance, identifying improvement areas and providing actionable recommendations to leadership.
What we need to see:
Bachelor's degree in computer science or another related technical field, or equivalent experience.
12+ overall years of IT experience.
7+ years of experience successfully leading technical programs in a fast-paced, multifaceted enterprise environment.
In-depth technical knowledge of IT infrastructure within Automation, platforms, SW development and observability, and Compute Systems such as OpenShift.
Deep understanding of infrastructure standards and methodologies to optimize for quality and efficiency. Experience with various continuous integration/deployment models for large organizations will be important, as will foresight into how to adopt and integrate such practices into a very dynamic infrastructure environment.
Certified Scrum Master or Certified Scrum Trainer certification or equivalent preferred.
Proven history of continuous improvement that enables higher-performing programs, organizations, and/or teams with improved business and customer outcomes.
Consistent track record of delivering critical infrastructure builds while navigating a fast-paced environment with frequent shifts in priorities.
Effective written and verbal communication and presentation skills. Ability to bridge from high-level objectives to project details and vice versa.
Willingness to work with distributed team members across different time zones.
You will also be eligible for equity and .

What you'll be doing:
Serve as the primary, high-impact contributor on complex features. Dedicate significant time to producing production code across the full stack, including UI, APIs, services, and infrastructure.
Code Review Leadership & Quality Assurance: Lead the code review process, setting and implementing thorough coding standards, performance benchmarks, and architectural integrity to ensure all merged code is high-quality, maintainable, and robust.
Architectural Ownership & Portability: Define and own the long-term technical roadmap, architecture, and design. This includes the required assurance that the deployment pipelines and services are platform-agnostic and easily deployable across the broader NVIDIA ecosystem, deliberately avoiding internal infrastructure dependencies.
Foundation Model Deployment Strategy: Lead the strategic implementation of web services and efficient batch processing queues to seamlessly integrate and operationalize our world foundation models into the customer-facing platform.
System Performance & Reliability: Implement and enforce standards for production-grade performance, monitoring, and fault tolerance across all services. Proactively identify and resolve systemic technical debt and scalability bottlenecks.
Deployment & Operational Excellence: Take ultimate ownership of the CI/CD pipelines, container orchestration strategy (Kubernetes/Helm), and operational readiness, ensuring seamless scalability and reliability in production.
Team Mentorship & Guidance: Mentor and guide the engineering team on advanced practices in full-stack development, distributed systems design, performance optimization, and clean, portable code architecture.
Multi-functional Partnership: Act as the key technical liaison, translating complex requirements from Product Managers, ML Engineers, and Data Scientists into robust, portable, and implementable designs.
What we need to see:
This role requires a proven track record of significant experience and technical mastery:
12+ years of hands-on experience developing and deploying scalable full-stack web services in a cloud environment.
Proven Tech Lead or equivalent Senior/Staff level experience with demonstrated ability to define system architecture, mentor engineers, and take end-to-end technical ownership of a major platform while remaining deeply active in coding and code reviews.
Expert-level proficiency in designing and scaling distributed microservices architectures using gRPC and REST APIs.
Deep expertise in modern frontend frameworks and building highly responsive, data-intensive UIs capable of managing high-frequency data flows.
Direct experience designing and deploying containerized applications that use a GPU (e.g., NVIDIA Container Toolkit).
Experience with MaaS (Model-as-a-Service) patterns and serving large machine learning models as high-throughput endpoints.
Mastery of container orchestration, including Kubernetes and Helm for sophisticated, portable, multi-service production deployments.
Proficiency in backend languages such as Python and/or Go, and TypeScript for the frontend.
Strong practical experience with Cloud Infrastructure (AWS S3) and running complex data storage/access patterns (SQL, key-value stores).
Expertise in CI/CD practices (GitLab, Jenkins) with a focus on automation, testing, and improving deployment velocity and stability.
Bachelor's degree (B.S.) or equivalent experience in Computer Science, Software Engineering, Electrical Engineering, or a closely related technical field; Master's degree (M.S.) preferred.
Ways to stand out from the crowd:
These skills represent a strong alignment with our specific domain challenges:
Experience in data querying platforms such as Apache Druid, ClickHouse, or Elasticsearch.
Familiarity with autonomous vehicle simulation environments (e.g., Carla) and synthetic data generation pipelines using foundational models.
You will also be eligible for equity and .

NVIDIA is seeking a Senior Technical Program Manager to lead the Infrastructure and Product Security and Compliance program for DGX Cloud. In this role, you will ensure our platforms and partner ecosystem meet the highest standards of trust, resilience, and governance.
As a Senior TPM focused on Cloud Security, you will own the design and execution of a DGXC-wide infrastructure security program that strengthens how DGXC operates with Cloud Service Providers (CSPs) and NVIDIA Cloud Partners (NCPs). You will drive security initiatives by embedding compliance controls, governance frameworks, and best practices across infrastructure, platform, and product teams. This role also ensures Product Security is integrated into product roadmap planning and the software development lifecycle, aligning product and infrastructure priorities. You will work closely with senior leaders and cross-functional teams in Security, Compliance, DevOps, and Engineering to continuously enhance and scale the DGX Cloud Security Posture.
What You’ll Be Doing:
Lead alignment across engineering, product, security, and partner teams to deliver against cloud security guidelines with CSP and NCP partners.
Drive programs that strengthen vulnerability management, access control, patching, and compliance readiness for SOC 2, ISO 27001, and related certifications.
Operate DGXC-wide security engineering forums and processes, establishing security KPIs, dashboards, and “run safe” SRE practices.
Partner with the CISO organization to define and assess emerging cloud providers against DGX Cloud security requirements, driving measurable improvements and action plans.
Implement and evolve security controls frameworks (e.g., SSH hardening, IAM, secret rotation) in CI/CD pipelines to ensure continuous compliance.
Lead certification readiness and audit cycles, including SOC 2 Type 1 & 2 and ISO 27001, from control mapping through evidence collection and remediation.
Chair the DGX Cloud Security & Compliance Working Group, managing governance reviews, risk dashboards, and executive reporting on posture and metrics.
Develop training programs to build security and compliance awareness across Product, DevOps, and Engineering teams.
Create playbooks and automation frameworks that streamline certification renewals, patching cycles, and vulnerability management workflows.
Maintain and continuously improve technical compliance documentation, including system diagrams, process flows, and control mappings.
What We Need to See:
12+ years of Program Management experience driving the planning and execution of large programs and software engineering projects in a fast-paced environment.
Consistent track record delivering successful Security, Risk, and/or Compliance programs, particularly in cloud IaaS and SaaS environments, resulting in full certification of a suite of products and services.
Experience leading efforts related to SOC 2 (Type 1 and Type 2) audits and readiness, including leading control implementation (e.g., access controls, change management, vulnerability management).
Experience operationalizing vulnerability management, patch management, SSH key governance, and access controls across distributed systems.
Ability to think strategically and tactically and to build consensus in making programs successful; ability to resolve technical issues and resource constraints across cross-functional teams.
Demonstrated ability to define metrics, dashboards, and risk indicators that measure posture improvement and audit readiness.
Proficiency with tools like JIRA to guide engineering teams on execution in an Agile/Scrum manner and ensure accurate governance artifacts are delivered.
Excellent executive communication and presentation skills, with the ability to distill complex technical and compliance topics for senior leadership.
MS EE or CS degree, or equivalent experience.
Ways to Stand Out from the Crowd:
Highly motivated, with strong interpersonal skills and a proven track record of working successfully with multi-functional teams and coordinating effectively across organizational boundaries and geographies.
Experience implementing security features in a multi-cloud environment.
Experience with sophisticated compliance programs, such as FedRAMP, SOC 2, or ISO certification efforts.
Solid understanding of tier 1 cloud technologies (AWS, GCP, Azure, OCI).
Experience with productivity tools and process automation.
You will also be eligible for equity and .

What You’ll Be Doing:
Guide partners along their Exemplar Path, a collective journey towards demonstrating outstanding cloud acceleration performance, scalability, and innovation leadership.
Collaborate with NVIDIA’s Cloud, Partner, and Enterprise Solutions teams to co-develop strategies that align cloud architecture with customer and national objectives.
Develop business and technical roadmaps that connect infrastructure investment with measurable business impact and long-term ecosystem growth.
Collaborate with sovereign and hyperscale cloud providers to establish AI acceleration strategies using NVIDIA’s technology stack (DGX Cloud, NVIDIA AI Enterprise, Grace Hopper, NVLink, CUDA).
Lead executive-level workshops and strategy sessions with partner leadership teams to align vision, performance goals, and value creation.
Support joint go-to-market initiatives that highlight exemplar implementations of NVIDIA accelerated computing within global and sovereign clouds.
Act as a trusted advisor and ecosystem strategist, connecting technology performance to economic, environmental, and strategic outcomes.
Contribute to NVIDIA’s Sovereign AI and Cloud Acceleration playbooks, defining frameworks for innovation that respect data sovereignty while scaling AI adoption.
What We Need to See:
Bachelor’s or Master’s in Business, Engineering, Computer Science, Data Science, or equivalent experience.
12+ years of experience in cloud strategy, AI infrastructure consulting, or partner ecosystem development.
Solid understanding of GPU-accelerated computing, AI infrastructure, and cloud architecture.
Demonstrable ability to translate technical solutions into strategic business narratives and measurable value frameworks.
Experience interacting with executive team members and ecosystem partners across both public and private sectors.
Effective communication and storytelling skills across technical and strategic audiences.
Ways to Stand Out from the Crowd:
Deep experience in hyperscaler or sovereign cloud partnerships and ecosystem development.
Familiarity with NVIDIA DGX Cloud, AI Enterprise, and Grace Hopper-based architectures.
Proven track record in strategic enablement programs or building exemplar partner models that demonstrate innovation excellence.
Understanding of Sovereign AI trends, AI infrastructure economics, or sustainable acceleration strategies.
You will also be eligible for equity and .

What you'll be doing:
Design intuitive data models and semantic layers to enable self‑service and AI apps, reducing ad‑hoc query friction for business users.
Enrich data products with business glossary and metadata to reduce AI hallucinations and improve user adoption, searchability, and governance.
Lead multi‑site integrations across new manufacturing plants and ops applications, standardizing schemas and controls and enabling cross‑plant insights.
Engineer scalable pipelines with data integrity functions and audit features. Automate measuring and monitoring data quality for improved decision making.
Explain data designs, system changes, and enhancements to stakeholders, and address their questions or issues effectively.
Partner with stakeholders, solve business problems, train users, and help with data and queries.
Optimize Lakehouse systems to deliver high-performing solutions while controlling operational costs.
What we need to see:
BS, MS, or PhD in EE/CS or related field of education (or equivalent experience).
5+ years of programming experience (Python, PySpark, SQL, etc.).
5+ years of experience with big data technologies and cloud platforms (AWS, Databricks, Snowflake).
12+ overall years in Data Warehousing, implementing projects with data Lakehouse solutions.
Experience with enterprise BI databases like SAP BW/HANA, ERP/CRM systems like SAP/Salesforce, planning applications like IBP, APO etc.
Knowledge of operational processes in chips, boards, systems, and networking.
Proficiency in Tableau, PowerBI, and SAP reporting applications.
Ways to stand out from the crowd:
Strong analytical skills with the ability to collect, organize, and disseminate significant amounts of information with attention to detail and accuracy.
Highly independent, able to lead key technical decisions, influence the project roadmap, and work effectively with team members.
Proven experience leading multiple analytics projects in a dynamic, fast-paced environment.
Data science and AI/ML experience.
Strong interpersonal skills and clear verbal and written communication.
You will also be eligible for equity and .

What you’ll be doing:
Participate in an international team of software engineers working on products for testing NVIDIA products.
Oversee the design, implementation, and maintenance of scalable test automation frameworks.
Manage, mentor, and guide a team of automation engineers.
Design and implement robust, maintainable, and efficient automation test suites.
Work with continuous integration systems and regression tools: automate builds and test suites, generate test reports, isolate and classify failures, and review new degradations.
Promote a culture of innovation, quality, and accountability. Make SONiC NOS shine in customers' eyes.
What we need to see:
B.Sc. degree or equivalent experience in Engineering/Computer Science/related field.
8+ overall years of experience in software development and testing. 2+ years of experience in a leadership role.
Proven experience in a leadership role, with a track record of successfully leading scrums and projects.
Intrinsically motivated with a desire for automation programming.
Strong programming skills in Python.
Strong technical abilities, problem-solving skills, coding, and design skills.
Ability to lead feature development, take full ownership and deliver independently.
Linux knowledge: a general understanding of Linux operating system concepts.
Ways to stand out from the crowd:
Strong communication and interpersonal skills, with the ability to motivate and inspire others.
Knowledge of one or more networking areas: Ethernet, VLANs, TCP/UDP/IP, QoS, L2-L3 protocols.
Extensive experience with automation frameworks (e.g., Selenium, Robot Framework, PyTest) and scripting languages (e.g., Python, Java).
You will also be eligible for equity and .
