

Your Impact
As an AI Infrastructure Abstraction Engineer, you will help shape the next generation of AI compute platforms by designing systems that abstract away hardware complexity and expose logical, scalable, and secure interfaces for AI workloads. Your work will enable multi-tenancy, resource isolation, and dynamic scheduling of GPUs and accelerators at scale, making infrastructure programmable, elastic, and developer-friendly.
You will bridge the gap between raw compute resources and AI/ML frameworks, allowing infrastructure teams and model developers to consume shared GPU resources with the performance and reliability of bare metal, but with the flexibility of cloud-native systems. Your contributions will empower internal and external users to run AI workloads securely, efficiently, and predictably — regardless of the underlying hardware topology.
This role is critical to enabling AI infrastructure that is multi-tenant by design, scalable in practice, and abstracted for portability across diverse platforms.
KEY RESPONSIBILITIES
Minimum Qualifications:
Preferred Qualifications:

Day-to-day activities will involve:

This position requires a hybrid working schedule in the San Jose or Milpitas office.
As a High-Performance AI Compute Engineer, you will be instrumental in defining and delivering the next generation of enterprise-grade AI infrastructure. As a principal engineer within our GPU and CUDA Runtime team, you will play a critical role in shaping the future of high-performance compute infrastructure. Your contributions will directly influence the performance, reliability, and scalability of large-scale GPU-accelerated workloads, powering mission-critical applications across AI/ML, scientific computing, and real-time simulation.
You will be responsible for developing low-level components that bridge user space and kernel space, optimizing memory and data transfer paths, and enabling cutting-edge interconnect technologies like NVLink and RDMA. Your work will ensure that systems efficiently utilize GPU hardware to its full potential, minimizing latency, maximizing throughput, and improving developer experience at scale.
KEY RESPONSIBILITIES
Minimum Qualifications:
Preferred Qualifications

The application deadline is being extended; applications are expected to close on August 29th, 2025.
The job posting may be removed earlier if the position is filled or if a sufficient number of applications are received.
Your Impact
You will engage and drive the teams working on Design for Reliability. You will build and innovate in data analytics to enable faster quality through AI/ML data models. You will plan and develop test teams to drive FA and Qualification activity.
Responsibilities
Minimum Qualifications
Preferred Qualifications
