

Share
Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
This position requires the incumbent to have a sufficient knowledge of English to have professional verbal and written exchanges in this language since the performance of the duties related to this position requires frequent and regular communication with colleagues and partners located worldwide and whose common language is English.
Gross pay salary$170,000—$210,000 USDThese jobs might be a good fit

Share
What you’ll be doing:
Understand, analyze, profile, and optimize AI training workloads on state-of-the-art hardware and software platforms.
Guide development of future generations of artificial intelligence accelerators and systems.
Develop detailed performance models and simulator infrastructure for computing systems accelerating AI training, and implement and evaluate hardware feature proposals.
Collaborate across the company to guide the direction of machine learning at NVIDIA; spanning teams from hardware to software and research to production.
Drive HW/SW co-design of NVIDIA’s full deep learning platform stack, from silicon to DL frameworks.
What we need to see:
PhD in CS, EE or CSEE and 3+ years; or MS (or equivalent experience) and 6+ years of relevant work experience.
Strong background in computer architecture, with a proven track record of architecting features in shipping high-performance processors.
Background in artificial intelligence and large language models, in particular training algorithms and workloads.
Experience analyzing and tuning application performance on state-of-the-art hardware.
Experience with processor and system-level performance modelling, simulation, and evaluation before silicon exists.
Programming skills in C++ and Python.
Familiarity with GPU computing across all layers of the AI stack, from DL frameworks like PyTorch down to CUDA.
You will also be eligible for equity and .

Share
What you'll be doing:
You will be contributing to power estimation models and tools for GPU products and systems like NVIDIA DGX.
Early GPU & System Architecture exploration with focus on energy efficiency and TCO improvements at GPU and Datacenter level.
You will help with Performance vs Power Analysis for NVIDIA future product lineup.
Deploy machine learning techniques to develop highly accurate power and performance models of our GPUs, CPUs, Switches, and platforms.
Understand the workload characteristics for GenAI/HPC workloads at Datacenter Scale (multi-GPU) to drive new HW/SW features for Perf@Watt improvements.
Modeling & analysis of cutting-edge technologies like high speed & high-density interconnects.
What we need to see:
Pursuing a MSEE/MSCE, or equivalent experience related to Power / Performance estimation and optimization techniques.
Knowledge of energy efficient chip design fundamentals and related tradeoffs.
Familiarity with low power design techniques such as multi-VT, Clock gating, Power gating, and Dynamic Voltage-Frequency Scaling (DVFS).
Understanding of processors (GPU is a plus), system-SW architectures, and their performance/power modeling techniques.
Proficiency with Python and data analysis packages like: Pandas, NumPy, PyTorch.
Familiarity with performance monitors/simulators used in modern processor architectures.
You will also be eligible for equity and .

Share
What you’ll be doing:
Working with Cloud Service Providers to develop and demonstrate solutions based on NVIDIA’s groundbreaking software and hardware technologies.
Build and deploy solutions at scale using NVIDIA's AI software on cloud-based GPU platforms.
Build custom PoCs for solutions that address customer's critical business needs while applying NVIDIA's hardware and software technologies.
Partner with Sales Account Managers or Developer Relations Managers to identify and secure business opportunities for NVIDIA products and solutions.
Conduct regular technical customer meetings for project/product details, feature discussions, intro to new technologies, and debugging sessions.
Prepare and deliver technical content to customers including presentations about purpose-built solutions, workshops about NVIDIA products and solutions, etc.
What we need to see:
BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics, or other Engineering fields or equivalent experience.
7+ years of Solutions Engineering (or similar Sales Engineering roles) experience.
Established track record of deploying AI/ ML solutions in cloud environments including AWS, GCP, Azure or OCI
Knowledge of DevOps/MLOps technologies such as Docker/containers, Kubernetes, data center deployments, etc.
Effective time management and capable of balancing multiple tasks.
Ability to communicate ideas clearly through documents, presentation, etc.
Ways to stand out from the crowd:
AWS, GCP, Azure or OCI Professional Solution Architect Certifications
Hands-on experience with NVIDIA GPUs and SDKs (i.e. CUDA, Triton, TensorRT-LLM, etc.)
Deep understanding of the full software development lifecycle, including best practices for system design, architectural patterns, and comprehensive testing.
Solid working knowledge of Python
System-level experience, specifically GPU-based systems
You will also be eligible for equity and .

Share
What you'll be doing:
As a senior member in our team, you will work with pre-silicon and post-silicon data analytics - visualization, insights and modeling.
Design and uphold sturdy data pipelines and ETL processes for the ingestion and processing of DFX Engineering data from various origins
Lead engineering efforts by collaborating with cross-functional teams (execution, analytics, data science, product) to define data requirements and ensure data quality and consistency
You will work on hard-to-solve problems in the Design For Test space which will involve application of algorithm design, using statistical tools to analyze and interpret complex datasets and explorations using Applied AI methods.
In addition, you will help develop and deploy DFT methodologies for our next generation products using Gen AI solutions.
You will also help mentor junior engineers on test designs and trade-offs including cost and quality.
What we need to see:
BSEE (or equivalent experience) with 5+, MSEE with 3+, or PhD with 1+ years of experience in low-power DFT, Data Visualization, Applied Machine Learning or Database Management.
Experience with SQL, ETL, and data modeling is crucial
Hands-on experience with cloud platforms (AWS, Azure, GCP)
Design and implement highly scalable, fault tolerant distributed database solutions
Lead data modeling, performance tuning, and capacity planning for large-scale, mission-critical storage workloads
Excellent knowledge in using statistical tools for data analysis & insights.
Strong programming and scripting skills in Perl, Python, C++ or Tcl is expected
Outstanding written and oral communication skills with the curiosity to work on rare challenges.
Ways to stand out from the crowd:
Experience in data pipeline and database architecture for real-world systems
Experience in application of AI for EDA-related problem-solving
Good understanding of technology and passionate about what you do
Strong collaborative and interpersonal skills, specifically a proven ability to effectively guide and influence within a dynamic environment
You will also be eligible for equity and .

Share
What you'll be doing:
Develop and implement the business logic in the new End-to-End Data systems for our Planning, Logistics, Services, and Sourcing initiatives.
Lead discussions with Operations stakeholders and IT to identify and implement the right data strategy given data sources, data locations, and use cases.
Analyze and organize raw operational data including structured and unstructured data. Implement data validation checks to track and improve data completeness and data integrity.
Build data systems and data pipelines to transport data from a data source to the data lake ensuring that data sources, ingestion components, transformation functions, and destination are well understood for implementation.
Prepare data for AI/ML/LLM models by making sure that the data is complete, has been cleansed, and has the necessary rules in place.
Build/develop algorithms, prototypes, and analytical tools that enable the Ops teams to make critical business decisions.
Build data and analytic solutions for key initiatives to set up manufacturing plants in US.
Support key strategic initiatives like building scalable cross-functional datalake solutions.
What we need to see:
Master’s or Bachelor’s degree in Computer Science or Information System, or equivalent experience
8+ years of relevant experience including programming knowledge (i.e SQL, Python, Java, etc)
Highly independent, able to lead key technical decisions, influence project roadmap and work effectively with team members
Experience architecting, designing, developing, and maintaining data warehouses/data lakes for complex data ecosystems
Expert in data and database management including data pipeline responsibilities in replication and mass ingestion, streaming, API and application and data integration
Experience in developing required infrastructure for optimal extraction, transformation, and loading of data from various sources using Databricks, AWS, Azure, SQL or other technologies
Strong analytical skills with the ability to collect, organize, and disseminate significant amounts of information with attention to detail and accuracy
Knowledge of supply chain business processes for planning, procurement, shipping, and returns of chips, boards, systems, and networking.
Ways to stand out from the crowd:
Self-starter, collaborative, positive mindset, committed to growth with integrity and accountability, highly motivated, driven, and high-reaching
Solid ability to drive continuous improvement of systems and processes
A consistent record to work in a fast-paced environment where good interpersonal skills are crucial
You will also be eligible for equity and .

Share
What you'll be doing:
Lead sophisticated programs focused on improving the quality and efficiency of data center infrastructure, hardware, and software domains with multi-year strategic roadmaps and cross-
Drive technical execution from requirements gathering through production launch, including writing technical specifications, coordinating release schedules, and ensuring operational readiness across multiple team dependencies
Own server hardware development, testing, and integration efforts for computing products, working closely with original design manufacturers and contract manufacturers on new product introductions at global manufacturing scale
Partner with software development teams to build automation programs for large-scale infrastructure testing and develop solutions that enhance operational performance across highly concurrent, high-throughput distributed systems
Guide enterprise network infrastructure and data center operations initiatives covering servers, storage, networking, power, and cooling systems while serving as domain leader for manufacturing test infrastructure
Lead continuous improvement initiatives for engineering processes, quality management, and operational excellence while leading risk mitigation strategies and critical path oversight
Build trusted partnerships across hardware teams, security professionals, supply chain, operations, and product management to drive technical decisions and resolve sophisticated multi-functional dependencies
What we need to see:
Bachelor's degree in Engineering, Computer Science, Electrical Engineering, Mechanical Engineering, or related technical field, or equivalent experience
12+ years working directly with engineering teams with demonstrated technical program management experience
More than 7 years of practical program or project management expertise being responsible for intricate technology ventures involving teams with multifaceted strengths
5+ years of software development experience with proficiency in programming languages.
5+ years leading hardware product development and new product introduction on a global manufacturing scale
Deep technical expertise in server, network, or storage product architecture and manufacturing test development
Strong understanding of large-scale distributed systems, data center infrastructure, and enterprise network architecture
Experience with Linux/Unix or Windows system administration, database management, and infrastructure automation
Demonstrated ability to lead programs across multiple teams, handle project scope, schedule, budget, and quality, and maintain executive-level relationships
Ways to stand out from the crowd:
8+ years directly leading sophisticated technology projects with experience designing and architecting highly reliable, scalable systems
Track record launching AI or ML server products with new technology enablement such as Liquid Cooling
Experience leading manufacturing test engineering teams within the server, network, or storage sector with expertise in Design for Excellence methodologies
Knowledge of security engineering, cryptography, quality management systems, and supply chain operations
Demonstrated single-threaded ownership of strategic programs with demonstrated ability to deliver groundbreaking systems independently in fast-paced, ambiguous environments
You will also be eligible for equity and .

Share
Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
This position requires the incumbent to have a sufficient knowledge of English to have professional verbal and written exchanges in this language since the performance of the duties related to this position requires frequent and regular communication with colleagues and partners located worldwide and whose common language is English.
Gross pay salary$170,000—$210,000 USDThese jobs might be a good fit