What you’ll be doing:
Design, develop, and optimize NVIDIA TensorRT and TensorRT-LLM to supercharge inference applications for datacenters, workstations, and PCs.
Develop software in C++, Python, and CUDA for seamless and efficient deployment of state-of-the-art LLMs and Generative AI models.
Collaborate with deep learning experts and GPU architects throughout the company to influence hardware and software design for inference.
What we need to see:
BS, MS, PhD, or equivalent experience in Computer Science, Computer Engineering, or a related field.
8+ years of software development experience on a large codebase or project.
Strong proficiency in C++ (required) and in Rust or Python.
Experience in developing Deep Learning Frameworks, Compilers, or System Software.
Excellent problem-solving skills and a passion for learning and working effectively in a fast-paced, collaborative environment.
Strong communication skills and the ability to articulate complex technical concepts.
Ways to stand out from the crowd:
Experience in developing inference backends and compilers for GPUs.
Knowledge of Machine Learning techniques and GPU programming with CUDA or OpenCL.
Background in working with LLM inference frameworks such as TensorRT-LLM, vLLM, or SGLang.
Experience working with deep learning frameworks such as TensorRT, PyTorch, or JAX.
Knowledge of close-to-metal performance analysis, optimization techniques, and tools.
You will also be eligible for equity and benefits.