Expoint – all jobs in one place
מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר
Limitless High-tech career opportunities - Expoint

Nvidia Senior Software Engineer Deep Learning Inference Workflows 
United States, California 
195161532

28.07.2025
US, CA, Santa Clara
time type
Full time
posted on
Posted 2 Days Ago
job requisition id

What you’ll be doing:

  • Develop components of TensorRT, NVIDIA’s SDK for high-performance deep learning inference.

  • Use C++ and Python to build graph parsers, optimizers, and tools for effective deployment of trained deep learning models.

  • Collaborate with teams of deep learning experts, GPU architects and DevOps engineers across diverse teams.

What we need to see:

  • A Bachelor's, Master's, PhD or equivalent experience in Computer Science, Computer Engineering, Electrical Engineering or related field.

  • 6+ years of software development experience.

  • Strong experience with C++11/C++14/C++17.

  • Strong grasp of Machine Learning concepts, especially Natural Language Processing.

  • Excellent communication skills, and an aptitude for collaboration and teamwork.

Ways to stand out from the crowd:

  • Proficiency in Python

  • Experience in software performance benchmarking, profiling, and optimizations.

  • Background in compiler development

  • Experience in working with TensorRT, PyTorch, ONNX Runtime, JAX, TRT-LLM, vLLM, SGLang, or other ML frameworks.

  • Experience with HuggingFace Diffusers and Transformers libraries.

You will also be eligible for equity and .