Expoint – all jobs in one place

NVIDIA Principal Software Engineer, TensorRT-LLM
United States, Texas 
120126439

US, CA, Santa Clara
US, CA, Remote
Time type: Full time
Posted: 14 days ago

What you'll be doing:

  • Architect and guide the development of robust inference software that scales across multiple platforms in both functionality and performance

  • Analyze, optimize, and tune performance

  • Closely follow developments in artificial intelligence and evolve the code design to keep pace

  • Collaborate across the company to guide the direction of AI Inferencing, working with software, research and product teams

What we need to see:

  • Bachelor's, Master's, or higher degree in Computer Engineering, Computer Science, Applied Mathematics, or a related computing-focused field (or equivalent experience)

  • 15+ years of relevant software development experience and 2+ years in an architect/tech lead role

  • Excellent Python or C/C++ programming and software design skills, including debugging, performance analysis, and test design

  • Strong understanding of GenAI serving and awareness of the latest developments in deep learning, such as LLMs

  • Experience working with LLM inference frameworks like vLLM, SGLang, etc.

  • Experience working with deep learning frameworks like PyTorch, JAX, etc.

  • Excellent written and oral communication skills in English

You will also be eligible for equity.