Expoint - all jobs in one place

NVIDIA Senior Manager, Inference Platform Technical Product Marketing
United States, Texas 
423903553

Today
US, CA, Santa Clara
US, WA, Remote
US, OR, Remote
US, AZ, Remote
Time type: Full time
Posted on: 3 Days Ago

What You’ll Be Doing:

  • Lead all of NVIDIA’s inference technical platform go-to-market efforts

  • Develop a plan to showcase the technical attributes of our inference platform to the market and present the plan to an executive audience

  • Work closely with engineering and product management teams to understand the key technical capabilities of our inference stack, from GPUs, CPUs, and networking to CUDA libraries, model architectures, and deployment techniques (e.g., parallelism strategies, configurations, etc.)

  • Diligently review and remain up to date on model architectures, frameworks, arXiv papers, whitepapers, and deployment techniques (e.g., disaggregated serving, KV cache implementations), and identify intersection points between the latest AI models and NVIDIA’s platform to maximize performance and minimize TCO

  • Develop crisp, clear positioning, messaging, and assets that highlight NVIDIA’s leadership position in inference. Assets include blogs, whitepapers, presentations, analyst briefings, and seminars at developer conferences

  • Closely follow competitive inference announcements and prepare appropriate responses for business and technical/developer audiences

  • Assist in building executive keynote slides in areas where you are a subject matter expert

  • Manage a team of technical PMMs responsible for NVIDIA’s inference and inference software platforms

What We Need to See:

  • A BS degree in Computer Science, Engineering, or a related field (or equivalent experience). Master’s degree preferred for a technical product marketing role

  • 7+ years of overall experience in LLM and AI/ML development in an engineering role, followed by 5+ years of experience in product management or technical product marketing of AI/ML products

  • 2+ years of experience managing engineering or product marketing teams

  • Deep understanding of modern data center architectures, accelerated computing, distributed inference, deep learning frameworks (PyTorch, TensorFlow, JAX), and inference-specific frameworks & optimizations (Dynamo, Triton Inference Server, TensorRT-LLM, vLLM, SGLang)

  • Market Awareness – Experience conducting technical competitive analysis and synthesizing key insights

  • Collaboration & Influence – Proven ability to work cross-functionally across engineering, product management, sales, and marketing teams

  • Strong Communication, Asset Creation & Storytelling – Ability to translate sophisticated technical concepts into clear, compelling narratives for both technical and business audiences

  • Ability to present to executive audiences, including C-level executives

Ways to Stand Out from the Crowd:

  • Hands-on experience with AI inferencing workflows using NVIDIA or open-source serving frameworks running on accelerated computing in the data center

  • Experience working with hyperscale cloud providers

  • Hands-on Technical Competence – Background in software development, AI infrastructure, data center silicon, and developing LLMs and AI/ML models

  • Demonstrated ability to engage with executive leadership and external partners

  • Published technical content or speaking experience at industry events

You will also be eligible for equity and .