Share
What you'll be doing:
Create products to help developers build better Inference deployments
Develop product strategy, roadmaps, and go-to-market plans
Collaborate with internal and external developers to build product-based roadmaps for model optimization software
Work with leadership to align with and drive company strategy
What we need to see:
Experience with Inference deployment and optimization software (ex. vLLM, SGLang, FlashInfer, TensorRT-LLM, Triton, Dynamo, TorchAO, etc.)
Demonstrable knowledge of GenAI or machine learning concepts, particularly around performance optimization, and software development and delivery
BS or MS degree in Computer Science, Computer Engineering, or similar experience (or equivalent experience)
5+ years of technical product management, or similar, experience at a technology company
Strong communication and interpersonal skills
Ways to Stand Out from the crowd:
Experience leading optimization products for Inference
Working on Open Source & Github-first developer products with deep customer interactions
Knowledge of GPU architecture, HW/SW co-design, and performance profiling
You will also be eligible for equity and .
These jobs might be a good fit