What you'll be doing:
Architecting and guiding development of robust inferencing software that can be scaled to multiple platforms for functionality and performance
Performance analysis, optimization and tuning
Closely follow developments in the field of artificial intelligence, and evolve the code design to keep pace
Collaborate across the company to guide the direction of AI Inferencing, working with software, research and product teams
What we need to see:
Bachelors, Masters or higher degree in Computer Engineering, Computer Science, Applied Mathematics or related computing focused degree (or equivalent experience)
15+ years of relevant software development experience and 2+ years in an architect/tech lead role.
Excellent Python or C/C++ programming and software design skills, including debugging, performance analysis, and test design.
Strong understanding of GenAI serving, awareness of the latest developments in deep learning like LLMs
Experience working with LLM inference frameworks like vLLM, SGLang, etc.
Experience working with deep learning frameworks like PyTorch, JAX, etc.
Excellent written and oral communication skills in English
You will also be eligible for equity and .
משרות נוספות שיכולות לעניין אותך