המקום בו המומחים והחברות הטובות ביותר נפגשים
What You'll Be Doing:
Keeping abreast of the latest advancements in generative AI research.
Prototyping and analyzing emergent techniques in the test-time compute space such as output refinement, speculation, and retrieval. Identifying opportunities for algorithmic as well as system optimizations.
Pioneering the development of innovative optimizations to enable high quality inferencing on NVIDIA GPUs.
Collaborating closely with production teams to incorporate the latest advancements into cutting-edge software frameworks.
What We Need to See:
Master's degree (or equivalent experience) in Computer Science, Artificial Intelligence, Applied Mathematics, or related fields.
A strong foundation in deep learning, with a particular emphasis on generative models and inferencing.
A track record of at least 5 years of relevant software development experience in modern deep learning frameworks such as PyTorch.
Growth mindset and pragmatic attitude.
Ways to Stand Out From the Crowd:
Published research or noteworthy contributions to the field of deep learning, particularly in areas such as inference-time compute, conditional compute, speculative decoding, etc.
Experience with prototyping and/or deployment of emergent test time compute techniques.
Experience with collaborating across algorithms, software and performance teams to deliver high quality solutions.
Familiarity with computer architecture and how it relates to AI algorithms development.
You will also be eligible for equity and .
משרות נוספות שיכולות לעניין אותך