What you’ll be doing:
Conduct applied research and design innovative algorithms in the space of geometric computer vision andimage/video/vision-language/vision-centricfoundation models.
Deploy algorithms on real humanoid robots to gauge sim-to-real transfer of models.
Work hand-in-hand across multiple engineering and research teams to enable foundation models on humanoid robots.
Constantly learn and explore unfamiliar technologies.
What We Need To See:
Masters in Computer Science, Robotics, Deep Learning, or other related fields (or equivalent experience).
5+ years of algorithm development/research experience relevant work/research experience in one or many of the following areas: Vision-language models, Foundation Models, 3D-LLMs, Video generative models and diffusion algorithms, or Action-based transformers.
Experience in training vision-foundation models including 3D-VLMs, ViT, LLaVA, CLIP, Diffusion, and VLMs.
Hands-on experience with deep learning frameworks (e.g., TensorFlow, PyTorch) and proficiency in modern software development practices (version control, testing, CI/CD).
Hands-on experience with running algorithms on real-world robotic systems.
Excellent communication skills and the ability to collaborate efficiently in a cross-functional, distributed team environment.
Proficient in C++ and Python.
You have a thirst and ability to learn and adapt to new technologies.
Ways To Stand Out From The Crowd:
Ph.D. in Computer Science, Robotics, Deep Learning, or other related fields.
Publications in top-tier AI conferences or contributions to open-source projects.
Experience withvision-language-actionmodels.
You will also be eligible for equity and .
משרות נוספות שיכולות לעניין אותך