Role and Responsibilities
As a Multimodal AI - Intern, you will:
Innovate & develop state-of-the-art solutions to industry relevant problems in the field of on-device audio-visual AI
Actively propose and prototype novel research ideas/solutions considering real-world constraints
Work on complex systems and develop research ideas into production ready software
Incorporate software engineering practices at both research and development stages
Communicate and disseminate research results via papers and/or reports
Required Skills:
Perusing PhD degree in ML/AI, Computer Science/Engineering, Mathematics, Statistics, or related disciplines
Strong fundamentals in machine learning and artificial intelligence
First author publications in top ML/AI conferences/journals (e.g., CVPR, ICCV, NeurIPS, ICML, ICLR, ICASSP, INTERSPEECH, IEEE TPAMI, IEEE IoT, IEEE TNNLS, JMLR or similar)
Machine Learning experience in least one of the following domains:
Multimodal LLMs – audio and/or video
Contrastive Learning (e.g. multi-modal feature alignment)
Model compression methods (e.g. quantization, pruning, knowledge distillation)
Demonstrated success:
Strong development skills with Python and/or C/C++ is required.
Experience with programming using popular machine learning frameworks such as PyTorch and/or Tensorflow
Creating comprehensive and well-written documentation.
Familiarity with software engineering practices and tools such as Git
Excellent communication, teamwork and a results-oriented attitude
Proficiency in problem solving and debugging
Desirable Skills:
Experience in multimodal emotion recognition and foundational face models.
Experience in multi-task learning and deception detection.
A proven track record of developing complex training and inference pipelines
Knowledge of embedded and/or distributed machine learning methods and tools
Experience with model optimization, profiling and performance improvement of AI pipelines
Contribution to open source software libraries
Contract Type:6 month internship or 6 month outsourced contract via agency
Hybrid Working:3 days onsite and 2 days working from home weekly
. You can change Country/Language at the bottom of the page. If you are European Economic Resident, please click :
משרות נוספות שיכולות לעניין אותך