Expoint - all jobs in one place

Microsoft Senior Researcher CoreAI 
Taiwan, Taoyuan City 
901799203

24.04.2025

In this role, you will focus on adapting and grounding multimodal models for product-driven scenarios, integrating text, images, and potentially other data types to create powerful and adaptable AI solutions.

Your work will involve developing deep learning techniques to enhance custom copilot experiences by efficiently adapting large-scale multimodal models. Adaptation strategies include supervised fine-tuning, multimodal fusion, and post-training with reinforcement learning.

Required Qualifications:

  • Doctorate in a relevant field
    • OR equivalent experience
  • Experience in machine learning, deep learning, or multimodal research (e.g., language-vision integration, cross-modal learning)

Other Requirements:

  • Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screening: Microsoft Cloud Background Check. This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Preferred Qualifications:

  • Extensive experience with multimodal foundation models, including vision-language integration, audio-visual modeling, and large-scale multimodal training
  • Strong publication record in top-tier conferences focused on multimodal models, vision-language fusion, or large language models
  • Proficiency in programming languages such as Python, and experience with machine learning frameworks like PyTorch and Triton
  • Experience working with large, complex datasets and developing data pipelines for multimodal training
  • Demonstrated ability to collaborate within interdisciplinary teams and communicate complex, multimodal research concepts effectively

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:


Microsoft will accept applications for the role until April 23, 2025.

Responsibilities:

  • Development of multimodal model customization and large-scale training, integrating language, vision, and other data modalities
  • Data preparation, training, and evaluation of multimodal customization tasks
  • Collaboration with Microsoft product groups to integrate multimodal AI solutions across applications
  • Multimodal research and innovation, including staying current with the latest advances in deep learning, multimodal fusion, and cross-modal interactions

Other:

  • Embody our culture and values