Expoint – all jobs in one place

Microsoft Senior Researcher - CoreAI 
Taiwan, Taoyuan City 
129309414

17.07.2025

The team is dedicated to post-training methods for both OpenAI and open-source models. Their work includes continual pre-training, large-scale deep reinforcement learning running on extensive GPU resources, and significant efforts to curate and synthesize training data.

The team also develops advanced AI technologies that integrate language and multi-modality for a range of uses, including those in GitHub Copilot and Visual Studio Code, such as code completion and software engineering.

Its work builds on well-known models such as Oscar, Rho-1, Florence, and the open-source Phi models.

We are looking for a Senior Researcher with significant experience in large-scale model training, data curation, and hands-on coding, ideally from leading research labs. You will develop LLMs, SLMs, multimodal models, diffusion models, agentic models, and coding models using both proprietary and open-source frameworks. Key responsibilities include improving model quality and training efficiency through advanced techniques and data strategies, and managing the full pipeline from data ingestion through evaluation to inference.

You will write efficient code, debug training jobs, and document findings in these fields. You may include information about any individuals who can serve as your referral.

our culture every day.

Required/Minimum Qualifications

  • Doctorate in relevant field
    • OR equivalent experience.
  • Publication record with over 1000 citations
  • 2+ years of experience in large-scale model training, especially with LLMs, SLMs, multimodal, or code-specific models
  • 2+ years of expertise in data curation and synthesis, creating and refining datasets to optimize training outcomes
  • 2+ years of coding experience in languages such as Python, as well as frameworks such as PyTorch and Triton, with the ability to write efficient research or production code and debug complex training jobs
  • 2+ years of experience with both proprietary and open-source frameworks, with demonstrated proficiency in training pipelines and architecture


Other Requirements:

Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings:

  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred/Additional Qualifications

  • Prior experience, preferably at leading research labs, with published work or real-world deployments
  • Extensive experience with foundation models, including large-scale training, model inference, reinforcement learning, reasoning models, vision-language integration, and audio-visual modeling
  • Hands-on experience with large-scale distributed training or serving, and systems thinking
  • Proficiency in programming languages such as Python, and experience with machine learning frameworks like PyTorch and Triton
  • Experience working with large, complex datasets and developing data pipelines for LLM training
  • Demonstrated ability to collaborate within interdisciplinary teams and communicate complex, multimodal research concepts effectively
  • Startup-style mindset: agile, solution-oriented, and able to operate with minimal overhead
  • Self-driven and organized with the ability to take ownership of projects and document findings clearly and effectively

Certain roles may be eligible for benefits and other compensation.

Core Qualifications & Responsibilities

  • Perform large-scale model training: especially with LLMs, SLMs, multimodal, or code-specific models.
  • Perform data curation and synthesis: creating and refining datasets to optimize training outcomes.
  • Hands-on coding: write efficient, production-quality code and debug complex training jobs.
  • Work on both proprietary and open-source frameworks: demonstrated proficiency in training pipelines and architecture.
  • Full-stack modeling responsibility: from data ingestion and training to evaluation and inference management.

Research & Innovation

  • Contribute to or build on existing innovations, such as the technical reports of well-known models.
  • Develop novel AI solutions that bridge language, vision, and code understanding.
  • Help develop models powering tools like GitHub Copilot, Cursor, and VS Code suggestions.
  • Embody our culture and values.