Expoint – all jobs in one place

Microsoft Senior Applied Research Scientist - Generative AI & Agentic Systems 
Taiwan, Taoyuan City 
253478087

17.07.2025

Required/Minimum Qualifications

  • Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience (e.g., statistics, predictive analytics, research) OR Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research) OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 1+ year(s) related experience (e.g., statistics, predictive analytics, research)
    • OR equivalent experience
  • Hands-on experience in applied research and development within AI/ML, with a significant focus on small/large language models and NLP
  • Expertise in Large Language Models (LLMs) and Small Language Models (SLMs), including their architectures (e.g., Transformers), pre-training, and fine-tuning methodologies (e.g., LoRA, QLoRA, instruction tuning)
  • 1+ years of experience with model compression techniques (quantization, pruning) and optimized inference engines for LLMs (e.g., vLLM)
  • 1+ years of experience in designing and implementing agentic AI systems, including multi-agent orchestration, planning, tool use, and reasoning

Other Requirements

Ability to meet Microsoft, customer and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings:

  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred/Additional Qualifications

  • Solid understanding and practical experience in designing and implementing agentic AI systems, including multi-agent orchestration, planning, tool use, and reasoning
  • Proficiency in Python and relevant ML/DL frameworks (e.g., PyTorch, TensorFlow, Hugging Face Transformers)
  • Excellent analytical, problem-solving, and critical thinking skills
  • Proficient written and verbal communication skills, with the ability to articulate complex technical concepts to diverse audiences
  • Proficiency in C#
  • Hands-on experience with NVIDIA Triton
  • Proficiency in accelerated application development on GPU substrate (e.g., CUDA)
  • Experience working with customers deploying AI solutions
  • Comprehensive knowledge of Natural Language Processing principles, algorithms, and common tasks
  • Publication record in top-tier conferences (e.g., NeurIPS, ICML, ACL, EMNLP) is highly desirable

Certain roles may be eligible for benefits and other compensation.

Microsoft will accept applications for the role until July 18th, 2025.

Responsibilities

Model Optimization & Deployment: Train, distill, and fine-tune models efficiently (e.g., LoRA, QLoRA, instruction tuning). Design and implement scalable solutions for deploying Large Language Models (LLMs) and Small Language Models (SLMs) in heterogeneous production environments, considering performance, cost, and latency constraints. Optimize language model inference using optimized inference engines (e.g., vLLM), quantization (e.g., AWQ, GPTQ), and other model compression techniques.

Prompt Engineering & Workflow: Develop LLM prompts, agents, and query execution workflows, often with tight latency constraints.

Natural Language Processing (NLP): Investigate and implement state-of-the-art methods for NLP tasks, including but not limited to information extraction and semantic understanding.

Contribute to internal knowledge sharing, best practices, and potentially external publications.