Required/Minimum Qualifications
The ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings:
Preferred/Additional Qualifications
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
Microsoft will accept applications for the role until July 18, 2025.
Model Optimization & Deployment: Train, distill, and fine-tune models efficiently (e.g., LoRA, QLoRA, instruction tuning). Design and implement scalable solutions for deploying Large Language Models (LLMs) and Small Language Models (SLMs) in heterogeneous production environments, considering performance, cost, and latency constraints. Optimize language model inference using tools and techniques such as vLLM, quantization (e.g., AWQ, GPTQ), and model compression.
Prompt Engineering & Workflow: Develop LLM prompts, agents, and query execution workflows, often with tight latency constraints.
Natural Language Processing (NLP): Investigate and implement state-of-the-art methods for NLP tasks, including but not limited to information extraction and semantic understanding.
Contribute to internal knowledge sharing, best practices, and potentially external publications.