
Microsoft Senior Software Engineer - AI Search 
Taiwan, Taoyuan City 
192384739

16.10.2025

The AI Search team is leading the way to deliver the next chapter in retrieval-augmented generation by combining knowledge with agents at scale, to meet the demands of complex queries that require reflection, among other capabilities, to deliver high-quality results for both people and LLMs. Our work has many key areas:

  • Designing, building, and maintaining backend services with external and internal language model dependencies.
  • Partnering with Applied Science to support and drive product evolution at scale.
  • Working directly with GPUs to support production ML workloads at scale.
  • Evaluating production integrations to ensure end-to-end AI quality of Applied Science deliverables.

You can follow our progression from better search, to Retrieval-Augmented Generation (RAG), to agentic retrieval.

A major aspect of this role is to push our development of knowledge retrieval to the frontier. This requires a capable individual who is well versed in ML model GPU hosting integrations and LLM context engineering. Someone who can solve the hardest problems. Someone who understands where the latest reasoning models are best suited and can support the integration of low-latency options in places where speed is critical.

Required Qualifications

  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    • OR equivalent experience.
  • 2+ years of experience in development of Azure Services with an understanding of service release and live-site responsibilities.
  • 6+ months of experience developing large language models (LLMs).
  • 2+ years of experience developing GPU-based models and model hosting.

Other Requirements

  • Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings:
    • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred Qualifications

  • Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    • OR Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    • OR equivalent experience.
  • Coding in C# AND Python in a production system.
  • Coding in C++ or Java in a production system.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:

Responsibilities

  • Design, build, and maintain Azure backend services and associated APIs.
  • Partner with Applied Science to bring high-quality prompts, ML models, and pre- and post-processing components into production in a secure, reliable, and scalable way.
  • Work directly with GPUs to support production ML workloads at scale. Requires knowledge of Azure Kubernetes Service (AKS) and Triton GPU containers.
  • Provide evaluation tooling and support for production integrations to ensure end-to-end AI quality of Applied Science deliverables.
  • Contribute to team plans, documents, and communication in a clear and efficient way.