Expoint – all jobs in one place
המקום בו המומחים והחברות הטובות ביותר נפגשים
Limitless High-tech career opportunities - Expoint

Microsoft Site Reliability Engineer II 
Taiwan, Taoyuan City 
267112473

21.05.2025

Required Qualifications:

  • 4+ years technical experience in software engineering, network engineering, or systems administration
    • OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in software engineering, network engineering, or systems administration
    • OR Master's Degree in Computer Science, Information Technology, or related field.
  • 2+ years of experience with managing reliability of mission critical workloads which requires coordinating with number of partners and teams
  • 2+ years of experience with Networking and Network Protocols

Other Qualifications:

  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:
    • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Preferred Qualifications:

  • 5+ years technical experience in software engineering, network engineering,
    • OR systems administration
    • OR Bachelor's Degree in Computer Science, Information Technology,
    • OR related field AND 2+ years technical experience in software engineering, network engineering,
    • OR systems administration
    • OR Master's Degree in Computer Science, Information Technology,
    • OR related field AND 1+ year(s) technical experience in software engineering, network engineering

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:Microsoft will accept applications for the role until June 3, 2025.


Responsibilities
  • Work closely with product engineering to ensure that the right set of service capabilities are being built to manage the service end to end. Examples include deployment systems, diagnostic capabilities and run time operational insights into key service behaviors.
  • Identify monitoring gaps and drive implementation.
  • Consume and extend telemetry using queries, dashboards, alerts to monitor reliability.
  • Be a part of on-call rotation and monitor all customer reported incidents (CRI), triage them, participate in root-cause analysis, track monitoring gaps, help drive work to ensure these incidents are auto-detected in the future and have reduced time to mitigation and resolution.
  • Coordinate large scale fleet wide maintenance and updates using safe deployment practices. Identify impact of these system changes, coordinate closely with customer facing teams and customers directly to plan maintenance windows and downtime.
  • Work with customer support team for updated trouble shooting guides.
  • Work closely with 3rd party HW vendors and appliance providers to ensure quality and reliability of systems provided to Microsoft.