Expoint - all jobs in one place

The point where experts and best companies meet

Limitless High-tech career opportunities - Expoint

Microsoft Regional Data Centre Incident Manager 
Australia, New South Wales, Sydney 
716895298

09.07.2024

Qualifications:

  • Bachelor's Degree in Engineering, or related field AND 8+ years technical experience in critical environments, network engineering, service engineering, or systems engineering OR equivalent experience.
  • 5+ years technical experience working with large-scale data center operations or distributed systems.
  • 5+ years people management experience.
  • Other Engineering Certifications.

Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to, the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter. #COICareers

Responsibilities:

Holds teams accountable for managing crisis situations, including leveraging advanced technical expertise, judgment, and decision making to coordinate multiple work streams and resources in crisis situations to drive mitigation plan and resolve crisis by engaging necessary teams and escalating to appropriate stakeholders. Applies diagnostic expertise. Provides guidance to other engineers working to mitigate and resolve issues. Communicates customer impact and other relevant information with key stakeholders, leadership, and customers.

  • Manages teams of engineers to implement reliable, scalable, and high-performance solutions across teams.
  • Guides teams to stay current in knowledge and expertise as the technology landscape evolves, maintaining awareness of industry norms.
  • Develops team's end-to-end technical expertise, regularly identifying skill gaps and raising the collective bar on the team's skill set in alignment with industry standards.
  • Proficient knowledge of Critical Infrastructure within Global Data Centers
  • Holds the team accountable for creating, monitoring, and taking action on telemetry data and provides guidance on telemetry analytics to better identify patterns that reveal errors and unexpected problems that are affecting the system availability, reliability, performance, and/or efficiency.
  • Contributes to developing processes and standards to address complex issues and provides guidance to others as needed.