Expoint - all jobs in one place

מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר

Limitless High-tech career opportunities - Expoint

Microsoft Senior Site Reliability Engineer 
United States, Washington 
828695982

01.05.2024

ive deep into global scale service issues and drive improvements toresiliency, availability,latency, and product reliabilityYou willbe responsible forYou will work closely with other engineering teams anda holistic view of our cloud service.Provide excellent technical leadership, raise the technical bar,data and


Required/Minimum Qualifications

  • Bachelor's Degree in Computer Science, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    • OR equivalent experience.
  • 4+ years of troubleshooting/debugging experience: telemetry-based analysis (KQL or equivalent preferred), troubleshooting skills across network, hardware, and distributed service layers, withdemonstratedability to debug, fix, andoptimizecode.
  • 3+ years of experience with writing tools, automation / scripting (Powershell, Python or similar), programming (C++, C# or equivalent) and making enhancements in subcomponents within and around services/products to deliver and manage software in production.
  • Willing to work as part of a 24x7 on-call rotation.

Additional or Preferred Qualifications

  • Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python

    o OR Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python

    o OR equivalent experience.

    • Communicate effectively and partner well with other disciplines of the project team to deliver high quality solutions from ideas

    • Understanding of how to implement high availability, disaster recovery, and business continuity concepts in online services.
    • Experience aiding understanding of distributed systems and networking is preferred.
    • Effectively manage and prioritize multiple tasks in accordance with high level objectives/projects


Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:

Responsibilities
  • As a Senior Site Reliability Engineer you will be part of the Reliability & Resilience team dedicated to driving measurable improvement in service reliability and reducing the negative customer impact of outages through avoidance or mitigation.
  • Delivering projects that improve resiliency and security of the service.
  • Right mix of systems engineering, data science, software development, on-line servicesexperience,and passion for quality to envision.
  • Demonstrated experience with Azure services and capabilities (and/or other cloud platforms like AWS)
  • Good knowledgeofARM artifacts andindustry standarddeployment methodologies.
  • Own availability, performance, and supportability targets for the service.
  • Proactively seeks new knowledge and adapts tonew trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products.
  • Participate in on-call rotations and own, triage, investigate and resolve service issues with an emphasis on broad communications, learning & teaching throughout the process.
  • Evaluate and contribute to service design and architecture to improve the resiliency of the cloud service.
  • Author functional and technical documentation and remain current on relevant technologies and procedures.
  • Bring clarity, create energy, and drive results – set a vision, rally the team behind it, and deliver for our engineers and