Expoint - all jobs in one place

מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר

Limitless High-tech career opportunities - Expoint

Microsoft Site Reliability Engineer II 
India, Karnataka, Bengaluru 
572254926

10.12.2024

Job Purpose:

Running software as a service means more than just developing and releasing features. Ensuring reliability and serviceability is critical part of software cycle. This is where you come into the picture. As a Site Reliability Engineer, you will ensure the service of Azure SQL Database or Managed Instance runs smoothly with required reliability and availability. You will design and implement software to automatically resolve issues. You will work closely with feature teams to design, implement and release features that are reliable and serviceable. You will be a cross-domain expert who has a holistic view of our cloud service

Qualifications
Qualifications

Required Qualifications:

  • 3+ years of software development or SQL product support experience.
  • 3+ years of using debugging tools such as Windbg, Visual Studio and Xperf to debug user dumps or live applications.
  • Demonstrated troubleshooting skills in SQL Server/Azure SQL Database with deep understanding in one or more of the following areas:
  • Query Processing
  • Storage Engine
  • SQL Operating System (SQL OS) layer (threading, scheduling, and memory management)
  • High Availability
  • Replication
  • Connectivity
  • Deep understanding of Windows Operating System level concepts such as processes, threading, memory allocation, and the network stack; understanding of how applications are affected by the above, and ability to debug same.
  • Demonstrated influence outside their own immediate team without managerial authority
  • Experience in working with multiple teams, including coordinating large projects, communicating well, and exercising initiative when presented with problems.
  • Proficient programming skills using managed code such as C#/Java. Ability to read native C/C++ code to debug issues and find answers not documented.
  • BS/MS in Computer Science, Engineering and/or equivalent industry experience is preferred.

Preferred Qualifications:

  • Troubleshooting experience in Azure SQL Database or Managed instance.
  • Industry experience in software development with automation-related experience.
  • Source control software such as git.
  • Practical experience running large scale online systems is always an advantage.
  • Familiarity withx86 and x64 assembly languages and Managed IL
  • Familiarity with CodeQL, Code Coverage and other quality capabilities.
  • Knowledge of various memory management designs and applications.
  • Skilled with troubleshooting approaches such as a Bayesian network.
  • Practical Powershell, Java, Python development experience.
  • Experience with:
  • AI and Machine Learning skills
  • Live event streams
  • State machines, Event and Health ModelsMulti-machine deployment operations.
  • Large scale testing
  • Kusto query language
  • Data Warehousing and Reporting
Responsibilities
Responsibilities
  • Act as subject matter expert for configuring, troubleshooting, and monitoring Azure Database/Managed Instance services.
  • Identify opportunities and implement automation to resolve and reduce live-site incidents.
  • Design and implement solutions to improve service health, manageability, reliability, and telemetry.
  • Design and implement diagnostics infrastructure to improve product quality and customer satisfaction.
  • Own, triage, investigate, and resolve service issues with emphasis on broad communications, learning, and teaching throughout the process.
  • Author and maintain functional and technical documentation. Define and maintain process and procedures to run enterprise service.
  • Interact with customers as result of escalation from support for issues including performance and availability.
  • Mentor, develop and deliver training.
  • Ability to meet on-call responsibilities periodically.