Expoint - all jobs in one place

Microsoft Principal Software Engineer - AIOps 
United States, Washington 
562982253

10.09.2024

With the rise of Artificial Intelligence (AI), cloud computing, microservices, and containerization, traditional monitoring tools and manual processes are no longer sufficient to ensure performance, availability, and security. AIOps leverages machine learning, analytics, and automation to enable faster detection, diagnosis, and resolution of issues, as well as proactive prevention of problems before they occur. By analyzing large volumes of data from multiple sources, AIOps can identify patterns, anomalies, and correlations that would be difficult or impossible for humans to detect. This can help cloud engineers improve efficiency, reduce downtime, and enhance the overall user experience. In this AI and cloud era, AIOps and AI-driven monitoring systems are becoming increasingly important for today's complex and dynamic cloud service systems.

Brain is designed to detect health issues and automatically triage, diagnose, and mitigate them. It was first created to solve Azure and Microsoft cloud health and monitoring problems. As we continue to make rapid progress for our internal services, we are also starting to make it available to Azure customers. In addition to its intelligent capabilities around anomaly detection, auto triage, and issue prevention, we are also introducing an LLM-based copilot experience to provide on-call engineers with a natural language user interface and a unified intelligence engine to drive AIOps scenarios.

This role is on the Brain Copilot team. You will be responsible for building and scaling the Brain Copilot core engine, a critical and strategic component of Brain. You will lead efforts and discussions for the architecture of aspects of products/solutions. Independently creates, and guides others in creating, a clear and articulated plan for testing and assuring the quality of solutions, and defines success for outcomes of tests. Leads identification of dependencies and the development of design documents for a product, application, service, or platform. Creates and applies metrics to drive the quality and stability of code, as well as appropriate coding patterns and best practices. Holds accountability for the impact of build deployments on both users and other services. Leads efforts to collect, classify, and analyze complex data and analyses on a range of metrics. Leads the identification of internal tools and creation of new tools that will be useful for creating the product, and evaluates whether methods are still applicable for the current solution. Demonstrates an up-to-date understanding of both global and local regulations for technologies and system applications to ensure regulations are followed and met. Escalates situations internally to resolve complex blockers. Additionally, you will be expected to stay up to date with the latest trends and technologies in AIOps, LLMs, cloud computing, and related fields, and apply them to Brain Copilot as appropriate. This candidate will have a demonstrated technical background in software engineering, machine learning, and large distributed cloud services, as well as leadership, communication, and interpersonal skills.

Microsoft Azure Edge + Platform (E+P) is a globally distributed team of engineers, architects, program managers, product managers, business program managers, business administrators, user experience researchers, and designers who are responsible for the platform for Microsoft and for delivering Microsoft’s edge vision. We create the most reliable and trustworthy OS and platform services to empower Microsoft and our customers to achieve more. We unlock the next wave of opportunity at the edge through an at-scale ecosystem driving widespread adoption of our Microsoft cloud services.

Required Qualifications:

  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, or Python OR equivalent experience.
  • 5+ years of hands-on experience leading architecture for large-scale systems, designing and coding mission-critical, scalable distributed services, and running live sites.

Other Requirements

Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings:

This position is required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred Qualifications:

  • Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 10+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.

Software Engineering IC5 - The typical base pay range for this role across the U.S. is USD $137,600 - $267,000 per year. A different range applies to specific work locations within the San Francisco Bay area and New York City metropolitan area, where the base pay range for this role is USD $180,400 - $294,000 per year. Certain roles may be eligible for benefits and other compensation.

Microsoft will accept applications for the role until September 12, 2024.


Responsibilities

As a Principal Software Engineer - AIOps, you will be responsible for the following:

  • Leads the development of architecture and design documents and determines the technologies that will be leveraged and how they will interact.
  • Leads design discussions with the team and shares findings/learnings from investigations, holding ownership for design decisions.
  • Creates, implements, optimizes, debugs, refactors, and reuses code to establish and improve performance and resilience, maintainability, effectiveness, and return on investment (ROI).
  • Creates and applies metrics to drive the quality and stability of code, as well as appropriate coding patterns and best practices.
  • Holds accountability as a Designated Responsible Individual (DRI), working on call to monitor the system/product/service for degradation, downtime, or interruptions. Leads efforts to reduce incident volume, looking globally at incidents and providing broad resolutions. Escalates issues to appropriate owners.
  • Remains current by investing time and effort into staying abreast of current developments.
  • Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products, while also driving consistency in monitoring and operations at scale, and shares knowledge with other engineers.
  • Leads efforts to ensure the correct processes are followed to achieve a high degree of security, privacy, safety, and accessibility across solutions and teams.
  • Creates and assures the presence of visible evidence to demonstrate compliance for products.
  • Develops and maintains a deep understanding of the implications of onboarding new technologies following expectations of compliance at Microsoft.
  • Defines and develops standardized, repeatable, scalable solutions to guarantee quality. Identifies best practices and coding patterns and provides deep expertise in the coding and validation strategy.
  • Leads by example and mentors others to produce extensible and maintainable code used across products.