Our team is looking for a Principal Group SWE Manager to help build, lead and grow a strong customer-centered engineering team. You will work with Azure CXP Program Managers and Data Scientists as well as other Azure engineering teams and the Field, Marketing and Support organizations to define and deliver critical, customer-facing features and the tools, infrastructure and end-to-end solutions required for all the rapidly expanding programs in Azure CXP. You will get the opportunity to actively participate in the hiring and building of your own team and the rest of the org. Please find the details of the responsibilities below.
• Guides engineers within and across teams in producing extensible and maintainable code. Optimizes, debugs, refactors, and reuses code to improve performance, maintainability, effectiveness, and return on investment (ROI).
• Reviews debugging tools, logs, telemetry, and other methods, and acts as an expert for others in verifying assumptions by writing and developing code, both proactively before issues occur and reactively as they occur, across products and multiple teams.
• Guides teams and leads identification of dependencies and the development of design documents for a product, application, service, or platform. Leads identification of other teams and technologies that will be leveraged, how they will interact, and when one's system may provide support to others.
• Guides team in creating a clear, well-articulated plan for testing, and defines success for test outcomes.
• Guides others through efforts and discussions for architecture of aspects of products/solutions (e.g., design, cost). Creates proposals for architecture and design documents, and leads testing of hypotheses and proposed solutions.
• Acts as an expert and guides team experimentation to determine the effectiveness of changes. Monitors developments in prototyping and testing across products and multiple teams, interprets results, and decides on next steps or whether to ship based on those results.
• Guides team to drive multiple groups' project plans, release plans, and work items in coordination with appropriate stakeholders (e.g., project managers).
• Acts as an expert to others for deployment to appropriate environments. Establishes standards for the correct measures to deploy products.
Reliability and Supportability:
• Guides team and leads efforts to collect, classify, and analyze complex data on a range of metrics (e.g., health of the system, where bugs might be occurring).
• Guides team and acts as an expert for Designated Responsible Individual (DRI) and monitors other engineers across product lines, working on call to monitor system/product/service for degradation, downtime, or interruptions.
• Designs, integrates, and reviews others' work across a team or product to add instrumentation for gathering telemetry data on system behavior such as performance, reliability, availability, utility, and safety mechanisms.
• Acts as an expert for others' operations of live service as complex issues arise on a rotational, on-call basis. Reviews systemic issues and ensures solutions are put in place.
Engineering Excellence:
• Guides team and leads efforts to ensure the correct processes are followed to achieve a high degree of security, privacy, safety, and accessibility across solutions and teams.
• Identifies skills needed and ensures engineering team's skills remain current by investing time and effort into staying abreast of current developments.
• Guides the decision-making process around tool development. Oversees resourcing of tool development and reuse within the team. Ensures the team uses open-source tools and reuses them as applicable.
Understand User Requirements:
• Guides partnership with appropriate stakeholders