Expoint - all jobs in one place

מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר

Limitless High-tech career opportunities - Expoint

Amazon Software Development Manager AWS Incident Tooling & Response 
Ireland, Dublin 
494788076

18.11.2024
DESCRIPTION

AWS Resilience owns service that prevent and respond to availability and security issues for all AWS Services. In other words, we’re the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent — and we’re looking for talented people who want to help.
You’ll join a diverse team of software, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security and availability. You’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.As a Software Development Manager on the team, you will manage automated tooling roadmaps and delivery for the detection and resolution of issues within AWS infrastructure. You will work closely with the team managing the incident response and with leadership to gather new requirements. Based on learning from past incidents you will drive further improvements into our automation, tooling, and processes so that the next event is shorter or avoided entirely. You will coordinate across project teams to expand use of our tooling to additional areas across Amazon. If you're looking for a team with great growth potential and an opportunity to make a huge impact, this is the team to join.Key job responsibilities
Define and Deliver Business Priorities
You will be a key contributor and owner of the direction of the AWS Incident Management team. You will define, plan, track and deliver on strategic goals for the team, while ensuring that the team remains unblocked and focused.Cross-Site, Cross-Team Coordination
You will be responsible for coordinating with your counterparts and sister teams to ensure that a clear communication channel exists between AWS Incident tooling and Response teams. You will also work closely with the alarming systems to create and maintain a proper end to end experience from detecting, alarming to mitigating incidents.Performance Management/Team Health
You will own all facets of performance and career management for the team. You will ensure the operational load of your team remains manageable and as minimal as possible.