Finding the best job has never been easier
Share
As part of the EC2 HA team, you will work on highly scalable tools and software services to measure fleet health, identify failure patterns and generate automated health reports. You will work with partner teams to improve existing failure classifications and create new failure classifications. You will use data science techniques to identity spikes in failures across the fleet. You will work to ensure that the failures patterns are root caused and fixed to ensure a healthy AWS fleet. You will drive innovation and development of new tools and services to cover new operational and health metrics.Key job responsibilities
Designing and developing cutting edge highly reliable and scalable distributed systems.Mentoring other engineers
A day in the life
You will use data analytics and various large data sets to efficiently detect and root cause EC2 server and instance failures
You will exercise the highest bar for security in both code and operations.Seattle, WA, USA
- 5+ years of non-internship professional software development experience
- 5+ years of programming with at least one software programming language experience
- 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience as a mentor, tech lead or leading an engineering team
- Bachelor's degree
- Experience programming with at least one modern language such as C++, C#, Java, Python, Golang, PowerShell, Ruby.
- Proficiency in Computer Science fundamentals such as object-oriented design, data structures, algorithm design, problem solving, and complexity analysis
These jobs might be a good fit