Key job responsibilities
- Design and implement large-scale systems processing petabytes of data daily
- Build and maintain high-quality, thoroughly tested software solutions
- Develop real-time monitoring and analysis capabilities

A day in the life
You'll work in an agile environment, designing and implementing solutions that operate at Amazon scale. This could involve:
- Building real-time data processing systems that analyze service health
- Developing mechanisms to surface and prevent reliability risks
- Contributing to systems that process and analyze logs from thousands of services
- Enables root cause analysis within minutes of issues occurring
- Processes and analyzes massive amounts of log data daily
- Creates tools to help services maintain availability under any conditions
- Develops frameworks for assessing and improving service resilience
- Builds systems to ensure safe deployment of code and configuration changes
- Provides actionable insights for improving service reliability

Learn more about the wider organisation here:
- Bachelor's degree or equivalent
- Experience programming with at least one modern language such as Java, C++, or C# including object-oriented design
- Experience contributing to the architecture and design (architecture, design patterns, reliability and scaling) of new and current systems