Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

Amazon Site Reliability Engineer Managed Operations 
Germany, Berlin 
306136673

10.06.2024
DESCRIPTION


A day in the lifeOver the course of a week, this could look like; Monday morning you root caused why some deployments recently failed, and in the afternoon, you made fixes for those bugs. Tuesday and Wednesday you executed a highly sensitive time critical change to production. Thursday and Friday you were developing software with your team to remove humans from the loop on problems like you worked on over the previous two days, driving a common source of error out of the system and improving its reliability.About the team
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.Work/Life Balance
Berlin, BE, DEU

BASIC QUALIFICATIONS

- Able to participate in an 24x7 oncall rotation
- 3+ years of experience in software development or related field with proficiency in at least one modern programming language such as Java, Typescript, Python, or Ruby
- Experience operating and troubleshooting reliable, scalable software systems
- Able to troubleshoot at all levels, from network to operating systems to software applications
- Successful applicants must have the legal right to work in Germany


PREFERRED QUALIFICATIONS

- Excellent communication and problem-solving skills across languages
- Experience operating 24x7 high-availability, distributed software applications and performance tuning software applications and optimizing fleet utilization
- Strong understanding of network fundamentals (DNS, DHCP, TCP/IP, routing, load balancing, load shedding) and experience with monitoring frameworks (such as CloudWatch, Datadog, Grafana, Elastic or similar)
- Experience scripting operating system tasks in Bash, Python, etc. and with Infrastructure as Code, (such as CDK, CloudFormation, Puppet, Chef, Ansible, or similar)
- Experience operating services in AWS