Finding the best job has never been easier
Share
Amazon is looking for a highly motivated Principal Systems Development Engineer to drive technical operational efficiency across AWS. This role will tackle intrinsically hard problems, venturing beyond comfortable approaches when necessary. You will learn, educate, and advocate, acquiring expertise as needed, pioneer new spaces, and inspire others as to what’s possible.This is a highly visible role that requires constant learning and working across AWS. We need engineers who can balance being hands-on while also working with senior management to help shape the strategic direction to improve operations across AWS.A day in the life
You’ll balance your time between operating production systems and making long-term improvements to the reliability, availability, and performance of those software systems. An example week could look like: Monday you provide meaningful feedback on the most critical upcoming change whilst guiding the most senior technical talent in your organization to make more decisions without you. Tuesday you identified a major reliability risk in the interplay between systems in your care and designed a cohesive solution. On Wednesday you lead the design review with the relevant technical leaders, receiving consensus on a path forward. Thursday, you influenced your senior management to take goals and make investments to achieve that outcome. Friday, you begun developing part of that system which would have the most impact on the reliability of the overall system.
* Requirement to participate in On-Call rotation.
* Fluency in written and spoken English is required.
* Successful applicants must have the legal right to work in Germany.
* Amazon will provide relocation support for successful applicants relocating within the European Union.
* 10+ years of experience in software development or related field
* Experience operating and troubleshooting reliable, scalable software systems
* Proficient in at least one modern programming language such as Java, Typescript, Python, or Ruby
* Able to troubleshoot at all levels, from network to operating systems to software applications
* Highly Proficient in operating 24x7 high-availability, distributed software applications
* Desire to dive deep into, and find opportunities to improve, the reliability, availability, and performance of distributed software systems.
* Experience leading strategic team efforts requiring work from multiple team members
* Experience actively mentor other engineers
* Experience performance tuning software applications and optimizing fleet utilization
* Strong understanding of network fundamentals (DNS, DHCP, TCP/IP, routing, load balancing, load shedding)
* Proficient with Infrastructure as Code, (such as CDK, CloudFormation, Puppet, Chef, Ansible, or similar)
* Proficient with operating services in AWS
* Experience with monitoring frameworks (such as CloudWatch, Datadog, Grafana, Elastic or similar)
* Experience scripting operating system tasks in Bash, Python, etc.
These jobs might be a good fit