The point where experts and best companies meet

MongoDB Senior Site Reliability Engineer
Germany, Berlin, Berlin
331163343

24.06.2024

Responsibilities

Design and build the infrastructure for a global cloud service that comprises hundreds of thousands of MongoDB clusters, processes a billion metrics per day, and replicates tens of billions of database writes to our backup service
Design, implement, and troubleshoot the automation and monitoring of services that seamlessly spans the globe - including several cloud providers
Become an expert in infrastructure performance, helping us optimize from the application level all the way through the firmware
Build for resilience. Our goal is that nobody’s pager goes off, ever. Are we there yet? No. Are we really close? Very. While we work on that - participate in a weekly on-call rotation
Improve our infrastructure capabilities, optimizing for cost, simplicity, and maintainability

Requirements

You have experience running a mission critical service at scale
An understanding of information security issues
Prior experience running critical production systems in a Linux environment
Firm grasp of at least one modern programming language, beyond basic scripting
Solid understanding of web and network protocols and standards (HTTP, TLS, DNS, etc)
Bachelor’s degree in Computer Science or equivalent experience
Experience writing automation tools & eagerness to "automate all the things"

Nice to haves

Experience building large applications from scratch, complete with CI/CD infrastructure
Experience in networking, security, hardware or OS performance tuning
Experience with at least one of the major cloud providers (Amazon Web Services, Google Compute, Microsoft Azure)
Experience managing kubernetes clusters or some other container orchestration infrastructure
Experience with observability of large scale distributed systems

What's in it for you

Generous compensation package (top-range salary: we pay in the top 95% percentile and our package includes equity and generous benefits)
Opportunities to learn on the job (time to up skill in new technologies)
High level of independence in your day to day work

These jobs might be a good fit

Amazon Site Reliability Engineer Managed Operations Germany, Berlin

Get to the top of the "yes list" with a standout CV!

CREATE CV