Partner with the best
As a Site Reliability Engineer II, you will be responsible for:
- Providing ongoing operational support for complex distributed applications.
- Solving complex problems promptly and avoiding recurrence through proactive troubleshooting, and automation systems programming.
- Providing guidance to engineers and developers to increase confidence that their services are performing as expected
- Monitoring, investigating, and analyzing performance and availability by (co)designing, managing, and tracking product-related SLIs/SLOs
- Working closely with product engineers to advocate reliable and scalable system design for supportability, resilience and reliability
- Leveraging skills in data analysis, network diagnostics and debugging tools to characterize performance and recommend improvements
Do what you love
To be successful in this role you will:
- Have 2 years of experience and a Bachelor's Degree in Computer Science or its equivalent experience
- Show experience in scripting or procedural languages (Python, Perl, Shell, C/C++, Java, etc.)
- Show fluency working in a UNIX/Linux computing environment
- Have experience with Unix/Linux.
- Be familiar with infrastructure-as-code tools such as Terraform and have NoSQL/Cassandra experience.
- Have proficiency with a configuration management tool such as Ansible, Salt Stack, Chef, Puppet, or similar
- Have experience with continuous integration / continuous deployment tools such as Jenkins, Git hub Actions, or similar
Learn more
Not sure if this job is the right match for you or want to learn more about the job before you apply? Schedule a 15-minute exploratory call with the Recruiter and they would be happy to share more details.