Expoint - all jobs in one place

The point where experts and best companies meet

Limitless High-tech career opportunities - Expoint

Red hat Senior Site Reliability Engineer 
New Zealand 
652281243

07.07.2024

What you will do

The day-to-day responsibilities of an SRE involve working with live systems and coding automation. As an SRE you will be expected to:

  • Contribute code to increase the scalability and reliability of the service

  • Contribute software tests and participate in peer review to increase the quality of our codebase

  • Help and develop peers’ capabilities through knowledge sharing, mentoring, and collaboration

  • Participate in a regular on-call schedule, including occasional paid weekends and holidays

  • Practice sustainable incident response and blameless postmortems

  • Resolve customer issues escalated from the Red Hat Global Support team

  • Work within a small agile team to develop and improve SRE software, support your peers, plan and self-improve

What you will bring

  • A bachelor's degree in Computer Science or a related technical field involving software or systems engineering

  • Experience programming in at least one of these languages: Python, Golang, Java, C, C++ or another object-oriented language.

  • Experience working with public clouds such as AWS, GCP, or Azure.

  • Experience troubleshooting an as-a-service offering (SaaS, PaaS, etc.) Have the ability to collaboratively troubleshoot and solve problems in a team setting.

  • Experience working with complex distributed systems.

  • Direct experience with Kubernetes or OpenShift is a plus.

  • Demonstrated ability to debug, optimize code and automate routine tasks.

  • Basic understanding of Unix/Linux operating systems.

Desired skills

  • 5+ years of experience managing Linux servers running Red Hat Enterprise Linux (RHEL), CentOS, or Fedora hosted at a cloud provider such as Amazon Web Services (AWS), Google Compute Engine (GCE), or Microsoft Azure

  • 3+ years of experience with enterprise systems monitoring; knowledge of Prometheus is a plus

  • 3+ years of experience with enterprise configuration management software like Ansible by Red Hat, Puppet, or Chef

  • 2+ years of experience programming with at least one object-oriented language; Golang, Java, or Python are preferred

  • 2+ years of experience delivering a hosted service

  • Demonstrated ability to quickly and accurately troubleshoot system issues

  • Solid understanding of standard TCP/IP networking and common protocols like DNS and HTTP

  • Solid communications skills and experience working directly with and presenting to customers

  • 1+ year(s) of experience with Kubernetes is a plus

  • 1+ year(s) of experience with docker-based containers is a plus