Using your expertise in SRE principles of automation and continuous improvement, you will help create an environment where availability, reliability, and security are built in throughout the application lifecycle rather than treated as an afterthought. As an SRE, you will build tooling to automate the building, testing, deployment, promotion, monitoring, alerting, and maintenance of the Red Hat Ansible Managed Application on Azure and Ansible as a Service on AWS.
What You Will Do:
Develop and maintain software to automatically provision, upgrade, monitor, and heal Red Hat Ansible Automation Platform managed applications in Azure.
Write Ansible automation playbooks to reduce toil.
Support the operations of Red Hat Ansible Automation Platform by responding to and troubleshooting system alerts.
Provide engineering support to Red Hat's global technical support team to resolve customer issues.
Perform root cause analysis on outages and work with Ansible Automation Platform engineering teams to improve the overall cloud offering.
Participate in a global on-call rotation, including periodic weekend and holiday on-call duties.
Required Skills:
Software development experience using a general-purpose language. Experience with Python or Go is a plus.
Linux administration experience. Experience with Red Hat Enterprise Linux (RHEL), CentOS, or Fedora is a plus.
Kubernetes administration experience.
Understanding of computer networking including DNS.
Basic knowledge of software development life cycle tools, like GitHub and Jenkins.
Familiarity with the software development life cycle (SDLC) and agile or scrum processes.
Excellent written and verbal communication skills in English.
Experience supporting a customer-facing service.
Basic knowledge of monitoring systems.
Passion for learning new technologies, building elegant software systems, troubleshooting complex technical issues, and automation.
Experience with the following is considered a plus:
Writing Ansible playbooks and administering the Red Hat Ansible Automation Platform
Familiarity with data center networking and routing protocols
Cloud native development/administration experience (Azure preferred)
Operations experience with a production user-facing application
Prior experience working on a globally distributed, remote team
Open source software (OSS) contribution
Microsoft Azure and Azure Resource Manager Templates