The point where experts and best companies meet
Share
What you will do:
Lead and grow a team of SREs maintaining the overall health of OpenShift hosted properties
Own the health, reliability and availability of OpenShift hosted properties
Provide coaching, oversight and escalation support to the regional team of SREs
Ensure that incidents are managed and resolved quickly, and that retrospectives and root-cause analysis is completed within expected timelines
Oversee the creation and maintenance of knowledge article and standard operating procedures (SOPs) for performing maintenance tasks, applying configuration changes, and remediating problems in the environment
Manage regional shift schedules, ensuring 24x7 resource availability
Participate in sprint planning and release cycles of SRE tooling
Schedule maintenance windows, considering customer and SRE resource requirements
Coordinate with teams across the organization to reduce operational friction and automate wherever possible
Resolve customer issues in cooperation with Red Hat's global customer support team
What you will bring:
1+ years experience managing engineering teams
Must be comfortable managing distributed, remote staff
Ability to understand and discuss deep technical issues with engineers
Demonstrated experience with contemporary project management methodologies such as Agile, kanban and / or scrum
1+ years of experience with cloud providers such as Amazon Web Services (AWS), Google Compute Engine (GCE), or Microsoft Azure
1+ years of experience with Kubernetes is a plus
These jobs might be a good fit