What you will do:
Write and support Red Hat Ansible playbooks
Design and write automation software for provisioning, upgrading, and monitoring
Troubleshoot system issues; identify single points of failure and other high-risk architecture issues; propose and implement more resilient solutions
Work within a small agile team to develop and improve SRE software, support your peers, plan and self-improve
Contribute code to increase the scalability and reliability of the service
Contribute software tests and participate in peer review to increase the quality of our codebase
Collaborate with peer groups across the company to ensure consistent standards-based implementations and practices
Manage, deploy, and operate cloud solutions at scale
Help and develop peers through knowledge sharing, mentoring and collaboration
Participate in a regular on-call schedule, including occasional paid weekends and holidays
Interface with internal and external security/audit teams to ensure all storage systems are secured against internal and external threats
Provide technical expertise for system design, installation, configuration, performance tuning, capacity planning, troubleshooting, and problem resolutions
Create and maintain standard operating procedures (SOPs) for performing maintenance tasks, applying configuration changes and remediating problems in our environment
What you will bring:
Development experience with the ability to write scripts (shell/Python and Ansible)
Experience with source code management (SCM) and configuration management tools like Git and Red Hat Ansible Automation Platform
Understanding of virtualization (Xen/KVM, Red Hat Enterprise Virtualization (RHEV))
Demonstrated working knowledge of cloud infrastructure and container platforms (such as AWS, Google Cloud, Microsoft Azure, Red Hat OpenStack, Docker)
Solid understanding of standard TCP/IP networking and common protocols like DNS and HTTP
Solid communication skills and experience working directly with and presenting to customers
Strong documentation skills, including creating and updating detailed environment design, installation, and support documentation
The following are considered a plus:
Experience with open source infrastructure and storage technologies such as OpenShift, OpenStack, and RHEV
Ability to quickly learn new technologies and follow industry trends
Proficiency using various work planning/process tools such as Jira & ServiceNow
1+ year(s) of experience with Kubernetes or OpenShift
1+ year(s) of experience with Docker-based containers
2-3 years of experience working as a system administrator or system engineer in a Linux environment are preferred