Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

IBM Site Reliability Engineer 
Ireland 
131092159

08.05.2024

Your Role and Responsibilities

In this Site Reliability Engineer role, you will work closely with the Data Center, the entire Cloud development organization and IBM vendors to support, maintain and operationally improve the cloud infrastructure. Your focus will be the following key responsibilities:

•Support the compliance and security integrity of the environment through your work
• Partner with other teams, functional managers, and program managers to deliver mission-critical services to the market
• Support development of new and existing capabilities for our compute, storage and network services
• Work with Engineering to:
o Define operational requirements
o Automate operational requirements
• Work with Support and Development to:
o Identify and resolve issues
o Discuss and plan integration requirements

Required Technical and Professional Expertise
Experience in hands-on production administration of large system environments, including virtual platforms.
• Experience in establishing, following, and improving operational procedures within a mission critical environment
• Experience in data center infrastructure or relevant work experience
• Experience in virtualization technologies e.g. VMWare, KVM, VirtualBox.
• Experience in large-scale infrastructure design, engineering, and support
• Experience in IT Change, Incident, Problem, Asset management
• Must be efficient in writing, debugging and maintaining scripts (Bash and Python)
• Must be extremely comfortable using and navigating within a Linux environment
• Ability to do low level debugging and problem analysis by examining logs and running Unix commands
• Experience with configuration management systems (Ansible / Chef)
• Hands on knowledge of using Splunk or ELK
• Must have the ability to perform debugging and problem analysis by examining logs and running Unix commands
• Must have experience in dealing with bringing incidents to resolution and leading a group during the troubleshooting
• Working knowledge with Network and Storage technologies
• Working knowledge with ServiceNow, JIRA, Confluence, and GitHub
• Excellent written and verbal communication skills
• Comfortable operating in fast paced environment

  • Strong Experience in Microservices (Kubernetes, Docker)


Preferred Technical and Professional Expertise

• 2+ years of experience with GitHub, Perl and Python
• 2+ years of experience in virtualization environments such as AWS /Softlayer/Zen/VMWARE