Your Role and Responsibilities
The infrastructure running industries like transportation, energy, insurance, banking, or healthcare is quickly changing as the world’s relationship with technology evolves. Companies have more choices than ever before between on-premises, off-premises, or a hybrid approach.
As an Infrastructure Specialist, you will be responsible for keeping up with the latest and greatest of these changes and for using your expertise to deliver cloud solutions that meet the needs of our customers and products.
Your primary responsibilities include:
- Responsible for the day-to-day operations of the IBM Cloud Bare Metal environment. This includes in-depth understanding and analysis of customer escalations, root-cause analysis, and the development of action plans to be presented to senior management.
- Understanding of 8D, FMEA and other accepted industry quality processes.
- Accountable for contributing to the lifecycle management of the server environment and the subsystems/components that support the Bare Metal offerings.
- Establish strong working relationships with key IBM stakeholder organizations to support the creation of Bare Metal analytics and failure rates for servers and components.
- Take ownership of identifying top issues impacting customers and server reliability.
- Work within a strong team environment to drive root-cause analysis and corrective action for those top issues.
- Manage assigned projects aimed at improving diagnosability and MTTR to reduce overall resolution time.
- Use communication skills to effectively convey the status of committed projects and escalations to Compute Operations management.
- Partner with other teams, functional managers, and program managers to enable the delivery of mission-critical infrastructure services to our target market.
- Support the Bare Metal Operations team in efforts to develop new and enhance existing capabilities for the IBM Bare Metal infrastructure.
Required Technical and Professional Expertise
- 10+ years working in a high-performance engineering team
- 10+ years of experience in data center infrastructure or relevant work experience
- 10+ years of experience in large-scale infrastructure design, engineering, and support
- 10+ years of experience in IT Change, Incident, Problem, and Asset management
- 10+ years of working knowledge of one or more operating systems: RHEL, CentOS Linux, or Windows Server
- Working knowledge of server monitoring technologies
- Working knowledge of networking is a plus: routing, VLAN, firewall/ACL, load balancing (Citrix SDX/VPX a big plus), etc.
- Working knowledge of ServiceNow, JIRA, Confluent, and GitHub
Preferred Technical and Professional Expertise
- In-depth understanding and working knowledge of server technologies
- Working knowledge of how virtualization, network, and storage technologies work in data center and cloud environments
- Demonstrated experience with root-cause analysis, the VIRT resolution process, and process improvement
- Experience with design and development of complex systems
- Working knowledge of ServiceNow, JIRA, Confluent, and GitHub
- ITIL Foundation V4 certification is a plus
- Excellent verbal and written communication skills
- Highly responsible, motivated, and able to work with little direction
- Ability to troubleshoot complex problems and customer issues