The shift toward the consumption of IT as a service, i.e., the cloud, is one of the most important changes to happen to our industry in decades. At IBM, we are driven to shift our technology to an as-a-service model and to help our clients transform themselves to take full advantage of the cloud. With industry leadership in analytics, security, commerce, and cognitive computing and with unmatched hardware and software design and industrial research capabilities, no other company is as well positioned to address the full opportunity of cloud computing.
We are looking for a dynamic, curious, high-potential, deep technical leader who desires a broadening assignment with the opportunity to deliver substantial business value to IBM. A leader, who innovates & shares our passion for winning in the cloud marketplace. The The ideal candidate should be strong in ‘getting things done’, have an entrepreneurial spirit, communicates well, have a great deal of energy, and enjoy working as part of a global collaborative team. Candidates for this position will need strong Delivery and Execution skills, to help projects at all phases overcome technical obstacles. In this role, you will be responsible for setting the direction for operations,to deliver value to our clients in a fast-changing cloud landscape. The SRE team is dedicated to ensuring that the IBM Cloud is at the forefront of cloud technology, from Storage & Network architecture and compute clusters to flexible infrastructure services. We are building IBM’s next generation cloud platform to deliver performance and predictability for our customers’ most demanding workloads
Hire and develop high performing technical talent with a particular focus on delivering operations and SRE solutions.
During Incidents to lead multiple service teams to a fast resolution and return to BAU for customers.
Manage Site Reliability Engineers including team’s day to day operation, all quarterly reviews, evaluations, and career development.
Drive the team to establish comprehensive development plans and innovative solutions to problems and challenges that meet desired outcomes.
Allocate and balance resources across multiple platforms to meet needs of business priorities
Provide management oversight to several activities running in parallel, address issues/concerns with speed, and enable coarse corrective actions.
Report on operating status to program stakeholders as needed
Performs other duties as required
Develop, implement, and monitor day-to-day operational systems and processes
Enhance all existing monitoring solutions and implement robust monitoring for all platforms globally
Analyze current operational processes and performance, recommending solutions for improvement where necessary
Managing critical customer issues, this requires on going communication with Service SRE, development, and customer support teams
Lead the team with integrity and to establish and maintain a trusting, inclusive, and productive environment
.
Exposure to team leadership and operational excellence
Technical Skills – Good hold on Cloud technologies in Networking, Storage and Compute.
Experience with managing team of team size 15-20 with varied skills like SREs and developers,
Expert in Agile and Scrum Methodology.
Excellent leadership and management skills with emphasis on mentoring, motivating, and driving a large team to success.
Managing critical customer issues, this requires on going communication with Service SRE, development, and customer support teams
Experience using Splunk and or other dashboards
Understanding of web technologies and technology stack
Working knowledge with Network and Storage technologies
Working knowledge with ServiceNow, JIRA, Confluence, and GitHub
Preferred Technical and Professional Expertise
You love collaborative environments that use agile methodologies to encourage creative design thinking and find innovative ways to develop with cutting edge technologies
Ambitious individual who can work under their own direction towards agreed targets/goals and with creative approach to work
Intuitive individual with an ability to manage change and proven time management
Proven interpersonal skills while contributing to team effort by accomplishing related results as needed
Up-to-date technical knowledge by attending educational workshops, reviewing publications
Working knowledge & experience with Networking /Storage/ Databases in the Cloud