Ensure availability and responsiveness of application by setting up and maintaining the required documentation method and tools
You will provide expertise and insights for project engineering teams and advise on best approaches to solve for avoiding infrastructure and security challenges.
Define roadmaps and milestones for devOps tasks in support of multiple projects handled on a monthly and quarterly basis
Handle resolution of blockers, escalation to stakeholders, and provisioning of resources
Meet with stakeholders and internal teams to communicate and agree on plans and manage notifications when issues arise
Document plans for maintenance, schedules and status to leadership team and stakeholders
Manage Ansible, Jenkins, Tekton and other CI/CD solutions
Diagnose environmental issues and introduce/implement technologies to solve them
Provision and maintenance of DevOps Infrastructure for projects
Monitor and support of platform infrastructure and manage escalations
Look for enhancements and innovative solutions to help the services scale and improve existing technical support tools, procedures, or processes.
Develop troubleshooting techniques to effectively identify and investigate issues and provide advice and guidance to clients
Work in a global team, collaborating with IBMers to share recommendations, solutions and ideas
Potential on-duty rotation including weekend and holiday support as needed basis
Required Technical and Professional Expertise
8-12 years of relevant industry experience
Minimum of 2 years of experience in a DevOps Developer or Engineer role or similar.
Minimum of 3 years of SRE experience
Strong experience in cloud deployment and deployment of monitoring capabilities
Proficiency in scripting and Python programming language
Experience in analytics and interactive visualization platforms like Grafana
Strong understanding of CI/CD pipelines and tools like Tekton
Experience in Terraform modules.
Experience with cloud-based services
Working knowledge with ML Ops and maintenance
Proficiency in containerization technologies (e.g., Docker, Kubernetes, OCP).
Experience in Monitoring and operating Kubernetes clusters
Understanding of networking and security concepts
Self-starter, organised with self-learning skills, ability to work independently
Great communication skills, self-managed and a team player
Ability to work effectively as part of a worldwide, agile development team.
Fluent English
Preferred Technical and Professional Expertise
Skills for implementation, operations and maintenance of DevOps environnent – MLOps, DevOps, UX Design
Experience in deploying and maintaining services and pipelines in IBM Cloud containerized environments.
Experience in automating the deployment and scheduling of micro- services