Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

IBM Intern Conversion - Site Reliability Engineer 
Canada, Ontario, Markham 
192969521

09.09.2024
Start dates for this position are in 2025

As a Site Reliability Engineer, you will work in an agile, collaborative environment to build, deploy, configure, and maintain systems for the IBM client business. In this role, you will lead the problem resolution process for our clients, from analysis and troubleshooting, to deploying workarounds or fixes.
Your primary responsibilities include:

  • Deployment and Configuration: The process of installing Couchbase Enterprise software and configuring buckets for the offerings.
  • Cluster Management and Troubleshooting: Activities involved in troubleshooting issues with clusters and determining appropriate hardware configuration for them.
  • Security and Compliance Implementation: Implementing security measures such as setting up certificates and ensuring compliance with ITCS-104, ISO, SOC standards, and associated regulations.
  • Maintenance and Support: Tasks related to applying Couchbase security patches and upgrades, supporting Cassandra and Mongo for pager duty rotation, and collaborating with Couchbase Product support for issue resolution.


Required Technical and Professional Expertise

  • Availability and Flexibility: Willingness to work in shifts or support 24 x 7 coverage as per the business needs.
  • Couchbase, Mongo, and Cassandra: Excellent knowledge of Couchbase, with a solid foundation in Mongo and Cassandra.
  • Linux Proficiency: Exceptional knowledge of Linux operating systems.
  • Operation and Support Experience: Demonstrated experience in handling day-to-day operations, alert management, incident support, migration tasks, and break-fix support.
  • Shell Scripting and Ansible Skills: Good knowledge of shell scripting and the Ansible configuration management tool.


Preferred Technical and Professional Expertise

  • Kubernetes/OpenShift: Strongly preferred experience in working with production Kubernetes/OpenShift environments.
  • Change Management Expertise: Experience with change management workflows.
  • ELK/EFK Stack Familiarity: Experience with the ELK/EFK stack, which includes ElasticSearch, Logstash/Fluentd, and Kibana.
  • Distributed Event Streaming Platform Experience: Experience with platforms such as Kafka.
  • SQL and NoSQL Datastore Experience: Experience with SQL and/or NoSQL datastores, including DB2 and Oracle data services.
  • Application Load Balancing Concepts: including F5 and ELB.