Expoint - all jobs in one place

Finding a high-tech job at the best companies has never been easier

Limitless High-tech career opportunities - Expoint

Citi Group Site Reliability Engineer 
Canada, Ontario 
84514816

22.04.2025
Responsibilities:
  • Work with highly motivated SRE and Support team members to design, implement and maintain monitoring solutions using tools like Grafana, Kibana and AppDynamics.
  • Contribute to projects and sprints that firm up SLOs/SLIs for Developer pipeline applications, ensuring high availability and performance.
  • Proactively identify bottlenecks in application performance and help implement solutions.
  • Use your knowledge of Python/JavaScript/Go to write scripts that automate and streamline operational tasks.
  • Be curious and open to working on projects related to Generative AI and task automation, which will be a major focus of 2025.
  • Take ownership over complex problems and work with the team in driving resolution.
  • Look for proactive solutions, but don’t be afraid to get stuck in when incidents demand it.
  • Analyze system logs and metrics to identify the root cause of issues.
  • Effectively communicate technical details to both technical and non-technical audiences.
  • Work collaboratively with development, operations, and other teams to ensure alignment and smooth execution.
  • Contribute to a culture of continuous improvement and knowledge sharing.
As an SRE, you will:
  • Get exposed to the cutting-edge technology in use at Citi, a market leader, such as Tekton and Harness involving OpenShift.
  • Gain hands-on experience in Gen AI implementation and in creating automation models, from idea inception to product delivery.
  • Learn the ground-up process of building observability for the supported applications and be part of the team that designs SLOs/SLIs to improve application performance.
Skills Required:
  • Proficiency in Python, Java or Node.js preferred.
  • A good understanding of the software development lifecycle and pipeline management.
  • Basic understanding of observability principles and SLOs/SLIs.
  • Experience in using monitoring tools like Grafana, Kibana, Prometheus, and AppDynamics.
  • Basic working knowledge of data visualization tools like Tableau.
  • Understanding of Agile concepts and related processes.
  • Working or theoretical knowledge of OpenShift, Tekton and Harness pipelines.
  • Excellent problem-solving and analytical skills.
  • Strong communication and collaboration skills.
Applications Support


Time Type:

Full time
