Overview of the Role
- Experience in SRE principles like SLOs, Error Budgets, Resiliency, Toil Reduction, Chaos Engineering, and more
- Experience with cloud stacks such as Docker, Kubernetes, OpenShift, PCF or AWS
- Experience writing code in Java, Go, Python, Unix shell scripting or a similar language
- Experience with SQL/NoSQL databases like Oracle, MongoDB, etc.
- Experience in Config Management tooling e.g. Ansible, Chef, Puppet or SaltStack
- Competence in API, web service and microservice development
- Strong analytical, algorithmic, and problem-solving skills
- Ability to quickly learn new concepts and software
- Understanding of GenAI concepts
Responsibilities:
The SRE provides technical and business support for users of Citi Applications. This includes resolving issues quickly and driving stability, efficiency and effectiveness improvements to help us and the business succeed.
Focusing on stability, quality and functionality of Citi’s tech stack against service level expectations.
Ability to conduct blameless postmortems, develop executive briefings, assess major incident impacts and drive service improvements to prevent repeat incidents.
Understanding of GenAI, Artificial Intelligence, Machine Learning, predictive analytics, etc.
Practice Resiliency Engineering of our ecosystem using tools such as Chaos Monkey, Gremlin, etc.
Ability to take accountability and drive initiatives, including presenting in front of leadership
Implementing SLOs, Error Budgets and actionable alerts, along with automated responses such as auto scaling, self-healing, etc.
Drive Toil Reduction and Automation
Qualifications:
10+ years of experience in an Application Support role.
3+ years of experience in a Site Reliability Engineering role
Experience with some programming languages and willingness/ability to learn.
Effective written and verbal communications including ability to explain technical issues in simple terms that non-IT staff can understand.
Demonstrated analytical skills
Ability to plan and organize workload
Bachelor’s/University degree in Computer Science or equivalent experience
Certification in Site Reliability Engineering, Salesforce or a cloud-based certification such as AWS or Google is a plus