Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

Citi Group Site Reliability Engineer Hybrid 
United States, Texas 
101004077

26.07.2024

Overview of the Role


- Experience in SRE principles like SLOs, Error Budgets, Resiliency, Toil Reduction, Chaos Engineering, and more

- Experience with Cloud stacks such as Docker, Kubernetes, Openshift, PCF or AWS
- Experience writing code in Java, Go, Shell, Python, Unix Shell Scripting or a similar language.
- Experience with Sql/NoSql databases like Oracle, MongoDB, etc.
- Experience in Config Management tooling e.g. Ansible, Chef, Puppet or SaltStack
- Competent with API, web services and microservices development
- Strong analytical, algorithmic, and problem-solving skills
- Ability to quickly learn new concepts and software

- Understanding of GenAI concepts

Responsibilities:

  • The SRE provides technical and business support for users of Citi Applications. This includes providing quick resolutions, driving stability, efficiency and effectiveness improvements to help us and the business succeed.

  • Focusing on stability, quality and functionality of Citi’s tech stack against service level expectations.

  • Ability to conduct blameless postmortem, develop executive briefings, assess major incident impacts and drive service improvements to prevent repeat of an incident.

  • Understanding of GenAI, Artificial Intelligence, Machine Learning, predictive analytics, etc.

  • Practice Resiliency Engineering of our eco system using tools such as Chaos Monkey, Gremlin, etc.

  • Ability to take accountability and drive initiatives including presenting Infront of leadership

  • Implementing SLOs, Error Budget and actionable alerts such as auto scaling, self-healing, etc.

  • Drive Toil Reduction and Automation

Qualifications:

  • 10+ years experience in an Application Support role.

  • 3+ experience in Site Reliability Engineering Role

  • Experience with some programming languages and willingness/ability to learn.

  • Effective written and verbal communications including ability to explain technical issues in simple terms that non-IT staff can understand.

  • Demonstrated analytical skills

  • Ability to plan and organize workload

  • Bachelor’s/Universitydegree in Computer Science or equivalent experience

  • Certification in Site Reliability Engineer, Sales Force or Cloud Based Certification like AWS or Google is a plus

Applications Support

Full timeIrving Texas United States$125,760.00 - $188,640.00


Anticipated Posting Close Date:

Jul 30, 2024

View the " " poster. View the .

View the .

View the