Finding the best job has never been easier
Share
You will collaborate and work closely with engineering teams to continually improve our production services, facilitating fast delivery of new products, and reducing downtime.
Responsibilities
• Drive Site Reliability Engineering agenda to improve availability, reliability, and performance of services
• Drive observability for our applications.
• Drive optimise operate initiative, example, reduction of operation toil
• Work with application teams in setting up SLI, SLO and Error budget for their applications
• Work with enterprise team in deploying SRE enablers/initiatives.
• Minimum of 3 years technology experience (preferably in the financial industry)
• Bachelor’s in Computer Science, a related technical field that involves programming, or equivalent practical experience.
• Experience in one or more of the following: Java Script, Java and Python.
• Experience with APM system as ELK, Grafana, Prometheus, Dynatrace and AppDynamics, etc
• Understands key SRE concepts such as Toil, SLI, SLO, Error Budgets, MTTD, MTTR, etc
• Strong, committed, and reliable team player, able to take direction but also willing to contribute to discussions on design and strategy.
• Possess strong interpersonal and communication skills to be able to deal with and form good relationships with other technology teams through day-to-day support and project work
These jobs might be a good fit