As a Site Reliability Engineer (SRE) you will:
- Adopt new tools and practices that give the team deeper insight into system and service behavior, performance, and reliability
- Strive for and enable a proactive approach to incident management, alerting, and recommended actions to reduce the risk of failure
- Leverage artificial intelligence (AI) and machine learning (ML) technologies
- Apply the latest security standards in our day-to-day work and software solutions
- Build and maintain automation code and tools
- Build observability based on user experience
- Collaborate closely with development teams and other technical and non-technical teams in an international environment to achieve commitments
- Be part of the weekend and overnight on-call rotation when a technical expert is needed during live-site situations (downtime or performance degradation of services in scope)
What you bring:
The following skills and competencies will bring you closer to meeting with us:
- BSc degree in Computer Science, Software Engineering, Telecommunications, or a related technical area
- Experience as a DevOps engineer or SRE, or in another relevant position
- Good understanding of cloud architecture and/or cloud platforms (AWS, Azure, GCP, AliCloud)
- Experience with Linux/Unix
- Ability to work efficiently in critical situations and an aptitude for quickly analyzing and solving problems
- A real team player who is self-motivated and a continuous learner
- Excellent communication skills
- Fluency in English
Knowledge of or experience in any of the following areas will be considered an advantage:
- Programming languages such as Python, Bash, Go, etc.
- Monitoring, logging, and visualization tools such as Dynatrace, Kibana, Grafana, etc.
- CI/CD tools such as Jenkins and Concourse
- Git, GitHub, Terraform, and databases
- Kubernetes
- Experience working in an Agile environment
- Experience with OpenStack