Your Role and Responsibilities
Key Responsibilities:
- Maintain highly available products and services in the cloud
- Identify issues and drive them to resolution, ensuring minimal downtime
- Automate repetitive tasks using scripts and tools to reduce manual intervention
- Collaborate with development teams – roll out new services, ensure stability and reliability
- Improve operational practices, ensure efficiency and innovation
- Share knowledge, ideas and solutions with the global team
Required Technical and Professional Expertise
- Experience in a large-scale, distributed Linux/Unix environment
- Understanding of containerization technologies
- Experience with maintaining and scaling Kubernetes-based applications on cloud infrastructure
- Experience with scripting and automation (Bash, Python, Go, Jenkins, Ansible)
- Familiarity with cloud platforms (IBM Cloud, Amazon Web Services, Microsoft Azure)
- Strong debugging and problem-solving skills
- Passion for building and maintaining reliable and resilient systems
- Basic understanding of networking
Preferred Technical and Professional Expertise
- Go/Python development skills
- Understanding of cloud storage and networking
- Experience with Infrastructure as Code
- Experience with any source version control system
- Experience with observability (e.g., Prometheus, Grafana, Sysdig)