At least 10+ years of prior demonstrated experience in a Site Reliability Engineering, DevOps, or an Infrastructure-focused role including 3+ years of experience managing or leading the team
Proficiency in one or more programming languages (eg. Java, Python), SRE technologies relevant to automation, monitoring and incident response.
Experienced in leading and managing high performance SRE teams.
Proven track record in managing complex SRE projects, enterprise services at a large scale
Excellent communication and interpersonal skills, ability to effectively communicate with cross functional teams, stakeholders and leadership teams.
Building and operating container orchestrating systems like Kubernetes or EKS.
Strong programming experience in Java building web, middleware or backend applications.
Deep understanding of Oracle or similar relational databases and NoSQL databases such as MongoDB.
Firsthand experience in performance tuning of applications and databases.
Knowledge of HTTP/S, TCP, DNS, web application load balancing.
Deep understanding of security concepts and protocols - authentication, authorization, signing, encryption, SSL/TLS, SSH/SFTP, PKI, X509 certificates and PGP.
Automation advocate - Passion for automation and reluctance for manual implementation
Desire to build, grow, and mentor a team
A strong sense of ownership. At the same time, you're a great teammate who communicates clearly and transparently
Self-motivated, inquisitive, and always looking to learn more.
Experience managing, scaling, and troubleshooting Java applications
Bachelor or Masters in Computer Science or other related discipline.