

Responsibilities:
· Design, develop, and modify automation processes to deploy large-scale, enterprise-grade applications.
· Partner with R&D, Product Management and Architecture teams to deliver key operational metrics and feedback that support high product availability.
· Investigate reported and suspected outages and other issues.
· Oversee documentation of issues and resolutions. Maintain knowledge base of known issues.
· Produce, oversee production of, and maintain documentation, such as architecture diagrams, system designs, and runbooks.
· Work alongside fellow SREs, DevOps Engineers, and developers to solve problems and investigate anomalies.
· Participate in an on-call rotation to help maintain our 99.95% uptime SLO.
Requirements:
· 7+ years of experience working with AWS is required, with a proven track record of building complex infrastructure.
· Strong experience with networking, containerization (Kubernetes) and Linux system administration.
· Deep understanding of Infrastructure as Code (Terraform, Ansible, CloudFormation, etc.).
· Strong experience with Kubernetes, preferably with AWS EKS.
· Experience with scaling and maintaining high-availability production systems.
· Experience with DevOps release, versioning, build management, automation scripts, and CI tools such as GitHub Actions.
· Excellent verbal and written communication skills in English.
· Ability to guide the team and provide best practices as a DevOps Tech Lead.
· Must be a U.S. citizen.