As a Site Reliability Engineer, you will be responsible for providing the platform that allows mission-critical cloud systems to maintain constant uptime, scale seamlessly, and support new applications and services as they flourish.
Minimum Qualifications
In-depth experience in a Site Reliability Engineering, DevOps, or infrastructure-focused role
Expert-level, in-depth professional experience working with Kubernetes
Experience operating large-scale, multi-tenant infrastructure as a managed service
Ability to troubleshoot issues across the entire infrastructure stack
Ability to implement and coordinate telemetry using monitoring and observability tools such as Splunk, Grafana, and Prometheus
Outstanding organizational and communication skills
Preferred Qualifications
Proficiency in Go (Golang)
Knowledge of the Linux operating system and its distributions
Experience with GitOps, deployment strategies, and CI/CD tools such as Spinnaker and Argo