What you’ll do:
- Build, secure, and evolve the platforms that power the applications that make up Vimeo
- Build and maintain tooling that makes manual infrastructure work obsolete and enables self-service for engineers
- Improve the observability and reliability of applications to keep outages to an absolute minimum while reducing MTTA and MTTR
- Write and maintain thorough documentation to share with your teammates around the world, allowing them all to function as a cohesive unit
- Contribute to an internal self-service infrastructure platform used by all engineers for application development and deployment
- Participate in a weekly on-call rotation shared between offices in the US and India, which includes responding to production incidents and providing internal support to other engineers at Vimeo
- Do whatever it takes (within reason) to make Vimeo faster, simpler, more scalable, more reliable, and more efficient to operate
Skills and knowledge you should possess:
- At least six years of professional experience in software development, with high proficiency in at least one general-purpose programming language (C/C++, Go, Java, Ruby, PHP, Python, etc.)
- Deep understanding of the architectural patterns of high-scalability distributed systems
- Significant experience maintaining, optimizing, and administering Kubernetes deployments
- Strong knowledge of container orchestration and Linux system internals
- Significant experience with major cloud providers (Google Cloud, AWS)
- Significant experience deploying and administering MySQL
- Experience with “Infrastructure as Code” platforms such as Terraform
- Experience with observability systems, such as Datadog, Grafana, Prometheus, VictoriaMetrics, OpenCensus, and Graphite
Bonus points (nice to have, but not required):
- Knowledge of ArgoCD, Atlantis, Varnish, Memcached, and/or Chef
- Experience with generalized build systems (make, Bazel, Please, etc.) or language-specific build systems (SWC, Turborepo, etc.)