5+ years of software development or production operations experience in a large-scale environment
Proficiency in authoring and releasing code in Go, Python, or Java using common configuration management and software delivery platforms
Experience operating production applications at scale, including well-designed performance testing, HA and disaster recovery concepts, capacity planning, and managing distributed systems on internal and public cloud infrastructure, principally Kubernetes
Understanding of the Linux operating system, containers and virtualization, and standard networking protocols and components
Strong sense of ownership and integrity demonstrated through clear communication and collaboration
Excellent troubleshooting and problem-solving skills grounded in the scientific method
Proficiency with the architecture, deployment, performance tuning, and troubleshooting of open-source data analytics and data governance technologies, especially Apache Spark, Flink, Hive, Hadoop/HDFS, or other related software
The successful candidate is frustrated with toil and has an acute drive to automate manual operations and evolve them into fully automatic processes