Proven experience developing production-grade software in Python, Go, or Java.
Strong sense of ownership and integrity demonstrated through clear communication and collaboration
Experience and confidence around incident response and incident management
Experience/knowledge in managing and scaling distributed systems in a public, private, or hybrid cloud environment
Experience/knowledge with the Prometheus ecosystem
Acute drive to automate manual operations and to improve them through repeated iteration
Comfortable with Open Source configuration management and orchestration tools (such as Helm, Puppet, and Spinnaker)
Familiarity with micro-services architecture and container orchestration with Kubernetes
Master’s degree in Computer Science or a related field is preferred.
Demonstrated ability to investigate complex systemic and latent reliability issues and collaborate cross-functionally with software and systems teams to implement sustainable solutions.
Experience automating workflows and reducing operational toil through scalable solutions.
Use of configuration management and deployment tools
Monitoring of systems and services, optimization of performance, and resource utilization
Collaborating with a global and asynchronously communicating team.
Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.