Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

Apple Cloud Monitoring SRE 
United States, California, Cupertino 
935185189

04.04.2024
Key Qualifications
  • Minimum 5+ years of handling services in a large scale environment.
  • Strong sense of ownership and integrity demonstrated through clear communication and collaboration
  • Experience and confidence around incident response and incident management
  • Experience in managing and scaling distributed systems in a public, private, or hybrid cloud environment
  • Experience with the Prometheus ecosystem
  • Practical experience in Python, bash scripting. Theoretical knowledge of Go, Java, and/or Scala.
  • Acute drive to automate manual operations and to improve them through repeated iteration
  • Comfortable with Open Source configuration management and orchestration tools (such as Helm, Puppet, and Spinnaker)
  • Experience with deploying, supporting and monitoring new and existing services, platforms, and application stacks
  • Familiarity with micro-services architecture and container orchestration with Kubernetes
  • Expertise in Software Design and Development
  • Responsibilities:
  • You will perform deep dives into both systemic and latent reliability issues; partner with software and systems engineers across the organization to produce and roll out fixes.
  • You will drive standardization efforts across multiple disciplines and services in conjunction with embedded SREs throughout the organization.
  • You will participate in code reviews for projects primarily written in Python, Java, and Scala, built on open source product such as FiloDB, and running on virtual and containerized platforms.
  • You will represent the SRE organization in design reviews and operational readiness exercises for new and existing services.
  • Use of configuration management and deployment tools
  • Monitoring of systems and services, optimization of performance, and resource utilization
  • Runbook implementation for everyday maintenance actions
  • Incident response, diagnosis, and follow-up on system outages or alerts
  • Collaborating with a global and asynchronously communicating team (don’t worry if you have never worked remotely; we’ll help you get used to it)
Education & Experience
B.S. in computer science or similar field or equivalent experience.
Pay & Benefits
  • At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $138,900.00 and $256,500.00, and your base pay will depend on your skills, qualifications, experience, and location.Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.