Expoint - all jobs in one place

The point where experts and best companies meet

Limitless High-tech career opportunities - Expoint

Apple Site Reliability Engineer SRE 
United States, California, San Diego 
195750724

Yesterday
In this role, you will help lead our cloud based infrastructure team for Apple’s Video Computer Vision Organization. As a main contributor to our SRE team you will develop and maintain infrastructure, tooling, and engineering services for cloud based applications. You will be responsible for system bringup, deployment, reliability, security and service scalability. This role is highly cross-functional and you will work very closely with various highly skilled software development / ML teams developing cutting edge algorithms.
Your core responsibility is to provide operational support of multiple cloud based applications with an emphasis on deployment, security, scalability and reliability running on AWS and Apple infrastructure. Our technologies include Terraform, Argo, Docker, Python, Postgres, Prometheus, in combination with custom Apple software and tooling. Common technologies you’ll manage include: Kubernetes (eks), Elasticsearch, Redis, RDS, ELB, and other AWS based services. This role will also help drive solutions for hybrid infrastructure (on and off prem) and drive infrastructure architecture for our AWS based cloud platform.
  • Experience building systems both on-premise (data center) and on public cloud (AWS, GCP or Azure welcome).
  • Have deployed and operated schedulers such as Kubernetes, AWS ECS or EKS.
  • Ability to write code in one of many high level languages (Python preferred)
  • BS and a minimum of 3 years relevant industry experience
  • MS in Computer Science/Computer Engineering (or equivalent experience).
  • 5+ years supporting large scale in production applications in an SRE role.
  • 3+ years managing SRE teams and supporting mission critical applications.
  • 3+ years of Hybrid Cloud infrastructure management.
  • Experience with AWS large-scale application deployment and service management through Terraform, Argo, or similar.
  • Expert knowledge of Linux, Python, Docker, Kubernetes, Postgres, Redis, along with operations and monitoring.
  • Professorial approach to working with team members, teaching best practices and leveling up the engineers around you.
  • Be seen as a leader among software development teams, championing collaboration and shared ownership in technology decisions and knowledge transfer within the team.
  • Expertise in networking with an emphasis on security.
  • Working knowledge of deploying microservices and working experience on strategies to support Apple’s scale.
  • Vast experience using Linux with knowledge of kernel/system tuning
  • Last but not least, you are battle-tested and have a few interesting production tales
Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.