Apple Sr Cloud Site Reliability Engineer & AI Data Platforms
United States, California, Sunnyvale 
923396629

06.06.2024
Description
Responsibilities:
  • Focus on automation and on providing insight into infrastructure service reliability and availability through extensible services and platforms.
  • Design, implement, and maintain software and tools for large-scale distributed systems, especially the Big Data stack of technologies such as Iceberg, S3, HDFS, Hive, and Ranger.
  • Operate and deploy container orchestration systems such as Kubernetes and/or YARN.
  • Apply core computer science data structures, algorithms, and software tools in one of the following languages: Python, Golang, Java, or other JVM languages.
  • Manage data pipelines using Kafka, Flink, Spark, Airflow, and Jupyter.
  • Work with platform tools and automation systems, including deployment automation practices, especially across multi-AZ or multi-DC infrastructure, using CM tools such as Saltstack, Ansible, and Terraform.
  • Build and support CI/CD tools to port and manage applications on AWS and Kubernetes.
  • Build automation to enable self-healing systems.
  • Ensure compliance with appropriate security standards.
  • Deploy and debug systems built for horizontally scalable multi-tenant deployments.
Key Qualifications
  • 8+ years of experience in SRE/MLOps.
  • Experience operating and maintaining production systems on Linux and on public cloud infrastructure providers such as AWS (EC2, EBS, S3, Elastic IP, Route 53, IAM).
  • Experience with cloud-native orchestration systems such as Kubernetes, and with enabling autoscaling for both VM and containerized workloads.
  • Strong proficiency with Helm and Kustomize for managing Kubernetes applications and configurations.
  • Good working knowledge of load balancers, firewalls, TCP/IP networking architecture, and core technologies (HTTP, DNS, routing, etc.).
  • Experience with configuration management tools: Ansible/Puppet/Chef/Saltstack.
  • Experience with GitOps or CI/CD tools: Spinnaker/Jenkins/Flux/ArgoCD.
  • Strong programming skills in Unix & Python/Java.
  • Experience with capacity planning, utilization reviews, and performance tuning.
  • Critical thinking and strong debugging and problem-solving skills.
  • Experience in implementing, managing and refining business continuity solutions.
Education & Experience
BS in computer science with 7-10 years of experience, or MS with 5-7 years of experience, or related experience.
Additional Requirements
  • Work closely with multiple cross-functional teams to coordinate effectively and manage business user expectations.
  • Leadership, critical thinking, and excellent verbal and written communication skills.
  • Create new utilities for operational efficiency.
Pay & Benefits
  • At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $170,700 and $300,200, and your base pay will depend on your skills, qualifications, experience, and location. Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.
  • Apple is an equal opportunity employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics.