Expoint - all jobs in one place

מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר

Limitless High-tech career opportunities - Expoint

Cyberark Senior Site Reliability Engineer 
United States 
848864609

Yesterday

What you will do:

  • Management of AWS infrastructure components such as VPCs, EC2, EKS, S3, tagging schemes, CloudFormation, etc.

  • Deployment and management automation of cloud-based infrastructure and software

  • Working with configuration management tools in both Windows and Linux - Terraform, Ansible, CloudFormation

  • Ensuring cloud-based architecture meets availability and recoverability requirements

  • Architecture and implementation of cloud-based monitoring, alerting and reporting – Datadog, CloudWatch, ELK, Grafana

  • Develop tools to enable teams for greater output and reliability.

  • Develop and enforce SRE best practices by defining and tracking key reliability metrics such as SLOs, SLIs to ensure system resilience and operational excellence

Qualifications
  • B.S. in Computer Science or equivalent experience

  • Minimum 2 years of experience managing AWS infrastructure

  • Minimum of 5 years of experience with systems engineering and software development

  • Solid understanding/experience of containerization services such as Docker

  • Working knowledge of open-source tools such as Terraform, Grafana, Logstash, Elasticsearch, Ansible

  • Solid understanding/experience of web services, databases and relating infrastructure/architectures

  • Solid understanding of backup/restore best practices

  • Strong level of expertise programming in Java / Python/Golang or equivalent language

  • Excellent Troubleshooting Skills

  • Experience supporting an enterprise-level SaaS environment

  • Security Experience a plus