Expoint - all jobs in one place

Finding a high-tech job at the best companies has never been easier

JFrog Site Reliability Engineer 
Dominican Republic, Santo Domingo 
553445121

27.03.2025
As a Site Reliability Engineer at JFrog you will…
  • Help build and manage scalable, reliable services and infrastructure for JFrog SaaS services
  • Drive the reliability, performance, and availability of our SaaS products, ensuring service-level objectives are met or exceeded
  • Apply SRE best practices, including incident management, performance and capacity planning, and disaster recovery flows
  • Adhere to the incident management framework, ensuring timely identification, escalation, and resolution of incidents
  • Develop and manage large-scale systems with CI/CD in mind, to support multiple production environments and use cases
  • Tackle large-scale production issues and bring out-of-the-box thinking to the table
  • Implement SRE tools, technologies, and methodologies that help meet JFrog’s SaaS uptime and reliability goals
To be a Site Reliability Engineer at JFrog you need...
  • 2+ years of relevant DevOps or SRE experience in large-scale production environments
  • 1+ years of infrastructure automation, configuration management, or container orchestration using Kubernetes, Docker, Terraform, and Ansible
  • 1+ years in Python or any other advanced programming language
  • Excellent communication and collaboration skills, with the ability to work effectively across globally distributed teams
  • Experience in managing container and infrastructure orchestration tools (e.g. Kubernetes, Terraform)
  • Hands-on experience administering public clouds (AWS, GCP, or Azure)
  • Experience with building CI/CD pipelines for applications and microservices (Jenkins/ArgoCD)
  • Experience with chaos engineering, alerting, and observability tools (Gremlin, PagerDuty, Opsgenie, New Relic, Coralogix)