המקום בו המומחים והחברות הטובות ביותר נפגשים
You Are
Your Role and Responsibilities
• Manage deployments of Apptio services to AKP
• Streamline the deployment process
• Improve observability of the services within your purview by reviewing KPI dashboards and alerting
• Mentor junior to mid-level engineers
• Author and maintain documentation of deployment and monitoring processes
• Write and use runbooks to troubleshoot and triage production issues
• Detect issues and handle Tier 3 troubleshooting
• Drive online “swarm” collaboration sessions
• Collaborate with service developers
• Participate in on-call rotation
• Perform maintenance of the platform (patching, resets, upgrades, etc.)
• Operate independently and own end-to-end delivery of solutions
• Have significant input in the product roadmap and be able to articulate effectively the benefits of alternative technologies
Required Technical and Professional Expertise
• 5+ years’ experience in an SRE or adjacent role
• Functional understanding of at least one programming language and source control (Preferably• Expertise with distributed application deployment and management via Kubernetes
• Expertise with container technologies (e.g., Kubernetes, Docker)
• Expertise with Infrastructure-as-code (IaC) concepts (Terraform)
• Expertise with cloud provider services, preferably AWS
• Ability to work with RESTful systems and their APIs
• Familiarity with observability (e.g., Prometheus, Open telemetry)
• Demonstrated fluency with the English language skills
Preferred Technical and Professional Expertise
משרות נוספות שיכולות לעניין אותך