We seek a highly experienced and motivated SRE Manager to lead a team of 4 Site Reliability Engineers. You will play a crucial role in keeping our workforce-enabling products and services reliable and efficient while coordinating with cross-functional teams across various geographical regions. You have a proven track record of leading top-performing teams in complex, fast-paced environments, and you excel at organizing and motivating a team amidst rapid growth and change.
RESPONSIBILITIES
You will lead, mentor, and develop a team of 4 SREs, fostering a culture of collaboration, innovation, and continuous improvement.
You will communicate effectively with stakeholders at all levels, providing updates on team performance, project status, and incident resolutions.
You will ensure an appropriate balance between the reactive work of incident management and the proactive work of reducing future issues.
You will develop and implement strategies to improve the reliability, performance, and scalability of the products and services supported by the SRE team.
You will collaborate with cross-functional teams (engineering, product, and operations) to drive critical projects and initiatives.
You will influence and improve our incident management lifecycle to identify, mitigate, and learn from reliability risks.
You will oversee the design, implementation, and maintenance of monitoring, alerting, and incident response systems.
You will ensure the team follows best practices in infrastructure as code, continuous integration/deployment (CI/CD), and system automation.
You will cultivate and maintain high-trust relationships with internal and external partners.
You will advocate for the SRE team within the broader organization, representing their needs and concerns.
WE VALUE
Curiosity about how complex socio-technical systems successfully operate at scale when failure is inevitable
People who see influence as their preferred tool for cultivating relationships and helping the organization improve
Collaboration and continuous improvement as fundamentals for growing the team’s impact over time
A desire to learn and readiness to mentor others both within and outside of the team
SKILLS AND EXPERIENCE
Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent practical experience)
Proven success in leading high-performing SRE or DevOps teams in a large-scale, fast-paced environment
Outstanding communication and interpersonal skills, with the ability to build strong relationships with team members and stakeholders
Strong technical background with hands-on experience in cloud computing, system architecture, automation, and monitoring
Excellent problem-solving skills with a focus on root cause analysis and proactive improvements
Exceptional organizational skills, with the ability to manage multiple priorities and projects simultaneously
Experience with tools and technologies such as AWS, Kubernetes, Terraform, Prometheus, Grafana, Jenkins, and similar tools