המקום בו המומחים והחברות הטובות ביותר נפגשים
Job Category
Software EngineeringJob Details
Responsibilities:
Identify and analyze recurring technical themes across incidents, releases, and problem management data. Recommend improvements across multiple clouds and advocate for systemic changes.
Drive engineering teams to memorialize recommended improvements into their roadmaps and deliver sustainable, impactful solutions.
Define and prioritize paved path development functions for the Site Reliability Engineering organization, ensuring these efforts align with the greatest potential impact.
Conduct reviews of high-impact incidents and problems to ensure appropriate levels of remediation across platforms and services.
Collaborate with service owners to drive root cause mitigation, corrective actions, and incident detection improvements.
Foster an environment of proactive reliability and resilience, ensuring all platforms meet high standards of technical excellence.
Communicate effectively across technical and executive audiences, advocating for necessary changes and championing cross-cloud collaboration.
Minimum Qualifications:
10+ years of engineering experience, including a focus on reliability engineering, post incident analysis.
Proven experience driving systemic technical improvements across platforms and teams in large-scale, distributed systems.
Experience with various architectures and platforms; and proficient in both windows and linux/unix,debuggers/understandingstacktraces, architectural patterns.
Strong communication and leadership skills, with a track record of influencing and driving change across engineering and business organizations.
Experience with incident analysis, root cause identification, and defining technical remediation strategies.
Extensive knowledge of service reliability, observability practices, and availability metrics.
Familiarity with development in object-oriented programming languages (e.g., Python, Java) and experience with cloud-based architecture.
A related technical degree required.
Preferred Qualifications:
Experience leading cross-functional initiatives to implement technical improvements.
Expertise in incident management processes and operational excellence practices.
Hands-on experience with data analysis and visualization tools to drive technical insights, specifically SQL, Big Data, NoSQL, Memstores/memcache.
Check out ourwhich explains our various benefits, including wellbeing reimbursement, generous parental leave, adoption assistance, fertility benefits, and more.
Check out our
If you require assistance due to a disability applying for open positions please submit a request via this.
Posting Statement
משרות נוספות שיכולות לעניין אותך