About the role
In this role, you will support our live streaming pipeline team and day-to-day live-streaming operations for Netflix. As a Live Streaming Pipeline SRE, you will be
responsible for the reliability of our live streaming pipeline (transmission, encoding, packaging, origin). Instrumenting end to end observability and visualizing the data to achieve the desired availability at scale.
Working with cross functional teams in the preparation, validation, and execution of live streaming focused initiatives.You will impact multiple areas of the live event lifecycle, from the planning phase through testing and event launch days. You will be leading innovation initiatives, driving new features that will enhance our live streaming services, encoding & content delivery.
Responsibilities
Drive continual improvement in resilience, observability, monitoring, instrumentation, and automation with the primary goal to maintain highly scalable and reliable services worldwide
Implement, automate, execute, and analyze the results from a broad range of live streaming delivery focused functional, performance, resilience, and fault injection testing
Coordination, collaboration, and partnership across multiple stakeholders for the smooth execution of live-streaming events
Aggregate, analyze, and correlate large amounts of server and application performance data. Use the innovative Netflix Big Data platform as a highly flexible, specialized and efficient toolset for service delivery optimization and system reliability improvements
Participate in an on-call rotation and be able to work with flexible hours based on the live events schedule
Qualifications
5+ years service reliability/operational experience running large scale, high performance systems & internet services with focus on live-streaming and video-on-demand (VOD) delivery
Experience with video transport protocols such as RTP, RTMP, SRT, UDP, Zixi, RIST, HLS, MPEG-DASH
Knowledge of and proven experience with HTTP cache/proxy technologies. Experience supporting live-streaming delivery at scale
Expert-level knowledge of Unix or Linux system engineering fundamentals (networking, storage, operating systems) at scale.
Proficient understanding of networking principles, transport, and application protocols, especially TCP/IP, BGP, DNS, TLS, and HTTP/S
Experience with using distributed analytic processing technologies (Hive, Presto/Trino, Spark SQL, etc)
Proficient in a programming language such as Python or Go
Ability to work in a highly collaborative environment and to communicate effectively with internal and external partners
Preferred - B.S. in Computer Science, Electrical or Computer Engineering (or equivalent professional experience)
Job is open for no less than 7 days and will be removed when the position is filled.
משרות נוספות שיכולות לעניין אותך