About the role
In this role, you will support our live-streaming pipeline team and day-to-day live-streaming operations for Netflix. As a Live Streaming Pipeline SRE, you will be responsible for the reliability of our live streaming pipeline (transmission, encoding, packaging, origin). Instrumenting end-to-end observability and visualizing the data to achieve the desired availability at scale. Working with cross-functional teams in the preparation, validation, and execution of live streaming-focused initiatives. You will impact multiple areas of the live event lifecycle, from the planning phase through testing and event launch days. You will be leading innovation initiatives, driving new features that will enhance our live streaming services, encoding & content delivery.
Responsibilities:- Drive continual improvement in resilience, observability, monitoring, instrumentation, and automation with the primary goal of maintaining highly scalable and reliable services worldwide.
- Implement, automate, execute, and analyze the results from a broad range of live streaming delivery-focused functional, performance, resilience, and fault injection testing.
- Coordination, collaboration, and partnership across multiple stakeholders for the smooth execution of live-streaming events.
- Aggregate, analyze and correlate large amounts of server and application performance data. Use the innovative Netflix Big Data platform as a highly flexible, specialized, and efficient toolset for service delivery optimization and system reliability improvements.
- Participate in an on-call rotation and be able to work with flexible hours based on the live events schedule.
Qualifications:- 5+ years service reliability/operational experience running large scale, high performance systems & internet services with focus on live-streaming and video-on-demand (VOD) delivery.
- Experience with video transport protocols such as RTP, RTMP, SRT, UDP, Zixi, RIST, HLS, MPEG-DASH.
- Knowledge of and proven experience with HTTP cache/proxy technologies. Experience supporting live-streaming delivery at scale.
- Expert-level knowledge of Unix or Linux system engineering fundamentals (networking, storage, operating systems) at scale.
- Proficient understanding of networking principles, transport, and application protocols, especially TCP/IP, BGP, DNS, TLS, and HTTP/S.
- Experience with using distributed analytic processing technologies (Hive, Presto/Trino, Spark SQL, etc).
- Proficient in a programming language such as Python or Go.
- Ability to work in a highly collaborative environment and to communicate effectively with internal and external partners.
- Preferred - B.S. in Computer Science, Electrical or Computer Engineering (or equivalent professional experience).
Our compensation structure consists solely of an annual salary; we do not have bonuses. You choose each year how much of your compensation you want in salary versus stock options. To determine your personal top of market compensation, we rely on market indicators and consider your specific job family, background, skills, and experience to determine your compensation in the market range. The range for this role is $100,000 - $720,000.