Netflix CDN Site Reliability Engineer (SRE) L4/L5
United States, Oregon
161021525

20.03.2025

Work Type

our in-house custom-built network and server infrastructure responsible for

Location

We are hiring for two open positions: one is onsite based in Los Gatos, and the other is open to US Remote.

Responsibilities

Drive continual improvement in resiliency, observability, monitoring, instrumentation, and automation, with the primary goal of maintaining a highly scalable and reliable CDN platform worldwide.
Aggregate, analyze, and correlate large amounts of server and application performance data. Use the Netflix Big Data platform as a flexible, specialized, and efficient toolset to identify opportunities for platform optimization and system reliability improvements, and to spot patterns and anomalies for further investigation.
Provide technical design and engineering assistance to ISP partners to integrate our Open Connect Appliances.
Handle Tier 3 escalations and participate in an on-call rotation for CDN platform production issues.

Qualifications

3+ years of service reliability/operational experience running large-scale, high-performance systems and internet services, with a focus on performance and reliability.
Preferred: B.S. in Computer Science, Electrical or Computer Engineering (or equivalent professional experience).
Strong working knowledge of networking concepts and application protocols, especially TCP/IP, BGP, DNS, TLS, and HTTP/S with focused experience on CDNs and HTTP cache/proxy technologies.
Skilled in designing, creating and maintaining automation written in a programming language such as Python.
Expert-level knowledge of managing and debugging Unix/Linux systems (engineering fundamentals, networking, storage, operating systems) at scale.
Experience with distributed analytic processing technologies (Hive, Presto/Trino, Spark SQL, etc.).
Strong understanding of applied statistics and the ability to code systems that identify outlier behavior in large systems.
Some experience with container and container orchestration technologies (Docker, Kubernetes).
Ability to work in a highly collaborative environment and to communicate cross-functionally with internal and external partners.

Job is open for no less than 7 days and will be removed when the position is filled.
