Typically we will hire engineers from backgrounds such as Site Reliability Engineer (SRE), Software Engineer, Systems Engineer, Systems Development Engineer, DevOps Engineer, Systems Administrator, or similar.
Production Systems Engineer Responsibilities
Build and develop tooling solutions to automate business critical processes in service of managing the health of the Meta production hardware fleet
Troubleshoot, diagnose and root cause system failures, working with key partners to identify and deliver solutions
Proactively identify opportunities to fix or enhance tooling, hardware and processes
Build subject matter expertise in one or more of the specialist areas covered by the RTP (Release To Production) team in Dublin
Scientific approach to troubleshooting, root-cause analysis and investigation
Minimum Qualifications
An engineering degree is typical, or related technical discipline, or equivalent work experience
4+ years experience coding in a higher-level language (Python, PHP, Java, Go, Rust, C++)
Experience building, maintaining and debugging production services or platforms - usually (but not necessarily) in a linux/unix environment
Knowledge of server architecture and components across Compute/Storage/AI Systems/Networking
Preferred Qualifications
4+ years experience coding in a higher-level language (Python, PHP, Java, Go, Rust, C++)
Experience managing and debugging hardware platforms in a cloud environment
Demonstrated ability to drive projects to successful business outcomes