Design and implement drivers and/or firmware for Ethernet network adapter functions, the RDMA transport stack, and control functions with the host and accelerators.
Design and implement platform services such as programming, monitoring, and controlling system components (optics, PHY, FPGAs, sensors, fan control, power, etc.).
Develop and enhance HPC collective communication and parallel computing libraries such as NCCL, RCCL, oneCCL, and MPI.
Debug complex, system-level, multi-component issues that typically span multiple layers, from the kernel to user-mode applications.
Minimum Qualifications
Currently has, or is in the process of obtaining, a Bachelor's degree in Computer Science, Computer Engineering, or a relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta.
Proficient in programming in C/C++/Python
Hands-on experience with debugging large-scale systems
Preferred Qualifications
Experience in one of the following areas:
Experience with the Linux kernel, especially drivers and the network stack
Working knowledge of transport stacks, particularly RDMA (RoCEv2)
Experience with parallel computing platforms such as CUDA, ROCm, and OpenCL
Platform services (programming, controlling, and monitoring optics, PHY, FPGAs, sensors, fan control, power, etc.), Board Support Package (BSP), operating systems, kernel, bootloader, power management, RTOS, Linux
Experience with QEMU or FPGA emulation environments is a plus
Data Center
$117,000/year to $173,000/year + bonus + equity + benefits
Individual pay is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base salary only, and do not include bonus, equity, or sales incentives, if applicable. In addition to base salary, Meta offers benefits. Learn more about benefits at Meta.