Expoint – all jobs in one place
The point where experts and best companies meet
Limitless High-tech career opportunities - Expoint

Apple On-device ML Performance Infrastructure Engineer 
United States, Washington, Seattle 
696922012

Yesterday
As an engineer in this role, you will primarily focused on building performance infrastructure to present high level views of ML inference behavior which are built by gathering lower level data from execution delegates and relating the data to the high level model. You’ll work with models created by the most popular ML frameworks (PyTorch, JAX, MLX, etc) and will analyze the execution to ensure the stack achieves full machine performance on Apple Silicon. The role also has exposure to building higher level APIs and toolings to enable developers to visualize, diagnose, and debug correctness and performance issues while onboarding models to on-device deployment. We are building the first end-to-end developer experience for ML development that, by taking advantage of Apple’s vertical integration, allows developers to iterate on model authoring, optimization, transformation, execution, debugging, profiling and analysis. Providing actionable feedback to ML developers w.r.t. The details of their model’s inference behaviors is crucial to achieving performance.
  • Bachelors or Masters in Computer Science or relevant disciplines.
  • Highly proficient in C++. Familiarity with Python.
  • Familiarity with Operating Systems, embedding programming, parallel programming.
  • Knowledge of ML fundamentals including training regimes, evaluation and deployment/inference.
  • Understanding of ML architectures, compilers, runtimes, system performance, and system software engineering.
  • PhDs in Computer Science or relevant disciplines.
  • On-device ML stack, such as TFLite, ONNX, ExecuTorch, etc.
  • ML authoring framework (PyTorch, TensorFlow, JAX, etc.).
  • Compiler stack (MLIR/LLVM/TVM etc.).
  • Experience with accelerators, GPU programming.
  • Experience with OS kernel programming, computer architecture or performance analysis
  • Experience with using developer tools such as vTune and Nvidia Nsight
  • A passion/interest for ML, particularly applied to on-device use cases.
  • Good communication skills, including ability to communicate with cross-functional audiences.
Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.