Apple Model Optimization Engineer
United States, California, Cupertino
588020043

19.05.2025

We develop, prototype, and productize state-of-the-art algorithms for neural network model compression. Our algorithms are implemented in PyTorch, and the optimizations are geared towards efficient deployment via Core ML. We optimize models across domains, including NLP, vision, and text and image generative models.

Key responsibilities of this role are:
* Setting up and/or streamlining CI and automation pipelines, adopting best practices and integrating with the latest Apple internal CI services.
* Enhancing the release process: automating nightly builds, setting up scheduled CI runs for different levels of testing, etc.
* Innovating in model testing and benchmarking (accuracy and latency) for various combinations of model types in different domains (vision, text, audio, etc.) and compression algorithms (quantization, pruning, palettization, etc.), and discovering trends, the effects of various hyperparameters, etc.
* Being passionate about engineering efficiency, finding innovative ways to reduce test time while maintaining a high bar of test coverage.
* Obsessing about user experience and improving it. You are someone who is excited to fix bugs, understand user pain points, and actively participate in supporting users.
* Developing integration of the model optimization library with other training engines and data platforms at Apple.
* Keeping the code base updated to work with the latest versions of Python, PyTorch, NumPy, etc.
* Setting up and debugging training jobs, datasets, evaluation, and performance benchmarking pipelines. This includes ramping up quickly on new training code bases and running detailed experiments and ablation studies to profile algorithms on various models and tasks, across different model sizes.
* Improving model optimization documentation, and writing tutorials and guides.
* Self-prioritizing and adjusting to changing priorities and asks.

BS/MS in Computer Science or related field
Relevant internship experience

Demonstrated ability to design user-friendly and maintainable APIs
Proficiency in at least one ML authoring framework, such as PyTorch, TensorFlow, JAX, or MLX
Experience in training, fine-tuning, and optimizing neural network models
Experience with model compression and quantization techniques, especially with one of the optimization libraries for an ML framework (e.g. torch.ao)

Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.
