Expoint – all jobs in one place
The point where experts and best companies meet
Limitless High-tech career opportunities - Expoint

Apple Model Optimization Engineer Quality ML DevOps 
United States, California, Cupertino 
445284610

08.06.2025
Key responsibilities of this role are: - Making enhancements to the release process, automating nightly builds, and setting up scheduled CI runs for different levels of testing etc. - Making innovations in model testing and benchmarking (accuracy and latency), for various combinations of model types in different domains (vision, text, audio etc) and compression algorithms (quantization, pruning, palettization etc), discovering performance/accuracy trends, effects of various hyper parameters etc. - Finding innovative ways to reduce test time while maintaining high quality test coverage - Keeping the code base updated to work with the latest versions of Python, PyTorch, numpy etc. - Set up and debug training jobs, datasets, evaluation, performance benchmarking pipelines. Ability to ramp up quickly on new training code bases and run experiments. - Run detailed experiments and ablation studies to profile algorithms on various models, tasks, across different model sizes. - Improving model optimization documentation, writing tutorials and guides- Self prioritize and adjust to changing priorities and asks
  • Bachelors in Computer Sciences, Engineering, or related discipline.
  • 2 years of industry experience (including internships)
  • Highly proficient in Python programming
  • Expertise in shell programming, experience with setting up and/or maintaining CI pipelines for at least one production software codebase
  • Good communication skills, including ability to communicate with cross-functional audiences
  • Demonstrated ability to design user friendly and maintainable APIs
  • Proficiency in at least one ML authoring framework, such as PyTorch, TensorFlow, JAX, MLX
  • Experience in training, fine tuning, and optimizing neural network models
  • Experience in the area of model compression and quantization techniques, specially in one of the optimization libraries for an ML framework (e.g. torch.ao).
Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.