Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

Apple Machine Learning Engineer- LLM 
United States, California, Cupertino 
269775565

03.04.2025
Product Operations partners with a variety of different engineering and operations teams, our team leads development of machine learning solutions. We deliver projects from end-to-end: problem statement and conceptualization, proof-of-concept, and participation in final deployment! You will also perform ad-hoc statistical analyses. You will also work closely with data engineers to generate detailed business intelligence solutions. You will be expected to conduct presentations of analyses to a wide range of audiences including executives.
You will also perform ad-hoc statistical analyses. You will also work closely with data engineers to generate detailed business intelligence solutions. You will be expected to conduct presentations of analyses to a wide range of audiences including executives.
  • 3+ years experience in machine learning algorithms, software engineering, and data mining models with an emphasis on large language models (LLM) or large multimodal models (LMM).
  • Masters in Machine Learning, Artificial intelligence, Computer Science, Statistics, Operations Research, Physics, Mechanical Engineering, Electrical Engineering or related field.
  • Proven experience in LLM and LMM development, fine-tuning, and application building. Experience with agents and agentic workflows is a major plus.
  • Experience with modern LLM serving and inference frameworks, including vLLM for efficient model inference and serving.
  • Hands-on experience with LangChain and LlamaIndex, enabling RAG applications and LLM orchestration.
  • Strong software development skills with proficiency in Python. Experienced user of ML and data science libraries such as PyTorch, TensorFlow, Hugging Face Transformers, and scikit-learn.
  • Familiarity with distributed computing, cloud infrastructure, and orchestration tools, such as Kubernetes, Apache Airflow (DAG), Docker, Conductor, Ray for LLM training and inference at scale is a plus.
  • Deep understanding of transformer-based architectures (e.g., BERT, GPT, LLaMA) and their optimization for low-latency inference.
  • Ability to meaningfully present results of analyses in a clear and impactful manner, breaking down complex ML/LLM concepts for non-technical audiences.
  • Experience applying ML techniques in manufacturing, testing, or hardware optimization is a major plus.
  • Proven experience in leading and mentoring teams is a plus.
Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.