Expoint – all jobs in one place
Finding the best job has never been easier
Limitless High-tech career opportunities - Expoint

Apple AIML - Staff Machine Learning Engineer Reinforcement 
United States, California 
46613232

Yesterday
As part of this group, you will be doing large scale machine learning and deep learning research and development to improve Open Domain Question Answering (using both structured knowledge graph data and unstructured web data) and Summarization as well as developing fundamental building blocks needed for Artificial Intelligence. This involves developing sophisticated machine learning and large language models (LLMs) to understand user queries, retrieve and rank relevant documents across multiple sources and synthesize information across documents to provide user with a direct answer that best satisfies their intent and information seeking needs. Additionally, you will research and develop the state-of-the-art LLMs for summarizing personal data such as emails, messages, and notifications.
In this role, you will work on LLM based question answering and Apple Intelligence features to provide concise, accurate, and grounded information to users to help them complete their tasks quickly on Apple devices. Your core responsibilities will include:* Designing and developing advanced Reinforcement Learning technologies in the post-training of generative model, and delivering the end-user experience.* Driving cross-functional technical initiatives, collaborating with research, engineering and production teams to translate theoretical advances into deployable systems.* Developing novel and cutting-edge RL algorithms and improving existing ones.* Staying up to date with the latest RL research and integrate best practices into the team's workflow.* Working on the end-to-end ML lifecycle: algorithm design and implementation, data collection, model training, evaluation, and deployment.
  • 10+ years of ML experiences in search, natural language processing/understanding. Conversational AI.
  • Proven experience for LLM post training, including but not limited to SFT, RLHF, RLAIF, Reward Modeling, Chain-of-thought, agentic LLM.
  • Hands-on experience building RL pipelines and training agents in simulation or real-world environments.
  • Growth mindset and ability to learn new technologies
  • MS or Ph.D. in Computer Science, Machine Learning with a specialty in reinforcement learning, or a related field
  • Deep expertise in reinforcement learning-based post-training on LLM models, reward modeling, RLHF, RLAIF, Chain-of-thought, and agentic AI R&D.
  • Deep understanding of cutting edge RL algorithms and large language model.
  • Deep understanding in LLM pre-training, post-training.
  • Strong product intuition and ownership
  • Excellent communication skills
Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.