Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

Apple Staff GenAI Engineer Apple Data Platform 
United States, California, Cupertino 
186031852

14.04.2025
RESPONSIBILITIES INCLUDE: * Define and drive the technical vision, roadmap, and strategy for Apple Data Platform’s GenAI components, enabling scalable development and deployment of AI applications powered by LLMs and agentic workflows* Guide the design and development of platform capabilities such as agent orchestration, RAG integration, LLM model configuration, prompt tooling, and fine-tuning pipelines* Drive efforts around LLM inference optimization, including caching, prompt tuning, and latency improvements, ensuring efficient and cost-effective model usage across applications* Collaborate with engineering, product, and operations teams across Apple to ensure effective adoption of GenAI capabilities in high-impact workflows* Partner with stakeholders to build reusable AI agents that enhance productivity, automate reasoning tasks, and integrate securely with internal systems and tools* Mentor new hires and fellow engineers, fostering growth in technical depth and platform mindset* Establish best practices and processes that ensure engineering excellence, operational sustainability, and a seamless developer experience* Promote a healthy, inclusive, and innovation-driven team culture with a focus on experimentation, learning, and long-term platform impact
  • 8+ years of software development experience
  • 3+ years of experience as a technical lead, guiding teams through complex design decisions and setting high benchmarks for code quality, performance, and scalability
  • In-depth understanding of large language models (LLMs) and their application in AI-driven solutions, including inferencing, embedding, and knowledge base integration (RAG) for improved data retrieval and contextualization
  • Hands-on experience designing and building GenAI platforms that allow users to create, configure, and deploy AI applications supporting features like agent orchestration, prompt engineering, RAG integration, and model selection
  • Experience building AI agents capable of complex multi-step reasoning and tool usage, with a focus on reliability, traceability, and composability
  • Proven experience in fine-tuning and customizing foundation models to improve task-specific performance and domain alignment
  • Deep knowledge of LLM inference optimization techniques, including prompt tuning, caching, quantization, and latency reduction across different model families
  • Strong programming skills in Python, Java, or similar languages, with an emphasis on AI/ML systems development and platform engineering
  • Demonstrated ability to work cross-functionally and influence product development through a combination of technical leadership and user-centered thinking
  • Passion for operational excellence, automation, and delivering scalable, developer-friendly AI infrastructure
  • B.S, M.S. or PhD Degree in Computer Science/Engineering, or equivalent work experience
  • Expertise in AWS Cloud
  • Hands on experience in using Kubernetes as orchestration layer
Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.