In this role you will:

- Design and analyze human evaluations of AI systems, creating reliable annotation frameworks and ensuring the validity and reliability of measurements of latent constructs (see the reliability sketch after this list)
- Develop and refine benchmarks and evaluation protocols, using statistical modeling, test theory, and task design to capture model performance across diverse contexts and user needs
- Conduct statistical analysis of evaluation data to extract meaningful insights, identify systematic issues, and inform improvements to both models and evaluation processes
- Independently run and analyze experiments that lead to tangible improvements
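
As a concrete illustration of the reliability work mentioned in the first bullet, here is a minimal sketch, assuming two hypothetical annotators assigning binary pass/fail labels to the same set of model responses: it estimates chance-corrected inter-rater agreement (Cohen's kappa) with a bootstrap confidence interval. The data, agreement rates, and sample size are invented for the example and are not part of any actual evaluation.

```python
# Illustrative sketch (hypothetical data): estimating inter-rater reliability
# for an annotation framework using Cohen's kappa with a bootstrap CI.
import numpy as np
from sklearn.metrics import cohen_kappa_score

rng = np.random.default_rng(0)

# Hypothetical: two annotators label the same 500 model responses
# as "pass" (1) or "fail" (0), each with imperfect accuracy.
n_items = 500
true_quality = rng.integers(0, 2, size=n_items)
annotator_a = np.where(rng.random(n_items) < 0.90, true_quality, 1 - true_quality)
annotator_b = np.where(rng.random(n_items) < 0.85, true_quality, 1 - true_quality)

# Point estimate of chance-corrected agreement between the two annotators.
kappa = cohen_kappa_score(annotator_a, annotator_b)

# Nonparametric bootstrap over items to quantify uncertainty in kappa.
boot = []
for _ in range(2000):
    idx = rng.integers(0, n_items, size=n_items)
    boot.append(cohen_kappa_score(annotator_a[idx], annotator_b[idx]))
lo, hi = np.percentile(boot, [2.5, 97.5])

print(f"Cohen's kappa = {kappa:.3f} (95% bootstrap CI: {lo:.3f} to {hi:.3f})")
```

In practice, annotation schemes with more than two raters or with ordinal/continuous labels would call for measures such as Krippendorff's alpha, and low observed agreement would feed back into refining the annotation guidelines themselves.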