In this senior technical leadership role, you will define our model evaluation strategy and determine and develop appropriate methodologies to improve model accuracy. Our goal is to deliver offline evaluation insights that drive model development and user experience improvements while upholding the privacy and quality standards that Apple is known for. With the advent of Apple Intelligence, you will face novel challenges in developing representative datasets, building realistic simulation environments, and developing scalable end-to-end evaluation pipelines that can evolve rapidly with changes in system architecture. As Siri becomes more personal, you will develop innovative evaluation solutions that are grounded in realistic personal contexts. As a senior engineering leader, you will cultivate relationships with stakeholders across Siri and Apple to adopt state-of-the-art approaches to model evaluation and to adapt them to the unique needs of specific Siri models and systems. Your leadership will be instrumental in fostering a culture of continuous improvement and data-driven decision-making across Siri teams.