As an engineer in this role, you will be primarily focused on the ingestion and optimization of ML programs from different authoring frameworks (such as PyTorch) into CoreML using a combination of graph capture, conversion, and compilation pipelines.KEY RESPONSIBILITIES:- Develop technologies to quickly onboard new ML models to our on-device stack, including contributions to ML authoring frameworks.- Understand different ML operations, architectures, and graph representations in different authoring frameworks. Keep abreast of latest innovations in this space.- Architect and build CoreML’s model representation that can efficiently represent program semantics from the authored frameworks, while allowing for peak execution performance. - Define and develop optimizations such as quantization, operator transformations, fusions, etc. to make models more amenable to efficient on-device deployment