In this role, you will be empowered to: • Implement ML algorithms using Apple Neural Engine SoC, with an emphasis on performance and power• Add support for new hardware feature into the Apple Neural Engine compiler stack• Run performance analysis and optimization of ML workloads running on Apple Neural Engine• Evaluate existing hardware blocks and contribute to the definition of new hardware blocks • Collaborate with the hardware team to review hardware specifications; in addition, you will work closely with the design and micro-architecture team to understand the functional and performance goals of the design, and design appropriate tests• Partner with the driver/firmware teams to integrate HW acceleration in our software stack