Share
Key job responsibilities
This role will help lead efforts in building distributed inference support into PyTorch using XLA and the Neuron compiler and runtime stacks. Strong software development skills in C++/Python and ML knowledge are both critical to this role. The position will identify optimization opportunities by performing comparative analysis and benchmarking with alternative solutions. The role will develop and automate solutions to ensure the accuracy of AI accelerators while optimizing their performance. This position will develop a set of deep AI toolchains to simplify and abstract the low-level AI accelerator modules.A day in the life
Work/Life Balance
Mentorship & Career Growth
- 3+ years of programming using a modern programming language such as Java, C++, or C#, including object-oriented design experience
- 3+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Fundamentals of Machine learning and deep learning models, their architecture, training and inference lifecycles along with work experience on some optimizations for improving the model execution.
- Bachelor's degree in computer science or equivalent
These jobs might be a good fit