Prototype end-to-end solutions to improve distributed training and disaggregated inference performance. Analyze and optimize communication flows across application, transport, and network layers. Develop system software spanning communication libraries, drivers, and...