Experience designing and developing ML infrastructure/frameworks for training and inference. Experience of model quantization, tensor parallelism, and inference optimizations (e.g ONNX Runtime, TensorRT, vLLM). Experience building machine learning models using...