Design and implement end-to-end automation solutions for HPC infrastructure (compute, network, and storage) using Kubernetes operators, Terraform, and Ansible. Analyze compute, storage, and network traffic patterns during distributed training/inference operations across...