In this role, you will be responsible for supporting, testing, and deploying the HPC infrastructure products at the core of our operations. You will help plan, code, build, test, deploy, operate, and monitor our Infrastructure-as-Code solutions for HPC server infrastructure.

Your responsibilities will include:
- Demonstrating strong troubleshooting skills by independently identifying and resolving issues.
- Monitoring system performance and availability, and remediating issues as necessary.
- Developing automation for common development and operational tasks.
- Maintaining clear, current documentation of system configurations, and creating detailed justifications, training materials for complex topics, status reports, and procedural guides.
- Collaborating with application, infrastructure, network, and storage engineering teams to find balanced solutions to engineering problems.
- Assessing future capacity requirements and evaluating new product features or enhancements.