Expoint - all jobs in one place

Apple High Performance Computing HPC Engineer 
United States, Texas, Austin 
808725163

In this role, you will be responsible for supporting, testing, and deploying the HPC infrastructure products at the core of our operations. You will help plan, code, build, test, deploy, operate, and monitor our Infrastructure-as-Code solutions for HPC server infrastructure.

Your responsibilities will include:
  • Demonstrating strong troubleshooting skills by independently identifying and resolving issues.
  • Monitoring system performance and availability, and remediating issues as necessary.
  • Developing automation for common development and operational tasks.
  • Maintaining clear, current documentation of system configurations, including detailed justifications, training materials for complex topics, status reports, and procedural guides.
  • Collaborating with application, infrastructure, network, and storage engineering teams to find balanced solutions to engineering problems.
  • Assessing future capacity requirements and evaluating new product features or enhancements.

Qualifications:
  • Proven experience in an HPC support role in an enterprise environment with 500+ node clusters.
  • A Bachelor’s degree in Computer Science.
  • Experience deploying and managing schedulers such as SLURM, LSF, and/or NC.
  • Experience deploying and configuring FEA solvers to run on HPC systems.
  • Experience with NVIDIA GPU compute.
  • Strong Linux administration skills.
  • Experience with InfiniBand, including IPoIB and RDMA.
  • Experience with multiple flavors of MPI.
  • Experience with machine learning and deep learning concepts, algorithms, and models.
  • Background in Software Defined Networking and AI/HPC cluster networking.
  • Familiarity with deep learning frameworks such as PyTorch and TensorFlow.
  • Experience with automation and configuration management tools such as Ansible, Cobbler, and Puppet.
  • Experience developing and securing containerized applications and HPC environments (e.g., Apptainer) is beneficial.
  • Experience with virtualization technologies is beneficial.
  • Strong interpersonal skills that enable you to influence others and negotiate positive outcomes for customers.
  • Excellent time management and prioritization skills.
  • A willingness to learn.
  • A focus on continuous self-improvement, teamwork, innovation, and results.
  • The ability to maintain a professional demeanor in any situation.