Expoint - all jobs in one place

The place where the best experts and companies meet

KLA HPC Cluster Architect Manager 
United States, California, Milpitas 
694261766

12.03.2025

The ideal candidate will have a strong understanding of HPC infrastructure, experience in deriving hardware specs from requirements, and proficiency in product lifecycle management. They will engage with teams to understand their requirements, drive development for our HPC platforms, and collaborate with other teams on integration. The candidate should also have expertise in hardware system design, Linux systems administration, container orchestration, networking, security, diagnostics tooling, and performance tuning. Experience integrating, testing, and optimizing HPC with storage and data platforms is also essential.

Principal Responsibilities

  • Drive team growth and development, providing mentorship and support to team members.

  • Ensure the successful execution of projects, meeting deadlines and delivering high-quality results.

  • Work with various OEMs to understand their Product offerings and Roadmaps to create optimal HPC Solution Offerings.

  • Collaborate with other sub-system teams on developing HPC Cluster Roadmaps that meet Product Requirements.

  • Collaborate within customer-focused teams to design, develop, test, and deploy Embedded HPC infrastructure in alignment with business needs.

  • Foster strong relationships with Product and Program Management, Software Engineering, Manufacturing, and Service teams to ensure the HPC Platforms effectively meet their requirements.

Qualifications/Skills

  • 3+ years' experience managing and mentoring teams.

  • Knowledge of the Linux hardware ecosystem centered around CPU, GPU, and PCIe architecture.

  • Deep understanding of Linux operating systems and networking, with practical experience in tuning HPC workloads.

  • Experience with configuration management and automation tools, such as Chef, Ansible, Salt, and Packer.

  • Experience building monitoring and alerting on logs and metrics, with excellent troubleshooting and analytical skills.

  • Experience with and a strong understanding of containers (Docker/Singularity). Container orchestration with Kubernetes is a plus.

  • Maintain a grounded approach, making decisions based on data and strategic goals rather than emotions, and clearly articulate those decisions.

  • International travel a couple of times a year will be required.

Minimum Qualifications

  • Engineering degree (preferably CS or EE).
  • 3+ years' experience managing people.
  • Experience working with HPC Technologies.