Expoint - all jobs in one place

Nvidia Solutions Architect - AI HPC Cloud 
United States, California 
933757184

24.06.2024

What you'll be doing:

  • Work with NVIDIA product teams to understand new product requirements, including those for HPC and AI/ML products.

  • Find optimal solutions for deploying these products in a datacenter or lab environment using sophisticated design techniques, services, and tools.

  • Assist in roll-out and deployment of new development features aimed at supporting the latest NVIDIA hardware and technologies.

  • Work closely with world-class engineers, architects, technical product managers, and application developers to put the best strategies in place for a product launch.

  • Define and implement full-scale solutions for product onboarding into our hosted and private cloud environments.

  • Solve sophisticated problems involving multi-site deployments of NVIDIA products.

  • Collaborate with multi-functional teams, including system engineering, software engineering, mechanical/thermal engineering, operations, data center teams, external vendors, and other partners to successfully deliver a reliable and robust platform from concept to prototype to deployments.

  • Directly contribute to the overall quality of deployments and improve time to market for next-generation products.

  • Integrate and optimize cluster deployment methods and manage software stack deployments, including provisioning these services into the cloud.

What we need to see:

  • Bachelor's or Master's Degree in Computer Science or Software Engineering, or equivalent experience.

  • 10+ years of relevant experience.

  • 5+ years of Linux and scripting experience.

  • Solid background in OS kernels and systems engineering.

  • A track record of quickly understanding new technologies outside of your domain expertise and deploying systems in sophisticated configurations from hardware through multiple layers of software in a fast-paced environment.

  • Strong technical skills and understanding of embedded systems, orchestration & automation systems, data centers and cloud architecture, as well as excellent communication and planning skills.

  • Strong problem-solving ability and experience in product engineering, failure analysis and debug, and hardware or test design.

  • Understanding of dense datacenter design, including compute, storage, and networking.

Ways to stand out from the crowd:

  • Understanding of software engineering principles and enterprise system architecture, with an automate-and-scale approach.

  • Experience with compute cluster administration and automation; experience with productivity tools and process automation is a big plus.

  • Experience in large-scale QA environments for product bring-up.

  • Special skills in large-scale and cluster computing (MPI) and in data center design, including high-speed interconnects (InfiniBand), cluster storage, and scheduling-related design and/or management experience.

  • Strong background in Windows and Linux administration.

You will also be eligible for equity and .