The point where experts and best companies meet
Share
What you'll be doing:
Work with NVIDIA Product Teams to understand new product requirements including HPC and AI/ML Products.
Finding Optimum Solutions to deploy these products in a Datacenter or a Lab environment using sophisticated design techniques, services and tools.
Assist in roll-out and deployment of new development features aimed at supporting the latest NVIDIA hardware and technologies.
Work closely with world-class engineers, architects, technical product managers and application developers setting the best strategies in place for a product launch.
Defining and implementing full scale solutions for product onboarding into our hosted and private cloud environments.
Solve sophisticated problems involving multi-site deployments of NVIDIA products.
Collaborate with multi-functional teams, including system engineering, software engineering, mechanical/thermal engineering, operations, data center teams, external vendors, and other partners to successfully deliver a reliable and robust platform from concept to prototype to deployments.
Directly contribute to the overall quality of deployments and improve time to market next gen products.
Integrate and Optimize Cluster Deployment methods and manage SW stack deployments, including provisioning these services into the cloud.
What we need to see:
Bachelor's or Master's Degree in Computer Science or Software Engineering, or equivalent experience.
10+ years of relevant experience.
5+ years of Linux and Scripting experience.
Solid background on OS Kernels and system engineering.
A track record of quickly understanding new technologies outside of your domain expertise and deploying systems in sophisticated configurations from hardware through multiple layers of software in a fast-paced environment.
Strong technical skills and understanding of embedded systems, orchestration & automation systems, data centers and cloud architecture, as well as excellent communication and planning skills.
Strong problem-solving ability and experience in product engineering/failure analysis and debug/ HW or test design.
Understanding of dense datacenter design including compute, Storage and networking.
Ways to stand out from the crowd:
Understanding of software engineering principles and enterprise system architecture with an automate and Scale approach.
Experienced with compute clusters administration, automation as well as experience with productivity tools and process automation is big plus
Experience in large scale QA environments, for product bring ups.
Special skills in large-scale computing and cluster computing (MPI), data center design include high speed interconnect InfiniBand, Cluster Storage and Scheduling related design and/or management experience.
Strong background on Windows & Linux administration.
You will also be eligible for equity and .
These jobs might be a good fit