Service Reliability Operations Administrator jobs at Nvidia
Advance your career in high tech with Expoint. Discover job opportunities as a Service Reliability Operations Administrator and join top companies in the industry such as Nvidia. Sign up today and take control of your future.
Design, build, deploy, and run internal tooling built on top of cloud infrastructure to provide foundations for operational excellence. Design, implement, ship, and maintain essential data pipelines that will be...
Design, deploy and support large-scale, distributed GPU clusters to run high-performance AI and machine learning workloads. Continuously improve infrastructure provisioning, management, and monitoring through automation. Ensure the highest level of...
Reliability (DfR) qualification and testing for various packaging including GPU/CPU, power model, memory IC, wifi module and etc. Work with design and operations teams to create board and system reliability...
The team will provide their services 24/7 with a follow-the-sun environment which will span continents. You will report directly to a manager in the United States. Each team member will...
Develop automated tools to efficiently deploy, provision, and maintain extensive GPU clusters interconnected via NVLink and InfiniBand. Implement modern DevOps tools to automate software updates, perform maintenance tasks, and monitor...
Design and deliver robust implementable solutions for complex supply chain planning requirements using modeling approaches. Lead small projects to implement new and improve existing functionalities including articulating requirements and translating...
Design and implement state-of-the-art GPU compute clusters. Optimize cluster operations for maximum reliability, efficiency, and performance. Drive foundational improvements and automation to enhance researcher productivity. Tackle strategic challenges in large-scale,...
Design, build, deploy, and run internal tooling built on top of cloud infrastructure to provide foundations for operational excellence. Design, implement, ship, and maintain essential data pipelines that will be...
Discover your dream career in the high tech industry with Expoint. Our platform offers a wide range of Service Reliability Operations Administrator jobs opportunities, giving you access to the best companies in the field, like Nvidia. With our easy-to-use search engine, you can quickly find the right job for you and connect with top companies. No more endless scrolling through countless job boards, with Expoint you can focus on finding your perfect match. Sign up today and follow your dreams in the high tech industry with Expoint.