Reliability Availability Serviceability Expert jobs at Nvidia
Advance your career in high tech with Expoint. Discover job opportunities as a Reliability Availability Serviceability Expert and join top companies in the industry such as Nvidia. Sign up today and take control of your future.
Company (1)
Job type
Job categories
Job title (1)
United States
State
City
23 jobs found
01.09.2024
N
Nvidia Senior Site Reliability Engineer United States, California
Develop frameworks and scripts to automate workflows and deployments in a private cloud environment that houses several compute servers with NVIDIA GPUs. Specific focus on building and stabilizing our virtualization...
Build automation workflows for self-service and auto-healing capabilities. Work with teams to deploy new data center infrastructures. Plan and implement optimizations for compute infrastructure consumption. Create alerting, reports, and dashboards...
Design, implement an on-prem HPC infrastructure supplemented with cloud computing to support the growing IT needs of Nvidia. Design and implement scalable and efficient Storage solutions tailored for data-intensive applications,...
Design, implement and support operational and reliability aspects of large scale Observability & Telemetry collection platform with a focus on performance at scale, real time monitoring, logging and alerting. Engage...
Develop, test, and deploy data collectors, pipelines, and services to enhance use of our AI/ML and chip development infrastructure. Participate in the full life-cycle of tool development, test, and deployment....
Develop a team of SREs, providing mentorship, guidance, and support in achieving team goals. Nurture a culture of collaboration, innovation, and continuous improvement within the SRE team. Your team will...
Design, implement and support operational and reliability aspects of large scale Kubernetes clusters with focus on performance at scale, real time monitoring, logging and alerting. Engage in and improve the...
Develop frameworks and scripts to automate workflows and deployments in a private cloud environment that houses several compute servers with NVIDIA GPUs. Specific focus on building and stabilizing our virtualization...
Discover your dream career in the high tech industry with Expoint. Our platform offers a wide range of Reliability Availability Serviceability Expert jobs opportunities, giving you access to the best companies in the field, like Nvidia. With our easy-to-use search engine, you can quickly find the right job for you and connect with top companies. No more endless scrolling through countless job boards, with Expoint you can focus on finding your perfect match. Sign up today and follow your dreams in the high tech industry with Expoint.