Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

Nvidia Principal Infrastructure SW Engineer AI Cloud Services 
United States, Texas 
985928690

01.12.2024
What you'll be doing:
  • Guide development and operations of cloud services that enable external developers to easily access the latest AI models, optimizations, and serving techniques

  • Lead and directly contribute to implementation of key infrastructure features to enable product goals and improve productivity of internal engineers

  • Mentor engineers to develop their technical skills and ability to make an impact

  • Collaborate with product and engineering leads on feature roadmaps and execution planning

  • Promote and support methodologies that improve efficiency, product quality, security, and scalability.

  • Identify and seize opportunities to build common infrastructure that can be shared across various AI-related services

What we need to see:
  • MS, or PhD in Computer Science, Computer Engineering, or closely related field (or Bachelors with additional equivalent experience).

  • 12+ years of relevant experience as a developer, technical lead, and/or engineering manager

  • Proven technical skills in architecting, designing, implementing and delivering high-quality cloud services.

  • Proficiency in one or more programming languages (e.g., Python, TypeScript, Go)

  • Proficiency in SW development and DevOps best practices (SW development life cycle, developer workflows, continuous integration, infrastructure as code, etc.)

  • Experience building applications or services that incorporate AI

  • Excellent interpersonal skills and a collaborative, pragmatic approach to solving problems.

Ways to stand out from the crowd:
  • Experience building and operating publicly accessible services that incorporate AI at scale

  • Strong grasp of the latest trends in AI inference serving and performance optimization

  • Deep knowledge of GPU infrastructure management and/or CUDA applications

  • Experience with multiple major cloud platforms (AWS, Azure, GCP, OCI, etc.)

You will also be eligible for equity and .