Expoint – all jobs in one place
Finding the best job has never been easier
Limitless High-tech career opportunities - Expoint

Nvidia Senior Systems Software Engineer TAO API 
United States, California 
224747864

02.07.2025
US, CA, Santa Clara
time type
Full time
posted on
Posted 30+ Days Ago
job requisition id

In this role, you will contribute to the development and optimization of advanced machine learning software and frameworks. You will also play a critical role in enhancing our CI/CD pipelines and DevOps practices to ensure efficient and reliable delivery of our software solutions. We are looking for someone with extensive experience in machine learning, strong leadership skills, and a deep understanding of CI/CD and cloud infrastructure.

What you’ll be doing:

  • Contribute the development and maintenance of advanced machine learning software and frameworks, optimizing for performance and scalability.

  • Enhance CI/CD pipelines to streamline the development, testing, and deployment of large-scale machine learning models.

  • Implement and manage cloud infrastructure for continuous integration, delivery, and deployment, ensuring high availability and scalability.

  • Collaborate with cross-functional teams, including engineering, QA, and research, to improve development workflows and enhance software delivery speed and quality.

  • Troubleshoot and resolve complex issues related to software development, containerization, and cloud infrastructure in production environments.

  • Write and maintain robust documentation for development and deployment processes.

  • Communicate effectively with technical and non-technical stakeholders to set shared expectations and ensure visibility around the release and deployment process.

  • Lead code reviews, testing, and debugging to ensure high-quality code and efficient workflows.

  • Mentor and guide junior engineers, fostering professional growth and enhancing team capabilities.

What we need to see:

  • Bachelor’s or master’s degree or equivalent experience in Computer Science, Information Systems, Engineering, or other related fields.

  • 7+ years of experience in software engineering, with hands-on experience in CI/CD, cloud infrastructure, and advanced machine learning frameworks.

  • Proficiency with automation and orchestration tools, including Docker, Kubernetes, Jenkins, and Terraform or Ansible, or similar CI Tools.

  • Experience with cloud platforms like AWS, Azure, or GCP.

  • Strong programming skills in Python and/or other relevant languages.

  • Experience in developing and deploying scalable software solutions.

  • Strong analytical and problem-solving skills with a focus on practical and scalable solutions.

  • Ability to work effectively in a collaborative environment and manage multiple tasks and projects.

  • Familiarity with version control systems and configuration management.

Ways to stand out from the crowd:

  • Demonstrated ability to learn quickly and adapt to new technologies.

  • Strong academic record, with achievements in competitive programming or related areas.

  • Extensive experience with advanced AI tools and frameworks, including LLMs and NVIDIA Blueprints.

  • Contributions to open-source projects, demonstrating a collaborative and innovative mindset.

  • Experience in deploying machine learning models on edge devices or platforms.

You will also be eligible for equity and .