Nvidia Senior Staff Software Engineer - Enterprise AI Platform 
United States, California 
959599401

28.07.2025
US, CA, Santa Clara
Time type: Full time
Posted on: 2 days ago

What you will be doing:

  • Own the end-to-end lifecycle of software development, from concept to deployment, including architecture design, development, testing, and scaling

  • Understand internal microservices, internal and third-party platforms, and the growing body of open-source code repositories, and make the best use of them during AI product development

  • Contribute to internal platforms and build reusable components that connect to enterprise data sources and power search, chatbots, and other gen AI applications

  • Develop AI applications, platforms, and systems that enable a unified experience across applications and drive insights into the end-to-end user experience

  • Build services that support inference, training, and ingestion jobs

  • Understand the ecosystem of data connectors and build secure AI applications that can access structured and unstructured data from a variety of databases at scale

  • Ensure system reliability, performance, and security at scale.

  • Help build and maintain our Continuous Delivery pipeline with the goal of moving changes to production faster and more safely, while upholding key operational standards.

  • Create and implement strategies to support business growth and technological advancements, ensuring flexibility and adaptability.

  • Provide peer reviews to other specialists including feedback on performance, scalability, and correctness.

  • Keep abreast of emerging trends and technologies in AI, software development, and system architecture.

  • Advocate strongly for proven methods in software engineering and bring a detailed approach to testing, continuous delivery, and reducing technical debt.

What we need to see:

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field, or equivalent experience.

  • 8+ years of proven experience building sophisticated applications and APIs in on-premises, cloud, and hybrid cloud environments at large scale, preferably in Python

  • Proven experience building full-stack applications, including UI, backend, and infrastructure

  • Proven expertise in the performance and reliability of sophisticated distributed systems and of the teams that build them

  • Strong proficiency in multiple programming languages and technologies relevant to AI and system development

  • Familiarity with building gen AI applications, AI application deployments, and model deployments (LLMs, embeddings, re-rankers, OCR, etc.)

  • Experience delivering software with a full understanding of deploying applications in Kubernetes clusters, including GPU and CPU pod scheduling (with an understanding of on-premises environments)

  • Proven track record of leading complex projects and delivering results in a fast-paced, multifaceted environment.

  • Extremely motivated, highly passionate, and curious about new technologies. You take pride in your work, strive to achieve incredible results, and possess superb communication and planning skills.

  • Excellent leadership, problem-solving, analytical and communication skills, capable of inspiring and leading a technical team.

Ways to stand out from the crowd:

  • Experience enhancing enterprise efficiency and employee experience through the effective use of Generative AI based solutions.

  • Background with Kubernetes, OpenShift, and MLOps, as well as experience with model deployments (inference, training)

  • Self-motivation and a drive to get things to “done”.

  • Excellent programming, debugging, performance analysis, and test design skills; experience applying them in Python is a plus.

You will also be eligible for equity and .