Expoint – all jobs in one place
The point where experts and best companies meet
Limitless High-tech career opportunities - Expoint

Nvidia Senior Generative AI Software Engineer 
United States, Texas 
566803557

Today
US, CA, Santa Clara
US, CA, Remote
time type
Full time
posted on
Posted 5 Days Ago
job requisition id


What you'll be doing:

  • You will own and evolve the Cosmos open-source and internal research codebases, crafting core infrastructure that supports our foundation model research and deployment.

  • Refactor and modularize large research-driven code into clean, testable, maintainable libraries for use across teams.

  • Integrate and adapt off-the-shelf models into our pipelines as preprocessors, postprocessors, or evaluation components.

  • Build model-serving endpoints (e.g., with Gradio or FastAPI) to enable researchers and internal users to experiment with models interactively.

  • Design, implement, and maintain evaluation pipelines, providing high-quality tooling to the broader team to measure model quality and track improvements.

  • Improve configuration hygiene and reproducibility using systems like Hydra, and ensure smooth overrides, templates, and environment switching.

  • Lead efforts in packaging and release of Python modules using modern tools (uv, just, pydantic) for both OSS and internal consumption.

  • Set the standard for code health, test coverage, and release readiness across the team. Write documentation and automation to scale good practices.


What we need to see:

  • Expert-level proficiency in Python, with a strong foundation in modular design, abstraction boundaries, and collaborative codebase evolution.

  • Fluency with PyTorch, including the ability to run, debug, and patch inference-time model behavior in research-level codebases. Comfort modifying pre/post-processors, model wrappers, and checkpoint logic.

  • Proven experience in refactoring large codebases—cleaning up legacy implementations, eliminating anti-patterns, and paying down tech debt to improve long-term maintainability.

  • Strong grasp of configuration systems, especially Hydra, with an emphasis on reproducibility, override logic, and environment scoping.

  • Familiarity with Python packaging tools like uv, just, and pydantic, including experience managing environment consistency and shipping libraries as artifacts.

  • Strong instincts around code health: API design, directory structure, writing unit and integration tests, exception hygiene, docstrings, and dependency isolation.

  • Comfortable deploying models internally via Gradio or similar frameworks to enable interactive evaluation and feedback from researchers or downstream users.

  • BS or MS (or equivalent experience) in Computer Science, Software Engineering, or a related technical field and 10+ years of industry experience.


Ways to stand out from the crowd:

  • Proficiency in model configs, especially Hydra! Comfortable crafting hierarchical config systems with reusable templates, environment scoping, and overrides for evaluation, inference, or release.

  • Prior work cleaning up sophisticated generative model codebases—adding tests, improving wrappers, and instrumenting code for observability and debugging.

  • Demonstrated success raising engineering quality in a research setting: taking exploratory code and evolving it into a robust, production-friendly module.

  • Track record of mentoring teammates on software engineering best practices and proactively identifying long-term structural risks in fast-moving teams.

  • Passion for building ML tooling that is not only functional, but also elegant, intuitive, and maintainable by others.

You will also be eligible for equity and .