Expoint - all jobs in one place

The point where experts and best companies meet

Limitless High-tech career opportunities - Expoint

Nvidia Solutions Architect Generative AI 
United States, California 
39425656

31.07.2024

accelerated computing platforms.

developer andbuilding scalable industry-specific enterprise AI solutionsfrom project scoping to POC to production.be immersed in a diverse, cultivating environment where everyone is inspired to do their life's work.

you will be doing:

  • You will enable our strategic service delivery partners to build enterprise AIsolutionusing RAG based end-to-end workflows by connecting LLMs to domain specific data.

  • Collaborate with developers and onboard them to NVIDIA AI platforms and services byprovidingdeep technical guidance.

  • Develop tools and recipes which helpenterpriseexplore, train, and deploy retrieval models and other componentsrequiredin LLM RAG systems.

  • Provideexpertisein the operationalization and deployment of enterprise RAG systems in production on the NVIDIA AI platform.

  • You will work with partners and customers to understand their technical needs and find enablement opportunities to expand adoption andutilizationof NVIDIA Generative AI and LLM products.

  • You will detail and communicate standard processes, build repeatable reference architecture, and understand solution trade-offs. Share findings and feedback to improve products and services.

  • Sc degree in Computer Science, Software Engineer, ML Engineer, or related fields (or equivalent experience).

  • 5+ years of relevant work experience in developing and deploying ML models and enterprise applicationssuchas a Software Engineer or ML Engineer usingPytorchor TensorFlow.

  • Excellent programming skills in Python with strong background in software design, debugging and optimization.

  • Proventrack recordof building enterprise RAG-basedsystemsusing open-sourceframeworkssuch asLlamaIndex,LangChain,Malevis, Haystack etc.

  • Experience developing production LLM powered applications and tools with natural language interface at scale.

  • Excellent practical knowledge of Generative AI and LLMs. Ability to train BERT, GPT and Megatron Models for information retrieval and RAG applications.

  • Excellent communication and presentation skills to effectively collaboratewith


Ways to stand out from the

  • expertiseand hands-on experience with NVIDIA AI products. Some products of interest includeacceleratedML (RAPIDS and Sparks-RAPIDS), Natural Language Processing and Large Language Models (NVIDIANeMo), and Generative AI technologies (AI Foundations).

  • Experience deploying RAG into production at scale across a range of models and platforms.

  • Understanding ofMLOpslife cycle and experience withLLMOpsworkflows.

  • Experience in customer facing role as well as ability to scope projects and estimate required effort to build end-to-end applications.

You will also be eligible for equity and .