Expoint - all jobs in one place

המקום בו המומחים והחברות הטובות ביותר נפגשים

Limitless High-tech career opportunities - Expoint

Nvidia System Software Engineer - RAG 
United States, Texas 
765791998

24.06.2024

NVIDIA is looking for a

System Software Engineer - RAG

What You'll Be Doing:

  • Develop and optimize Python-based data processing frameworks, ensuring efficient handling of large datasets on GPU-accelerated environments, vital for LLM training.

  • Contribute to the design and implementation of RAPIDS and other GPU-accelerated libraries, focusing on seamless integration and performance enhancement in the context of LLM training data preparation and RAG pipelines.

  • Lead development and iterative optimization of components for RAG pipelines, ensuring they demonstrate GPU acceleration & the best performing models for improved TCO.

  • Collaborate with teams of LLM & ML researchers in the development of full-stack, GPU-accelerated data preparation pipelines for multimodal models Implement benchmarking, profiling, and optimization of innovative algorithms in Python in various system architectures, specifically targeting LLM applications.

  • Work closely with complementary teams to understand requirements, build & evaluate POCs, and develop roadmaps for production level tools and library features within the growing LLM ecosystem.

  • Build amazing products to improve employee productivity using Gen-AI & Co-pilot experiences!

  • Collaborate with your peers to craft, develop, test, and maintain integrated applications and features.

  • Develop integrated systems enabling unified experience across applications and driving insights for end-to-end user experience.

  • Help build and maintain our Continuous Delivery pipeline with the goal of moving changes to production faster and safer, while ensuring key operational standards.

  • Provide peer reviews to other specialists including feedback on performance, scalability, and correctness.

  • Actively contribute to the adoption of frameworks, standards, and new technologies

What We Need To See:

  • Bachelor’s or Master’s Degree program in Computer Science, Computer Engineering, or a related field (or equivalent experience).

  • 6+ years of demonstrated experience in a similar or related role

  • Python programming expertise with Deep Learning (DL) frameworks such as PyTorch.

  • Experience delivering software in a cloud context and is familiar with the patterns and process of handling cloud infrastructure

  • Knowledge of MLOps technologies such as Docker-Compose, Containers, Kubernetes, data center deployments etc.

  • Excellent in-depth hands-on understanding of NLP, LLM, MLLM, Generative AI , and RAG workflows

  • Self-starter with a passion for growth, enthusiasm for continuous learning and sharing findings across the team

  • Extremely motivated, highly passionate, and curious about new technologies.

  • Outstanding communication skills for distilling sophisticated topics down to understandable, impactful conclusions as well as the ability to work successfully with multi-functional teams, principals, and architects. Coordinates optimally across organizational boundaries and geographies.

You will also be eligible for equity and .