What You'll Be Doing:
Study and develop groundbreaking techniques in deep learning, graphs, machine learning, and data analytics, and perform in-depth analysis.
Collaborate with developers and cross-functional teams to identify current and emerging challenges.
Design and implement end-to-end generative AI solutions, specializing in Large Language Model (LLM) training, efficient deployment strategies, and sophisticated Retrieval-Augmented Generation (RAG) workflows.
What We Need to See:
MS (or equivalent experience) with 6+ years of software development, including 2+ years of relevant work experience developing and deploying AI solutions
Proven full-stack development experience with a focus on improving application performance and user experience
Proficiency in Python, C++ programming, and Deep Learning frameworks
Ability to work independently and as part of a team
Motivated self-starter with strong analytical and debugging skills
Ability to balance multiple simultaneous projects
Excellent verbal and written communication skills
Ways to Stand Out from the Crowd:
Experience with CUDA programming, and with benchmarking and analyzing the performance of agentic AI systems
Expertise in training, fine-tuning, and evaluating LLMs using popular frameworks such as TensorFlow or PyTorch
Proficiency in model deployment and optimization techniques for efficient inference on various hardware platforms
Experience with NVIDIA GPUs and software libraries, such as NVIDIA NeMo Framework, NVIDIA Triton Inference Server, TensorRT, TensorRT-LLM
We have some of the most forward-thinking and hardworking people working for us.
You will also be eligible for equity and .