Expoint – all jobs in one place
המקום בו המומחים והחברות הטובות ביותר נפגשים
Limitless High-tech career opportunities - Expoint

Nvidia Solutions Architect Data Science 
United States, Oregon 
419174036

Yesterday
Singapore, Singapore-Suntec Tower
Thailand, Remote
time type
Full time
posted on
Posted 14 Days Ago
job requisition id

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people.

A Solution Architect is the first line of technical expertise between NVIDIA and our customers, as well as our partners. Your duties will vary from solutions design, industry and marketing speaking engagements, training delivery, troubleshooting, project coordination, customer relationship management and more. In this role you will primarily support Thailand, but will need to support the wider South East Asia region if required.

What you'll be doing:
  • Identify new opportunities for NVIDIA technology and solutions, by effectively communicating and positioning the product portfolio and analysing customer requirements.

  • Independently lead technical sales activities in collaboration with the extended team, with the goal of providing total solutions.

  • Identify customer requirements and develop complete generative AI solutions designs using NVIDIA and our ecosystem partners' products.

  • Work closely with ISVs, Solution Integrators and OEMs in solution development and delivery, including proof of concepts.

  • Collaborate with sales and business development teams to support pre-sales activities, including technical presentations and demonstrations of LLM and RAG capabilities.

  • Provide feedback to NVIDIA's product management teams, contributing to the evolution of generative AI technologies.

  • Lead workshops and trainings on NVIDIA's technologies

  • Develop cloud native, multi-agent designs leveraging LLM and RAGs for specific use cases.

  • Help customers to optimise training and inference performance runtime, at scale.

What we need to see:
  • A passion for customer success and innovative AI solutions. Someone motivated, able to work independently, proactively with minimal direction.

  • Exceptional communication skills, good interpersonal skills, and enjoy collaborative work.

  • 5+ years of hands-on experience in a technical role, specifically focusing on generative AI, with emphasis on training Large Language Models (LLMs).

  • BSc or Master in Data Science, Computer Science or related subjects.

  • Proven track record of deploying and optimizing LLM models in production environments.

  • Expertise in training and fine-tuning LLMs using popular frameworks such as TensorFlow and PyTorch.

  • Superb communication and collaboration skills with the ability to explain complex technical concepts to both technical and non-technical audiences.

  • Experience leading workshops, training sessions, and presenting technical solutions to diverse audiences.

  • Professional or Native language proficiency in English and Thai

  • Ability to travel up to 30% of the time to support customer in Thailand and South East Asia.

Ways to stand out from the crowd:
  • Hands-on experience with NVIDIA's NeMo SDKs and Triton Inference Server

  • Demonstrable ability to optimize LLM models for inference speed, memory efficiency, and resource utilization.

  • Familiarity with containerization technologies (e.g., Docker) and orchestration tools (e.g., Kubernetes) for scalable and efficient model deployment.

  • Professional fluency in one of these languages: Mandarin