Expoint – all jobs in one place
המקום בו המומחים והחברות הטובות ביותר נפגשים
Limitless High-tech career opportunities - Expoint

Nvidia Solutions Architect AI Hyperscalers 
United States, California 
748870638

Yesterday
US, CA, Santa Clara
time type
Full time
posted on
Posted 4 Days Ago
job requisition id

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by extraordinary technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing.

What you’ll be doing:

  • As a key technical member of a focused account team, you will serve as the main point of contact for NVIDIA products, enabling internet giants and cloud providers to have an innovative AI/ML software infrastructure.

  • Work directly with best-in-class engineering teams to secure design wins, address challenges, bring solutions to production, and support them throughout their lifecycle.

  • Become a trusted advisor to your customer by understanding their environment, constraints, and long-term strategy. Translate these insights into product requirements and innovative solutions.

  • Help your customer enhance the value of NVIDIA technology, and provide feedback to NVIDIA for future product improvements.

  • Facilitate the resolution of customer issues, offering timely and proactive communications to mitigate risks.

  • Lead workshops, demos, and proof-of-concepts to showcase NVIDIA’s AI/ML capabilities.

  • Guide customers on standard processes for scalable AI model deployment and inference optimization.

What we need to see:

  • Minimum of a BS/MS in Computer Science, Electrical Engineering, or equivalent experience.

  • At least 5+ years of engineering experience with a proven track record in AI/ML-focused projects or enterprise-grade solutions.

  • Proven understanding of Linux, including solving, optimization, and customization for AI/ML workloads.

  • Strong understanding of data science and machine learninginfrastructure—softwareand hardware.

  • Professional-level communication skills, including the ability to tailor messages for varying technical audiences and maintain composure in high-pressure situations.

  • Excellent follow-up and interpersonal skills, with a true passion for problem-solving.

  • Proficient in Python, with the ability to develop scripts and build custom tools. Experience with parallel programming or GPU acceleration (e.g., CUDA) is helpful.

  • Shown eagerness to learn and apply new technologies.

Ways to stand out from the crowd:

  • Experience with Chatbots, RAG pipelines, vector databases, and distributed training or inference workloads.

  • Experience or background in HPC (High Performance Computing) environments for AI or ML applications.

  • Familiarity with multi-node GPU clusters and performance tuning for large-scale AI workloads.

  • Experience developing in cloud and/or virtualized environments, containerized solutions, with knowledge of Docker, Kubernetes

  • Experience with common deep learning frameworks such as PyTorch or JAX.

You will also be eligible for equity and .