Expoint - all jobs in one place

מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר

Limitless High-tech career opportunities - Expoint

Nvidia Windows AI Software Engineer 
United States, California 
535293096

31.07.2024

What You’ll Be Doing:

  • Partnering with NVIDIA software, research, architecture, and product teams, aligning strategies and technical needs for fostering the ecosystem of AI on a Windows RTX PC.

  • Perform in-depth analysis and optimization of AI models, AI frameworks, data processing pipelines, and inference backends to ensure the best performance on current and next-generation GPU architectures.

  • Identify and implement compute and memory optimizations across the full AI inference stack on RTX Windows PC to ensure the best performance and quality of service.

  • Develop model compression and fine-tuning techniques to reduce resource consumption and improve performance, enabling efficient deployment and better user experience.

  • Design and implement an optimized framework for running AI NPCs in gaming applications as part of the NVIDIA ACE Platform.

  • Collaborate with Microsoft to drive the advancements in APIs, AI frameworks, and platforms for developing and deploying AI inferencing applications.

  • Ensure the effective deployment of directed tests through collaboration with the automation team, thereby ensuring the robustness of automated testing.

What We Need to See:

  • Bachelor's, Master's, or PhD in Computer Science, Software Engineering, Mathematics, or a related field (or equivalent experience).

  • 2+ years of experience in AI inferencing pipelines and applications using ML/DL frameworks like PyTorch, ONNX Runtime, DirectML preferred.

  • Excellent C++ programming and debugging skills with a strong understanding of data structures and algorithms.

  • Strong analytical and problem-solving abilities, with the capacity to multitask effectively in a dynamic environment.

  • Outstanding written and oral communication skills, enabling effective collaboration with management and engineering teams.

Ways To Stand Out From The Crowd:

  • Understanding of modern techniques in Machine Learning, Deep Neural Networks, and Generative AI with relevant contributions to major open-source projects will be a plus.

  • Proficiency in lower-level system/GPU programming, CUDA, and developing high-performance systems.

  • Hands-on experience with building applications using graphics APIs like OpenGL, DirectX, Vulkan, etc.

  • Consistent track record of delivering end-to-end products with geographically distributed teams in multinational product companies.

You will also be eligible for equity and .