Expoint - all jobs in one place

מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר

Limitless High-tech career opportunities - Expoint

Nvidia Senior Distributed Acceleration Engineer RAPIDS 
United States, Texas 
828838319

01.12.2024

In this role, you will develop, benchmark, and explore novel tuned custom solutions, as well as existing open-source engines, like Dask, Ray, Spark, and more, which can achieve high-performance goals for multi-GPU workloads. This is a great chance to take advantage of your distributed systems knowledge, CUDA C++, and Python programming skills. You’ll work closely with the RAPIDS group of stellar engineers building highly optimized multi-GPU CUDA libraries.

What you'll be doing:

  • Analyze, design, and implement optimized GPU algorithms for large-scale data analytics and machine learning

  • Architect and implement distributed GPU algorithms for dense multi-GPU single-node machines and more generally for multi-GPU multi-node environments

  • Expand and improve integration of RAPIDS into relevant high-level frameworks

  • Drive performance analysis, benchmarking, and troubleshooting of associated libraries.

  • Collaborate with a multi-functional team to understand requirements and implement or improve solutions

What we need to see:

  • MS or PhD in Computer Science, Computer Engineering or Electrical Engineering or related field in Deep Learning, Machine Learning, and Computer Vision or equivalent experience.

  • 5+ years of proven experience in Computer Science, Artificial Intelligence, Applied Math, or related field

  • Strong analytical problem-solving skills, algorithms, and mathematics fundamentals.

  • Distributed System experience and development

  • Excellent software development skills: programming, debugging, performance analysis, and test design

  • Good communication and documentation habits.

  • Ability to work independently and manage your own development efforts.

Ways to stand out from the crowd:

  • Experience developing distributed algorithms and running on distributed systems: HPC, Cloud, etc

  • Experience with debugging multi-language and multi-hardware systems

  • Experience with the PyData Stack: NumPy, Pandas, Scikit-Learn, Dask,

  • Prior work on open-source projects

  • GPU programming knowledge is a plus, but if you don’t have it, we’re happy to teach you

You will also be eligible for equity and .