Expoint - all jobs in one place

מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר

Limitless High-tech career opportunities - Expoint

Nvidia Senior Math Libraries Engineer 
United States, Texas 
268125946

24.06.2024

What you'll be doing:

As a Sr. Engineer specializing on optimizing for single and multi-processor systems you will be responsible for:

  • Developing scalable HPC math library software using modern tools and languages for various numerical methods including but not limited to Fourier Transforms.

  • Performance tuning, optimization, and benchmarking of algorithms on various architectures.

  • Lead various projects to completion and work with colleagues across teams.

  • Working closely with product management and other internal and external partners to understand feature and performance requirements and contribute to the technical roadmaps of libraries,

  • Find opportunities to improve library performance and abstractions that allow to re-architect code for reduced maintenance cost.

  • Your projects are by nature complex and will require you to find and explain proposed solutions, exercise leadership, and coordinate with multiple teams to achieve your objectives.

What we need to see:

  • MSc or PhD degree in Computer Science, Applied Math, or related science or engineering field of study or equivalent experience.

  • 3+ years experience developing, debugging, and optimizing high-performance parallel numerical applications on modern computing platforms, preferably with GPU acceleration using CUDA.

  • Excellent object-oriented software design and C++ programming skills, including functional and performance tests design.

  • Deep understanding of fundamental signal processing, linear algebra and computations in science, engineering, or deep learning.

  • Proven experience in leading and completing software development projects.

  • Excellent collaboration, communication, and documentation habits.

Ways to stand out from the crowd:

  • Exposure to: functional languages, C++ template metaprogramming, MLIR or LLVM internals.

  • Knowledge of Python language and ecosystem.

  • Good knowledge of compute and communication hardware architecture.

  • Experience developing distributed memory parallel computing software with MPI or a PGAS library (eg, NVSHMEM).

You will also be eligible for equity and .