Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

Nvidia Accelerated Compute Systems Performance Architect -- Intern 
China, Shanghai 
50309586

24.06.2024

What you'll be doing:

  • Performing in-depth analysis and optimization to ensure the best possible performance on current and/or next-generation NVIDIA GPUs.

  • Understanding and analyzing the interplay of hardware and software architectures on core algorithms, programming models, and applications.

  • Actively collaborating with the hardware design, software engineering, product, and research teams to guide the direction of accelerated computing.

  • Diving into accelerated computing applications to facilitate software-hardware co-design.

  • Write up and present your work by writing white papers, conference publications, official blog posts, patent applications, etc.

What we need to see:

  • A passion for performance analysis and optimization.

  • Hands-on experience with the massively parallel GPU programming model, e.g. CUDA or OpenCL. Familiarity with APIs for multi-node communication, like MPI or OpenSHMEM/NVSHMEM, is a plus.

  • Solid background in GPU and computer systems architecture.

  • Strong knowledge of C and C++ with a solid understanding of software design, programming techniques, and algorithms. Familiarity with Python is a plus.

  • Good communication and organization skills, with a logical approach to problem solving, good time management, and task prioritization skills.