Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

Red hat Principal GPU Kernel Engineer AI Inference 
United States, Massachusetts, Boston 
579054555

17.04.2025

What you will do

  • Write robust and modern C++, CUDA, CUTLASS, and Triton kernels working on high-performance machine learning primitives, performance analysis and modeling, and numerical methods

  • Contribute to the design, development, and testing of various inference optimization algorithms

  • Participate in technical design discussions and provide innovative solutions to complex problems

  • Give thoughtful and prompt code reviews

  • Mentor and guide other engineers and foster a culture of continuous learning and innovation

What you will bring

  • Extensive experience in writing high performance code for GPUs and deep knowledgeof GPU hardware

  • Strong understanding of computer architecture, parallel processing, and distributed computing concepts

  • Experience with tensor math libraries such as PyTorch

  • Modern C++, CUDA, Triton, and CUTLASS experience

  • Mathematical software, especially linear algebra or signal processing

  • Experience optimizing kernels for deep neural networks

  • Experience with NVIDIA Nsight is a plus

  • Strong communications skills with both technical and non-technical team members

  • BS, or MS in computer science or computer engineering or a related field. A PhD in a ML related domain is considered a plus

The salary range for this position is $189,600.00 - $312,730.00. Actual offer will be based on your qualifications.

Pay Transparency

● Comprehensive medical, dental, and vision coverage

● Flexible Spending Account - healthcare and dependent care

● Health Savings Account - high deductible medical plan

● Retirement 401(k) with employer match

● Paid time off and holidays

● Paid parental leave plans for all new parents

● Leave benefits including disability, paid family medical leave, and paid military leave