Expoint - all jobs in one place

מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר

Limitless High-tech career opportunities - Expoint

Red hat Senior Machine Learning Engineer AI Inference 
United States, Massachusetts, Boston 
878013849

17.04.2025

What you will do

  • Write robust Python and C++, working on vLLM systems, high performance machine learning primitives, performance analysis and modeling, and numerical methods

  • Contribute to the design, development, and testing of various inference optimization algorithms

  • Participate in technical design discussions and provide innovative solutions to complex problems

  • Give thoughtful and prompt code reviews

  • Mentor and guide other engineers and foster a culture of continuous learning and innovation

What you will bring

  • Extensive experience in writing high performance code for GPUs and deep knowledgeof GPU hardware

  • Strong understanding of computer architecture, parallel processing, and distributed computing concepts

  • Experience with tensor math libraries such as PyTorch

  • Modern C++, CUDA, Triton, and CUTLASS experience

  • Mathematical software, especially linear algebra or signal processing

  • Experience optimizing kernels for deep neural networks

  • Experience with NVIDIA Nsight is a plus

  • Strong communications skills with both technical and non-technical team members

  • BS, or MS in computer science or computer engineering or a related field. A PhD in a ML related domain is considered a plus

The salary range for this position is $170,770.00 - $281,770.00. Actual offer will be based on your qualifications.

Pay Transparency

● Comprehensive medical, dental, and vision coverage

● Flexible Spending Account - healthcare and dependent care

● Health Savings Account - high deductible medical plan

● Retirement 401(k) with employer match

● Paid time off and holidays

● Paid parental leave plans for all new parents

● Leave benefits including disability, paid family medical leave, and paid military leave