What you will do
Write robust Python and C++ while working on vLLM systems, high-performance machine learning primitives, performance analysis and modeling, and numerical methods
Contribute to the design, development, and testing of various inference optimization algorithms
Participate in technical design discussions and provide innovative solutions to complex problems
Give thoughtful and prompt code reviews
Mentor and guide other engineers and foster a culture of continuous learning and innovation
What you will bring
Extensive experience writing high-performance code for GPUs and deep knowledge of GPU hardware
Strong understanding of computer architecture, parallel processing, and distributed computing concepts
Experience with tensor math libraries such as PyTorch
Modern C++, CUDA, Triton, and CUTLASS experience
Experience with mathematical software, especially linear algebra or signal processing
Experience optimizing kernels for deep neural networks
Experience with NVIDIA Nsight is a plus
Strong communication skills with both technical and non-technical team members
BS or MS in computer science, computer engineering, or a related field. A PhD in an ML-related domain is considered a plus
The salary range for this position is $170,770.00 - $281,770.00. Actual offer will be based on your qualifications.
Pay Transparency
● Comprehensive medical, dental, and vision coverage
● Flexible Spending Account - healthcare and dependent care
● Health Savings Account - high deductible medical plan
● Retirement 401(k) with employer match
● Paid time off and holidays
● Paid parental leave plans for all new parents
● Leave benefits including disability, paid family medical leave, and paid military leave