
Nvidia Senior Deep Learning Compiler Engineer - PyTorch 
Netherlands, Utrecht, Utrecht 
541602951

09.09.2025
Finland, Helsinki
Netherlands, Amsterdam
Poland, Warsaw
Switzerland, Zurich
Germany, Munich
Time type: Full time
Posted: 4 days ago

What you'll be doing:

As a key member of our team, you will be contributing directly to the future of accelerated AI. Your role will be dynamic and deeply technical, placing you at the center of compiler innovation. You will lead the design, implementation, optimization, and maintenance of the core compiler technologies that accelerate massive deep learning workloads. This is a highly collaborative role where you'll work alongside the very engineers who built PyTorch for NVIDIA hardware, helping to pioneer new features and stay at the forefront of framework development. You'll dive deep into performance analysis, scrutinizing workloads running on thousands of GPUs to find optimization opportunities that will shape the future design of Thunder. Furthermore, you will be part of a vibrant ecosystem, working closely with leading compiler, library, and systems teams—including experts behind nvFuser, TVM, XLA, and CUDA—to translate the latest research into practical, high-impact solutions for the open-source community.

What we need to see:

We are looking for engineers who are excited about building powerful, user-centric tools and are comfortable working in a fast-paced, collaborative environment. Here is some of the expertise we would like to see:

  • A Bachelor's, Master's, or Ph.D. in Computer Science or a related technical field (or equivalent experience).

  • 8+ years of relevant work experience.

  • A strong command of Python and experience building complex, well-tested software systems.

  • Hands-on experience with deep learning frameworks like PyTorch or JAX. You understand how models are built and where the performance challenges lie.

  • A solid foundation in compiler concepts such as abstract syntax trees (ASTs), intermediate representations (e.g., SSA form), program analysis, and code generation.

  • Excellent communication and collaboration skills, essential for working effectively in a distributed, open-source environment.

Ways to stand out from the crowd:

  • Previous contributions to deep learning compiler projects (e.g., TVM, MLIR, IREE) or deep learning frameworks themselves.

  • Deep expertise in the internals of PyTorch, particularly its compiler stack (TorchDynamo, TorchInductor).

  • Experience with JAX-like functional transformations and their application in a compiler context.

  • Familiarity with parallel programming, distributed systems, and writing high-performance CUDA code.

  • A track record of impactful participation in open-source communities, such as through code contributions, design discussions, or mentorship.