מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר
What you will be doing:
Analyze the performance of application code running on NVIDIA GPUs with the aid of profiling tools.
Construct bridge between Graph compilers like PyTorch, OAI Triton and MLIR/LLVM based middle end optimizer.
Identify opportunities for performance improvements in MLIR and LLVM based compiler middle end optimizer.
Design and develop new compiler passes and optimizations to produce best-in-class, robust, supportable compiler and tools.
Interact with Open-source MLIR/LLVM community to ensure tighter integration.
Work with geographically distributed compiler, hardware and application teams to oversee improvements and problem resolutions.
Be part of a team that is at the center of deep-learning compiler technology spanning architecture design and support through higher level languages.
What we need to see:
M.S in Computer Science, Computer Engineering, or related fields (or equivalent experience).
5+ years experience in Compiler Optimizations such as Loop Optimizations, Inter-procedural optimizations and Global optimizations.
Excellent hands-on C++ programming skills.
Understanding of any Processor ISA (GPU ISA would be a plus).
Strong background in software engineering principles with a focus on crafting robust and maintainable solutions to challenging problems.
Good communication and documentation skills and self-motivated.
Ways for you to stand out from the crowd:
PhD preferred
Experience in developing applications in CUDA or other parallel programming language.
Deep understanding of parallel programming concepts.
MLIR, LLVM and/or Clang compiler development experience.
Familiarity with deep learning frameworks and NVIDIA GPUs.
You will also be eligible for equity and .
משרות נוספות שיכולות לעניין אותך