

What you'll be doing:
Design, build and optimize agentic AI systems for the CUDA ecosystem.
Co-design agentic system solutions with software, hardware and algorithm teams; influence and adopt new capabilities as they become available.
Develop reproducible, high-fidelity evaluation frameworks covering performance, quality and developer productivity.
Collaborate across the AI stack, from hardware through compilers/toolchains, kernels/libraries, frameworks, distributed training, and inference/serving, and with model/agent teams.
What we need to see:
Bachelor’s degree in Computer Science, Electrical Engineering, or related field (or equivalent experience); MS or PhD preferred.
5+ years of industry or academic experience in AI systems development; exposure to building foundation models, agents, or orchestration frameworks; hands-on experience with deep learning frameworks and modern inference stacks.
Strong C/C++ and Python programming skills; solid software engineering fundamentals.
Experience with GPU programming and performance optimization (CUDA or equivalent).
Ways To Stand Out From The Crowd:
Track record building/evaluating deep learning models, coding agents and developer tooling.
Demonstrated ability to optimize and deploy high-performance models, including on resource-constrained platforms.
Deep expertise in GPU performance optimizations, evidenced by benchmark wins or published results.
Publications or open-source leadership in deep learning, multi-agent systems, reinforcement learning, or AI systems; contributions to widely used repos or standards.
You will also be eligible for equity and benefits.