Share
What you will do:
Prototype with Purpose: Rapidly build PoCs using the latest AI models and frameworks to enhance developer workflows across the PyTorch engineering lifecycle (code, test, document, debug, optimize).
Bridge Industry & Implementation: Evaluate innovations from open source, academia, and industry and assess how they can improve the way PyTorch is built and maintained.
Drive Developer Productivity: Create intelligent tools that support faster iteration, better testing, and smoother onboarding for PyTorch engineers—turning AI into leverage.
Embed in the Team: Work alongside core PyTorch engineers to understand friction points and infuse AI-powered solutions into existing workflows and CI pipelines.
Lead by Influence: Set technical direction in the space of AI-enhanced engineering productivity. Act as a multiplier for the team—mentoring, demoing, writing internal blogs, and making innovation accessible.
Contribute to the PyTorch Ecosystem: Your work will extend to open-source tooling or enhancements that benefit the broader PyTorch community.
What you will bring:
8+ years of experience in machine learning, software engineering, or systems design, with significant exposure to AI/ML infrastructure.
Deep understanding of LLMs, AI agents, vector search, prompt engineering, and developer productivity tools.
Strong Python programming skills and experience with PyTorch, Hugging Face Transformers, LangChain, vLLM, or OpenAI/Anthropic APIs.
Demonstrated ability to build fast, impactful prototypes that lead to production-ready tools.
Experience improving engineering workflows through AI tools—code generation, automated test generation, bug triage, documentation assistants, etc.
Familiarity with modern software engineering practices: CI/CD, DevOps, VS Code extensions, or internal tooling development.
Collaborative mindset and excellent communication skills—comfortable working with developers, QE, product managers, and leadership.
Nice to have:
Contributions to PyTorch or other open-source ML/AI frameworks.
Experience building developer tools or productivity platforms within ML orgs.
Familiarity with AI use cases in systems optimization, model compilation, or hardware acceleration.
Exposure to long-context transformer models and techniques like EasyContext, NoLiMa, or KV-cache paging.
These jobs might be a good fit