Lead and Manage the Voyage Inference Engineering Team – Oversee engineering efforts to design, develop, and scale our inference platform, ensuring high performance, reliability, and efficiency
Architect and Optimize Large-Scale Inference Pipelines – Work closely with Research Engineers to deploy new embedding models and ensure seamless integration into production environments
Develop and Scale API Endpoints – Design and maintain the Voyage API to support real-time inference at scale
Integrate with MongoDB Atlas – Work with cross-functional teams to integrate the inference platform into Atlas, enhancing MongoDB’s AI-powered search and vector capabilities
Recruit and Build a High-Performance Team – Attract, hire, and mentor top-tier engineering talent, fostering a culture of innovation, collaboration, and technical excellence
Define Product and Engineering Strategy – Collaborate with product, research, and engineering leaders to define the long-term vision, roadmap, and architecture of the inference platform
Ensure Operational Excellence – Drive best practices for monitoring, reliability, and performance of ML inference services in production
Stay Ahead of AI Trends – Keep up with advancements in ML inference, vector search, distributed computing, and hardware acceleration to maintain MongoDB’s leadership in AI-driven search
Requirements
10+ years of engineering leadership experience, including managing multiple teams and scaling large, distributed systems
Proven experience building and maintaining ML inference platforms in production
Deep expertise in distributed systems, large-scale data processing, and search infrastructure (e.g., Lucene, Elasticsearch, or similar technologies)
Strong understanding of ML model deployment, vector search, embeddings, and inference optimizations
Experience working with cloud-native architectures and platforms like AWS, GCP, Azure, and Kubernetes
Proficiency in high-performance API development and integrating ML pipelines into production systems
Excellent leadership, strategic thinking, and ability to influence cross-functional stakeholders
Strong technical background in Python, C++, Java, or Go and experience with ML frameworks like TensorFlow, PyTorch, or ONNX is a plus
Experience in hiring, mentoring, and scaling world-class engineering teams
Why Join Us?
Own and lead the development of a core AI/ML platform within MongoDB
Work with cutting-edge ML and inference technologies to shape the future of AI-driven search
Collaborate with world-class engineers and researchers on complex, high-scale distributed systems
Drive technical and product strategy at a high-growth, industry-leading company