Expoint - all jobs in one place

המקום בו המומחים והחברות הטובות ביותר נפגשים

Limitless High-tech career opportunities - Expoint

MongoDB Senior Director Engineering Inference Platform 
United States, California, San Francisco 
559998892

14.04.2025
Responsibilities
  • Lead and Manage the Voyage Inference Engineering Team – Oversee engineering efforts to design, develop, and scale our inference platform, ensuring high performance, reliability, and efficiency
  • Architect and Optimize Large-Scale Inference Pipelines – Work closely with Research Engineers to deploy new embedding models and ensure seamless integration into production environments
  • Develop and Scale API Endpoints – Design and maintain the Voyage API to support real-time inference at scale
  • Integrate with MongoDB Atlas – Work with cross-functional teams to integrate the inference platform into Atlas, enhancing MongoDB’s AI-powered search and vector capabilities
  • Recruit and Build a High-Performance Team – Attract, hire, and mentor top-tier engineering talent, fostering a culture of innovation, collaboration, and technical excellence
  • Define Product and Engineering Strategy – Collaborate with product, research, and engineering leaders to define the long-term vision, roadmap, and architecture of the inference platform
  • Ensure Operational Excellence – Drive best practices for monitoring, reliability, and performance of ML inference services in production
  • Stay Ahead of AI Trends – Keep up with advancements in ML inference, vector search, distributed computing, and hardware acceleration to maintain MongoDB’s leadership in AI-driven search
Requirements
  • 10+ years of engineering leadership experience, including managing multiple teams and scaling large, distributed systems
  • Proven experience building and maintaining ML inference platforms in production
  • Deep expertise in distributed systems, large-scale data processing, and search infrastructure (e.g., Lucene, Elasticsearch, or similar technologies)
  • Strong understanding of ML model deployment, vector search, embeddings, and inference optimizations
  • Experience working with cloud-native architectures and platforms like AWS, GCP, Azure, and Kubernetes
  • Proficiency in high-performance API development and integrating ML pipelines into production systems
  • Excellent leadership, strategic thinking, and ability to influence cross-functional stakeholders
  • Strong technical background in Python, C++, Java, or Go and experience with ML frameworks like TensorFlow, PyTorch, or ONNX is a plus
  • Experience in hiring, mentoring, and scaling world-class engineering teams
Why Join Us?
  • Own and lead the development of a core AI/ML platform within MongoDB
  • Work with cutting-edge ML and inference technologies to shape the future of AI-driven search
  • Collaborate with world-class engineers and researchers on complex, high-scale distributed systems
  • Drive technical and product strategy at a high-growth, industry-leading company
$363,000 USD