The point where experts and best companies meet
Share
• Develop and own the deployment pipelines for integrating and deploying LLM models into production environments, including rigorous testing and validation stages.
• Design and implement the ML inference service that orchestrates the interaction between multiple models, such as issue prediction, item recommendation, and response generation, to provide coherent and contextual responses.
• Integrate with various cross-team services to enable retrieval-augmented generation (RAG) systems, combining language models with external knowledge sources and actuation capabilities to deliver informational and action-oriented responses.
• Enhance observability and logging mechanisms to proactively identify and troubleshoot issues, and maintain dialogue state for offline training and model improvement.
• Collaborate closely with Product Managers, UX designers, Applied Scientists, and experienced Software Development Engineers to effectively apply machine learning models and deliver high-quality conversational AI experiences.
• Expertise in designing and implementing real-time inference services for large-scale conversational AI systems, with a focus on optimizing model performance for low latency and high transactions per second (TPS) in a customer service context.
• Strong understanding of LLM (Large Language Model) domain, including experience with model compression techniques, quantization, and efficient serving strategies for conversational AI applications. Ability to balance model accuracy with performance constraints in production environments.
Key job responsibilities
As a key member of the engineering team, you will have influence on our product strategy by helping define the product features, refine system architecture, and follow best practices that enable a quality product. You will be successfully setting the foundation for the next phase of the product and beyond. A commitment to teamwork, hustle, and strong communication skills (to both business and technical partners) are absolute requirements. Creating a reliable, scalable, and high-performance service requires exceptional technical expertise, a sound understanding of the fundamentals of Computer Science, and practical experience building large-scale distributed systems.
A day in the life
Benefits Summary:1. Medical, Dental, and Vision Coverage
2. Maternity and Parental Leave Options
3. Paid Time Off (PTO)
4. 401(k) Plan
- 3+ years of non-internship professional software development experience
- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience programming with at least one software programming language
- 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalent
These jobs might be a good fit