Key job responsibilities
- Design and implement advanced deployment pipelines for seamless integration of LLM models into production environments, ensuring rigorous testing and validation protocols.
- Lead the creation of a sophisticated ML inference service that orchestrates complex interactions between multiple models, including issue prediction, item recommendation, and response generation, to deliver coherent and contextual responses.
- Innovate and implement observability and logging mechanisms for proactive issue identification, troubleshooting, and maintenance of dialogue states crucial for offline training and continuous model improvement.
A day in the life
Benefits Summary:
1. Medical, Dental, and Vision Coverage
2. Maternity and Parental Leave Options
3. Paid Time Off (PTO)
4. 401(k) Plan
- 5+ years of non-internship professional software development experience
- 5+ years of programming experience with at least one software programming language
- 5+ years of experience leading the design or architecture (design patterns, reliability and scaling) of new and existing systems
- Experience as a mentor, tech lead or leading an engineering team
- 5+ years of experience across the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
- Bachelor's degree in computer science or equivalent
- Proven expertise in architecting and optimizing real-time inference services for large-scale conversational AI systems, with a focus on achieving ultra-low latency and high transactions per second (TPS) in mission-critical customer service environments.
- Deep understanding of the LLM domain, including extensive experience with advanced model compression techniques, quantization methods, and efficient serving strategies for high-performance conversational AI applications.
- Demonstrated ability to balance model accuracy with stringent performance constraints in large-scale production environments.
- Track record of technical leadership in developing and deploying AI-driven solutions that significantly impact customer experience and operational efficiency.