Share
Key job responsibilities
• Lead the development of CS large language model foundations with focus on model performance, latency optimization, and scalability
• Design and implement agentic AI architectures that seamlessly integrate automated chatbots with Associate tooling
• Drive optimization strategies including model quantization, parallelization, and caching to balance performance and latency
• Architect multi-lingual, multi-channel solutions spanning chat, email, and other text-based interfaces
• Partner with AGI and Bedrock teams to effectively leverage and optimize LLM infrastructure
• Guide technical strategy for transforming both customer experience and Associate operations through generative AI
• Master's degree in Computer Science, Machine Learning, or related field
• 10+ years of experience in natural language processing and machine learning
• Proven expertise in LLM optimization and inference techniques
• Strong background in model performance optimization and distributed systems
• Experience building production-scale chatbot or conversational AI systems
• Excellent communication and collaboration skills
• Track record of leading complex technical initiatives
• Ph.D. in related field
• Experience with model quantization, parallelization, and caching techniques
• Background in agentic AI or autonomous systems
• Expertise in multi-lingual NLP systems
• Experience with enterprise-scale LLM deployments
These jobs might be a good fit