Job responsibilities
- Lead the design and development of our AI/ML platform, ensuring robustness, scalability, and high performance.
- Drive the adoption of best practices in software engineering, machine learning operations (MLOps), and data governance.
- Ensure compliance with data privacy and security regulations relevant to AI/ML solutions.
- Maintain consistent code check-ins every sprint to ensure continuous integration and development.
- Enable the Gen AI platform and implement the Gen AI Use cases ,LLM finetuning and multi agent orchestration.
- Communicate technical concepts and solutions effectively across all levels of the organization.
- Manage an AIML Engineering scrum team which includes ML engineers, Senior ML engineers and lead ML engineer.
- Quarterly performance check-ins and feedback to the individual team members.
- Release ownership and unblock the team wherever its needed.
- Help team members to grow in their career & create a positive environment.
Required Qualifications, Capabilities, and Skills
- Master's degree in a STEM field and 10+ years of experience in designing and managing large-scale AI & ML platforms and supporting systems.
- 5+ years of technical manager experience.
- Extensive practical experience with AWS cloud services, including EKS, EMR, ECS, and DynamoDB.
- Experience in Databricks ML lifecycle development.
- Advanced knowledge in software engineering, AI/ML, machine learning operations (MLOps), and data governance.
- Demonstrated prior experience in leading complex projects, including system design, testing, and ensuring operational stability.
- Expertise in computer science, computer engineering, mathematics, or a related technical field.
Preferred Qualifications, Capabilities, and Skills
- Real-time model serving experience with Seldon, Ray, or AWS SM is a plus.
- Understanding of large language model (LLM) approaches, such as Retrieval-Augmented Generation (RAG) and agent-based models, is a plus.