We are seeking a highly skilled and motivated Machine Learning Engineer with a solid focus on Large Language Models (LLMs) and text processing. In this role, you will work on building, fine-tuning, optimising, and deploying LLMs for real-world applications. You'll also be responsible for generating scripts using LLMs, customising models using platforms like Ollama, and integrating them into robust applications. Design and implement solutions leveraging LLMs for text generation, understanding, summarisation, classification, and extraction tasks. Customise and fine-tune LLMs using domain-specific data with tools like Ollama and LoRA/QLoRA. Optimize LLMs for inference speed, accuracy, and resource efficiency (quantisation, pruning, etc.). Integrate LLM functionalities into end-user applications with a strong emphasis on usability and performance. Collaborate with product, design, and engineering teams to translate business requirements into ML-based solutions. Develop APIs and micro-services for model serving and deployment. Monitor model performance and implement continuous improvement and retraining strategies. Stay updated with the latest research and trends in the LLM space and propose innovative ideas.