המקום בו המומחים והחברות הטובות ביותר נפגשים
Job Summary
In this role, you will serve as a bridge between Red Hat and our peer IBM Research team – this will involve participating in the development of novel techniques and research extensions alongside IBM Research. This role also requires engaging with related upstream open source communities and projects. You will develop working relationships across multiple teams, planning and prioritizing sprint work across a small team, along with direct contribution to development projects at a senior level of ability.
What you’ll do
Design, implement, and optimize AI tooling and systems to improve the quality and relevance of generated content produced by models created by the end-to-end pipeline.
Develop retrieval mechanisms to efficiently access and leverage external data sources.
Train and fine-tune generative models with retrieved data to enhance performance and accuracy.
Evaluate model performance and iterate on improvements based on metrics and user feedback.
Work closely with data scientists, product managers, and other stakeholders to understand requirements and deliver effective solutions.
Participate in code reviews and collaborate on best practices within the engineering team.
Stay up-to-date with the latest advancements in AI, natural language processing (NLP), RAG methodologies, and other related technologies.
Participate in upstream generative AI model projects such as lm-eval, ragas.io, LlamaIndex, LangChain, Hugging Face Transformers, vllm, pytorch, etc.
Document system designs, processes, and model performance for transparency and future reference.
Report on project status, challenges, and results to stakeholders.
Serve as a subject matter expert for your assigned component, providing mentorship and expertise to build knowledge and capabilities within Red Hat teams.
Gather and analyze user feedback to refine and enhance AI tooling.
What you’ll bring
Bachelor's degree in computer science or equivalent.
Advanced programming skills in Python and SQL.
Experience in or familiarity with machine learning frameworks (e.g., TensorFlow, PyTorch ).
Understanding of natural language processing (NLP) techniques and models.
Familiarity with retrieval-augmented generation architectures and methodologies.
Experience with data processing and manipulation libraries.
Knowledge of microservices and containerization technologies (e.g., Kubernetes) for LLM deployment.
Strong self-motivation and organizational skills.
Demonstrated ability to context switch between multiple concurrent projects.
Outstanding mentorship and coaching skills.
Excellent written and verbal communication skills.
Positive attitude and willingness to share ideas openly.
Bonus qualifications
Masters or PhD in Machine Learning (ML) / Natural Language Processing (NLP).
Experience with unit testing, integration testing, and performance testing.
Familiarity with participating in an agile development team.
משרות נוספות שיכולות לעניין אותך