The point where experts and best companies meet
Share
Key job responsibilities
Research and evaluate existing large language models and foundation models to identify best model training practices and potential areas of improvement.Develop a comprehensive training strategy, including data preprocessing, model initialization, and optimization techniques.Leverage distributed training techniques to train the foundation model efficiently.Monitor the training process and make adjustments to the strategy as needed to improve convergence and performance.Design a comprehensive evaluation suite to assess the model's performance across a range of metrics and different applications.
- PhD, or Master's degree and 6+ years of applied research experience
- 3+ years of building machine learning models for business application experience
- Experience with neural deep learning methods and machine learning
- Experience programming in Java, C++, Python or related language
- Experience with large-scale data analysis and machine learning model development, preferably in the context of natural language understanding or generation
- Experience in patents or publications in reputable conferences or journals in areas such as machine learning, NLP, or AI.
These jobs might be a good fit