As a Principal Applied Scientist on our team, you'll be responsible for and will engage in:
- Driving projects from design through conception, implementation, experimentation and finally shipping to our users. This requires deep diving into data to identify gaps, coming up with heuristics and possible solutions, using LLMs to create the right model or evaluation prompts, and setting up the engineering pipeline or infrastructure to run them.
- Documenting progress & processing, assisting & guiding junior team members, aligning & unblocking them with other stakeholders in timezones.
- Coming up with evaluation techniques, datasets, criteria and metrics for model evaluations. These are often SOTA models or metrics / datasets.
- Hands on pre-training, fine-tuning, use of language models, including dataset creation, filtering, review, and continuous iteration. This may also require understanding of training frameworks, formats, checkpoints, stacks such as megatron.