We are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field. In particular, we want candidates who:
- Have proven expertise in areas of interest, evidenced by an exceptional publication track record and significant technical leadership in high-impact projects
- Exhibit strong analytical skills, attention to detail, and a commitment to data-driven decision-making
- Have experience with and/or an in-depth understanding of large-scale distributed systems
- Demonstrate an ability to work collaboratively in a fast-paced, innovative environment
Responsibilities
- Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations
- Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack
- Collaborate closely with teams on infrastructure, data, post-training, and multimodality
- Embody our culture and values
Required/Minimum Qualifications
- Bachelor's Degree in Computer Science or a related technical discipline AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- Proven expertise in the area of pretraining
Additional or Preferred Qualifications
- Demonstrated experience in large-scale AI.
- Passion for conversational AI and its deployment.
- Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers.
- Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI.
- Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team.