Key responsibilities include:* Developing strategies and algorithms for mining large amounts of data numbering in the billions for the purposes of targeted model training. * Addressing challenges in automatic evaluation of generative results, identification and classification of failure cases, and strategies for assessing their prevalence and severity.* Streamlining human-in-the-loop processes for dataset construction, and creating and implementing systems that execute on strategies to account for subjectivity and human error.* Synthesis of training data as well as synthesis of augmentations to real-world data.