Expoint - all jobs in one place


Amazon ML Data Linguist - BDB Support English Bedrock 
Portugal, Miragaia e Marteleira 
711004885

16.09.2024
DESCRIPTION


Amazon Web Services (AWS) is looking for a data associate to help with annotations, data analysis and quality assurance. As part of the AI Data Team at AWS, you will be responsible for delivering high-quality training data to ensure the best performance of the AWS machine learning systems.

Key job responsibilities
* Conduct regular training sessions, monitor performance, provide feedback, and analyze calibration test results to identify trends, knowledge gaps, and areas for improvement within the BDB Team.
* Build a thorough understanding of data collection and annotation guidelines and various annotation tools.
* Annotate, generate and QA data, identifying linguistic categories and adhering to detailed annotation guidelines.
* Perform annotation-related tasks and participate in data generation, collection and quality assurance tasks.
* Collaborate with other ML Data Linguists to resolve data ambiguities and annotation disagreements.
* Dive deep into the data to perform qualitative error trend analysis, and devise action plans to improve data quality.
* Provide feedback to Language Engineers and Scientists on tool improvements and annotation processes.
* Dive deep into issues and implement solutions independently.
* Contribute to process improvements to reduce handling time and improve resource output.
* Develop a variety of language artifacts crucial for model development such as datasets for training and evaluation.
* Collaborate with LEs, scientists, and Ops Manager to innovate processes, tracker automations, and workflows.

BASIC QUALIFICATIONS

* Bachelor's degree in Linguistics, Philosophy, Cognitive Science, a foreign language, or Literature.
* Strong communication skills and comfort leading group calls and training sessions and delivering feedback.
* Proficiency in American English vocabulary, sentence structure and nuances and ability to assess naturalness in a wide range of contexts.
* Ability to identify linguistic ambiguity and other inaccuracies in linguistic data, identify basic parts of speech, and produce reports on analyzed data.
* Experience with natural language data labeling, data annotation, linguistic annotation or other forms of data markup.
* Teaching experience and/or experience leading a team of peers.
* Knowledge of different domains such as Finance, Health Care, and/or Insurance.
* Ability to generate innovative and diverse inputs to explore various aspects of an AI model's capabilities.
* Familiarity with JSON, YAML, XML or other forms of text markup.
* Ability to navigate a Unix terminal and use common command-line tools.
* Knowledge of Python, Java or any other scripting language.
* Strong organizational and leadership skills and attention to detail.
* Comfortable working in a fast-paced, collaborative work environment.
* Ability to start at 8 am EST.


PREFERRED QUALIFICATIONS

* Master's degree in a relevant field, such as Linguistics, Communications, a foreign language, computational linguistics or other language or data-related disciplines.
* Proficient in a foreign language.
* Familiarity with common text processing tools.
* Passion for language, linguistics, human language technology and AI.
* Ability to work in different operating systems (Windows, macOS, or Linux).
* Strong understanding of NLP concepts and techniques.

Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.