You’ll play a key role in establishing scalable processes for creating and evaluating high-quality training data, directly impacting the quality of AI Search.
This involves collaborating with engineers, UX designers, and technical writers to define best practices, ensure consistency, and ultimately scale our clients' UX capabilities from a data quality standpoint.
Top Daily Responsibilities:
- Create and curate platinum sets of high-quality training data to refine our clients' language models, enhancing their ability to generate informative and engaging summaries.
- Work with engineers, UX designers, and technical writers to understand how to scale the development of platinum sets and quality assessments for data curation.
- Maintain consistency and quality across a large-scale data curation program: adhere to established style guidelines and ensure the overall quality and coherence of the generated datasets.
- Develop frameworks for creating platinum/calibration and golden sets.
Mandatory Skills / Qualifications:
- Education: Bachelor's degree in Communications, Journalism, English, or a related field of study.
- Years of experience: 3-5 years of experience with Data Science in Tech or related fields.
- Strong communication and interpersonal skills: Ability to collaborate effectively with engineers, UX designers, and technical writers to understand their needs and translate them into data requirements.
- Ability to clearly document processes and guidelines: This ensures that data curation practices are well-defined, consistent, and easily understood by others.
Non-Essential Skills / Qualifications:
- Understanding of natural language processing (NLP): Familiarity with basic NLP concepts and techniques can be helpful for understanding how language models work and how data quality impacts their performance.