Remote Data Annotator

Insight Global
CA, United States
$25 an hour
Remote
Full-time

Title : AI Data Trainer

Location : remote

Duration : contract-to-hire

Pay rate : $25 / hr

Work Authorization : US citizen or green card holder (USC / GC); no subcontracting during the contract duration

Overview :

Large language models are core to our client, and the data we collect is core to the language models we train. While the current iteration of LLMs is trained primarily on web text, the next generation of LLMs will rely on human annotation to create custom datasets to further develop the capabilities of these models.

We are looking for an AI Data Trainer to work closely with engineering and product teams and lead the creation of custom datasets for training specialized models that enable enterprise solutions built on the cutting-edge capabilities of LLMs.

This role requires a diverse set of skills and draws on a range of disciplines. We are therefore considering a broad range of backgrounds for this role, including ML, NLP, HCI, software engineering, and relevant linguistic and social sciences.

Key responsibilities :

  • Collaborate with Data Science and Product teams to define annotation tasks, coordinate resourcing, and review annotated data for quality
  • Develop and disseminate data labeling best practices learned from building enterprise solutions using LLMs
  • Develop labeled data assets according to annotation guides to train and evaluate LLMs in collaboration with Machine Learning Engineers for real-world use cases
  • Collaborate with centralized data and evaluation teams on specialized collection protocols, UIs, and instructions for diverse and creative human annotation tasks

Must Haves :

  • Bachelor’s degree in Linguistics, Library Science, or a related field (open to non-traditional backgrounds as well!)
  • Experience with ontology development and information domain modeling
  • Experience as an AI trainer labeling conversational text for analysis
  • Experience with AI interaction, such as prompt generation and open AIs
  • Experience running and managing human annotation jobs for large-scale data collection with quality control and best practices for human annotation
  • Proficiency with SQL and the terminal (command line)
  • Proficiency with Jupyter notebooks
  • Ability to follow complex instructions, navigate ambiguity, and work independently
  • Detail-oriented disposition and clear, concise communication skills
  • Curiosity about technology and a knack for tackling problems in creative ways

Plusses :

  • Proficiency in Japanese
  • Experience developing labeled data assets according to annotation guides to train and evaluate LLMs in collaboration with ML Engineers for real-world use cases
  • Experience collaborating with centralized data and evaluation teams on specialized collection protocols, UIs, and instructions for diverse and creative human annotation tasks

Compensation : $25.00 / HR

17 hours ago