Talent.com
AI Evaluation Analyst

VirtualVocations - Boulder, Colorado, United States
Job type: Full-time

A company is looking for an AI Agent Evaluation Analyst.

Key Responsibilities

Review evaluation tasks and scenarios for logic, completeness, and realism

Identify inconsistencies, missing assumptions, or unclear decision points

Help define clear expected behaviors (gold standards) for AI agents

Required Qualifications

Excellent analytical thinking regarding complex systems and logical implications

Familiarity with structured data formats, such as JSON / YAML

Experience with policy evaluation, logic puzzles, or structured scenario design

Background in consulting, academia, or research

Some understanding of scoring or evaluation in agent testing


Related jobs
Remote Finance Advisor - AI Trainer

Data Annotation - Longmont, Colorado
Remote, Full-time +1
We are looking for a finance professional to join our team to train AI models. You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the q...
Senior Market Research Analyst - Computing & PC Markets

TechInsights - Greenwood Village, CO, US
Permanent
Our story: TechInsights is the information platform for the semiconductor industry. Regarded as the most trusted source of actionable, in-depth intelligence related to semiconductor innovation ...
Travel CT Technologist

ProKatchers - Fraser, CO, US
Full-time
ProKatchers is seeking a travel CT Technologist for a travel job in Fraser, Colorado. Job Description & Requirements. Nights - Sun, Mon, Tues and every other Wed nights 07:00 PM - 07:00 AM. In add...
Travel CT Technologist

Express Healthcare Staffing Colorado - Fraser, CO, US
Full-time +1
Express Healthcare Staffing Colorado is seeking a travel CT Technologist for a travel job in Fraser, Colorado. Job Description & Requirements. CT / X-Ray / POC Tech - ARRT Certified In addition to ...
Licensed Clinical Social Worker (Mental Health Therapist) - Nederland, CO

LifeStance Health - Nederland, CO, US
Full-time
At LifeStance Health, we believe in a truly healthy society where mental and physical healthcare are unified to make lives better. Our mission is to help people lead healthier, more fulfilling lives...
Remote AI Writing Evaluator

Outlier - Boulder, CO, United States
Remote, Full-time
Earn up to $15/hour + performance bonuses. Outlier, a platform owned and operated by Scale AI, is looking for. If you're passionate about improving models and excited by the future of AI, this is you...
AI Trainer

VirtualVocations - Boulder, Colorado, United States
Full-time
A company is looking for an AI Trainer / Senior Prompt Evaluation. Key Responsibilities: Conduct thorough reviews of AI model dialogues with users to assess quality and relevance; Identify growth a...
Remote Finance Director - AI Trainer

Data Annotation - Loveland, Colorado
Remote, Full-time +1
We are looking for a finance professional to join our team to train AI models. You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the q...
Travel Nurse RN - Med Surg

Bestica - Estes Park, CO, US
Full-time
Bestica is seeking a travel nurse RN Med Surg for a travel nursing job in Estes Park, Colorado. Job Description & Requirements. BLS and ACLS must be AHA, and PALS. We are a trusted provider of sol...
AI Evaluation Analyst

VirtualVocations - Littleton, Colorado, United States
Full-time
A company is looking for an AI Agent Evaluation Analyst. Key Responsibilities: Review evaluation tasks and scenarios for logic, completeness, and realism; Identify inconsistencies, missing assumpti...
Remote Financial Analyst - AI Trainer

Data Annotation - Lakewood, Colorado
Remote, Full-time +1
We are looking for a finance professional to join our team to train AI models. You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the q...
Travel CT Technologist

TriOptus - Fraser, CO, US
Permanent
TriOptus is seeking a travel CT Technologist for a travel job in Fraser, Colorado. Job Description & Requirements. BLS & MUST BE ARRT CT & R certified to meet the requirement of the state...
Travel CT Technologist

Magnet Medical - Winter Park, CO, US
Full-time
Magnet Medical is seeking a travel CT Technologist for a travel job in Winter Park, Colorado. Job Description & Requirements. In addition to POC duties, The Diagnostic Imaging / CT Technologist per...
AI Squad Lead

Welocalize - Evergreen, CO, US
Full-time
The Squad Lead is responsible for delivering services and solutions to Welocalize customers. The Squad Lead manages or directs a team (i. These solutions are based on the Welocalize Four Pillars of C...
AI Agent Evaluation Analyst

VirtualVocations - Boulder, Colorado, United States
Full-time
A company is looking for an AI Agent Evaluation Analyst. Key Responsibilities: Review evaluation tasks and scenarios for logic, completeness, and realism; Identify inconsistencies, missing assumpti...
Evaluation Scenario Writer - QA

Mindrift - CO, US
Remote, Part-time +1
We believe in using the power of collective human intelligence to ethically shape the future of AI. The Mindrift platform, launched and powered by. AI projects from innovative tech clients. Our missio...
Licensed Clinical Social Worker (Mental Health Therapist) - Lyons, CO

LifeStance Health - Lyons, CO, US
Full-time
At LifeStance Health, we believe in a truly healthy society where mental and physical healthcare are unified to make lives better. Our mission is to help people lead healthier, more fulfilling lives...
Remote Senior Financial Analyst - AI Trainer

Data Annotation - Centennial, Colorado
Remote, Full-time +1
We are looking for a finance professional to join our team to train AI models. You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the q...
Remote Commercial Banking Analyst - AI Trainer

Data Annotation - Westminster, Colorado
Remote, Full-time +1
We are looking for a finance professional to join our team to train AI models. You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the q...
Outpatient Testing Clinical Psychologist - Allenspark, CO

LifeStance Health - Allenspark, CO, US
Full-time
At LifeStance Health, we believe in a truly healthy society where mental and physical healthcare are unified to make lives better. Our mission is to help people lead healthier, more fulfilling lives...