Talent.com
Senior Agentic AI Test and Evaluation Engineer

Leidos Inc - Reston, VA, United States
Job type: Full-time

Description

At Leidos, you'll contribute to AI solutions that serve critical national and global missions, ranging from defense and intelligence to healthcare, energy, and space exploration. Our work emphasizes Trusted Mission AI: systems that are transparent, ethical, resilient, and accountable. You'll collaborate with multidisciplinary teams to transition AI research into operational environments where accuracy, security, and reliability are non-negotiable. Joining Leidos means applying your expertise to solve some of the most complex and meaningful challenges of our time.

We are looking for a motivated Agentic AI Test and Evaluation Engineer who wants to work on challenging problems in a variety of domains - including enterprise IT, health, defense, intelligence, and energy - to get results that apply and go beyond the state of the art for measurably better outcomes. We apply our knowledge, capabilities, and experience to develop and deploy Trusted Mission AI - AI that deserves to be trusted by system owners, end users, and the public - to be accurate, ethical, reliable, and adaptable.

You will work with a team of AI scientists, agentic AI scientists, data scientists and data engineers to operationalize new approaches for test and evaluation of Agentic AI models that produce measurable advances over state-of-the-art solutions.

Primary Responsibilities

  • Develops AI Models Test and Evaluation CONOPS
  • Creates scalable Test and Model Evaluation plans for Agentic AI systems including process, techniques and tools.
  • Works with AI scientists, agentic AI scientists, data scientists and data engineers to understand the AI system under test and develop test procedures covering both positive and negative testing and evaluation
  • Collects performance metrics as part of evaluation results documentation
  • Works with MLOps engineers to integrate testing tools and procedures with the CI/CD pipeline
  • Analyzes existing processes and resultant metrics to recommend potential improvements
  • Collaborates with AI Governance team to maintain visibility and explainability through testing
  • Implements testing process in the AI system design, development and deployment life cycle
  • Identifies the risk in testing of projects, particularly for assessing the limitations of planned tests on complex AI systems
  • Works within teams of AI/ML researchers and engineers using Agile development processes

Basic Qualifications

  • Bachelor's degree in Computer Science, Data Science or a related field with over 8 years of relevant experience, or a Master's degree with 6 years of experience. Additional experience may be considered in lieu of a degree.
  • Strong Python programming fundamentals
  • Experience with system and subsystem level test process and automation
  • Experience with creating user acceptance test scenarios
  • Experience with SecDevOps tooling and MLOps pipeline development
  • Experience with software test automation techniques
  • Experience with AI performance and vulnerability assessment
  • Experience with AI model assurance evaluation
  • Experience applying and automating AI interpretability & explainability tools and methods
  • Experience with developing CONOPS and presentations
  • Good understanding of machine learning algorithms, tools and platforms
  • Self-starter with high intellectual curiosity
  • Great communication skills, able to explain model and test results to a non-technical audience
  • Proficient in data exploration techniques and tools
  • Ability to obtain a Secret clearance

Preferred Qualifications

  • Experience with data visualization libraries such as Plotly, Streamlit, and matplotlib.
  • Experience with AI/ML tools, such as common Python packages (e.g., scikit-learn, NumPy, Pandas) and Jupyter notebooks
  • Experience with database administration and data repositories
  • Experience in data exploration techniques and tools
  • Experience with building LLM and other Generative AI applications.
  • Willing to learn new skills and platforms to support data analytics.
  • Ability and willingness to obtain a Top Secret security clearance

    If you're looking for comfort, keep scrolling. At Leidos, we outthink, outbuild, and outpace the status quo - because the mission demands it. We're not hiring followers. We're recruiting the ones who disrupt, provoke, and refuse to fail. Step 10 is ancient history. We're already at step 30 - and moving faster than anyone else dares.

    Original Posting: August 29, 2025

    For U.S. Positions: While subject to change based on business needs, Leidos reasonably anticipates that this job requisition will remain open for at least 3 days with an anticipated close date of no earlier than 3 days after the original posting date as listed above.

    Pay Range: $104,650.00 - $189,175.00

    The Leidos pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.

    #Remote
