Skip to content

R0055/2026-04-01/C025

Research R0055 — RLHF Yes-Men Claims
Run 2026-04-01
Claim C025

Claim: The DoD's CaTE center (Calibrated AI Trust and Expectations) at SEI/Carnegie Mellon has published frameworks for measuring trust in AI systems

BLUF: Partially correct with name error. The center is the Center for Calibrated Trust Measurement and Evaluation (CaTE), not 'Calibrated AI Trust and Expectations.' It is at SEI/Carnegie Mellon, launched in 2023 with DoD/OUSD(R&E). It has published a guidebook for TEVV of LAWS (lethal autonomous weapons systems) focused on trust measurement.

Probability: Likely (55-80%) | Confidence: High


Summary

Entity Description
Claim Definition Claim text, scope, status
Assessment Full analytical product with reasoning chain
ACH Matrix Evidence x hypotheses diagnosticity analysis
Self-Audit ROBIS-adapted 5-domain audit

Hypotheses

ID Hypothesis Status
H1 Claim is accurate as stated Inconclusive
H2 Claim is partially correct or correct with caveats Supported
H3 Claim is materially wrong Eliminated

Searches

ID Target Results Selected
S01 DoD CaTE Calibrated Trust SEI Carnegie Mellon fram 10 2

Sources

Source Description Reliability Relevance
SRC01 CMU News / SEI High High

Revisit Triggers

  • CaTE expanding scope to address AI output behavior or sycophancy specifically