Skip to content

R0020/2026-03-25/Q003 — Self-Audit

ROBIS 4-Domain Audit

Domain 1: Eligibility Criteria

Rating: Low risk

Criterion Assessment
Evidence types defined before searching Yes — vendor documentation and industry guides targeted
Criteria consistent throughout Yes
Scope maintained Yes — focused on constraint language in prompt engineering guides

Notes: Stable eligibility criteria throughout.

Domain 2: Search Comprehensiveness

Rating: Some concerns

Criterion Assessment
Multiple search strategies used Yes — two distinct searches
Searches designed to test each hypothesis Partial — both searches sought constraint guidance rather than absence
All results dispositioned Yes — 20 results returned, all dispositioned
Source diversity achieved Partial — missing OpenAI (403) and Google documentation

Notes: The inability to access OpenAI's documentation is the most significant gap. Two of three major AI vendors are represented, but the third is absent.

Domain 3: Evaluation Consistency

Rating: Low risk

Criterion Assessment
All sources scored using same framework Yes
Evidence typed consistently Yes
ACH matrix applied Yes
Diagnosticity analysis performed Yes

Notes: Consistent evaluation across sources.

Domain 4: Synthesis Fairness

Rating: Low risk

Criterion Assessment
All hypotheses given fair hearing Yes
Contradictory evidence surfaced Yes — Anthropic's "dial back" guidance prominently featured
Confidence calibrated to evidence Yes
Gaps acknowledged Yes — missing vendor documentation noted

Notes: The finding that constraints are evolving rather than static adds nuance that neither H1 nor H2 fully captures.

Overall Assessment

Overall risk of bias: Low risk

The Anthropic documentation was the most informative and highest-quality source. The missing OpenAI documentation is a limitation but does not invalidate the assessment.

Researcher Bias Check

  • Confirmation bias risk: Low. The research surfaced evidence both for and against imperative constraints, including the unexpected finding that the latest vendor guidance recommends reducing enforcement language.
  • Recency bias risk: Some concern. Anthropic's "dial back" guidance is model-generation-specific (Claude 4.5/4.6) and may not generalize to all models or vendors.