Q003-SRC05 — Scorecard¶

Source¶


Title	Microsoft 365 Copilot Researcher — Critique and Council features
Publisher	Multiple news sources (GeekWire, XDA, Petri)
Authors	Microsoft (product); multiple journalists (reporting)
Date	2026-03-30
URL	https://www.geekwire.com/2026/gpt-drafts-claude-critiques-microsoft-blends-rival-ai-models-in-new-copilot-upgrade/
Type	Product announcement / news coverage

Dimension	Rationale
Reliability	News coverage of a new product feature; benchmark results reported by Microsoft
Relevance	Cross-model verification (GPT drafts, Claude critiques) is the closest mechanism to a formal audit found in commercial tools
Missing data	Internal methodology of Critique not fully documented
Measurement	DRACO benchmark results (57.4 points, 13.8% improvement) are Microsoft-reported
COI/funding	Microsoft reporting on own product's benchmark performance

Evidence	Summary
SRC05-E01	Microsoft Copilot Critique uses cross-model verification (GPT + Claude) achieving 13.8% improvement on DRACO benchmark