Skip to content

R0020/2026-03-25/Q002/SRC02

Research R0020 — Prompt Engineering Gaps
Run 2026-03-25
Query Q002
Search S01
Result S01-R02
Source SRC02

arXiv — Ask don't tell: Reducing sycophancy in large language models

Source

Field Value
Title Ask don't tell: Reducing sycophancy in large language models
Publisher arXiv
Author(s) Multiple academic authors
Date 2026-02
URL https://arxiv.org/html/2602.23971v2
Type Primary research paper

Summary

Dimension Rating
Reliability High
Relevance High
Bias: Missing data Low risk
Bias: Measurement Low risk
Bias: Selective reporting Some concerns
Bias: Randomization N/A
Bias: Protocol deviation N/A
Bias: COI/Funding Low risk

Rationale

Dimension Rationale
Reliability Controlled experiment with three frontier models (GPT-4o, GPT-5, Sonnet-4.5), 440 prompts. Quantitative methodology.
Relevance Tests a specific, user-accessible prompt-level technique for reducing sycophancy
Bias flags Some concern about selective reporting — results presented as mean differences without full distribution analysis. Overall methodology is sound.

Evidence Extracts

Evidence ID Summary
SRC02-E01 Question reframing technique with 24pp sycophancy reduction