R0041/2026-04-01/Q001/SRC06/E01¶
Google Gemini sycophancy performance across models
URL: https://blog.google/products/gemini/gemini-3/
Extract¶
Google's Gemini 3 announcement states the model "shows reduced sycophancy, increased resistance to prompt injections and improved protection against misuse via cyberattacks." Sycophancy reduction is listed alongside other safety improvements.
Independent corroboration: A March 2026 study by Stanford and Carnegie Mellon University researchers evaluated 11 LLMs and found Google DeepMind's Gemini-1.5 to be the least sycophantic model tested. Models from the Chinese firms DeepSeek and Alibaba were found to be more sycophantic than the American LLMs in the evaluation.
Additionally, Google DeepMind developed Gemma Scope 2, an interpretability tool explicitly positioned for studying "jailbreaks, hallucinations, sycophancy, refusal mechanisms and discrepancies between internal state and communicated reasoning," indicating sustained research investment beyond model releases.
No enterprise-specific API parameters or configurations for sycophancy control are mentioned.
Relevance to Hypotheses¶
| Hypothesis | Relationship | Strength |
|---|---|---|
| H1 | Contradicts | Sycophancy reduction is a general model improvement, not an enterprise feature |
| H2 | Supports | Active vendor investment confirmed by both vendor claims and independent benchmarks |
| H3 | Contradicts | Independent benchmark verification of Google's claims shows genuine progress |
Context¶
Google's approach appears to run through general model improvement and interpretability research rather than enterprise-specific features. The independent benchmark corroboration is particularly valuable here because it confirms the vendor's sycophancy claim from a source with no stake in the announcement.