R0053/2026-03-31-02/C002/SRC01
Analysis of negative instruction effectiveness in LLMs (Pink Elephant Problem)
Source
Summary
| Dimension |
Rating |
| Reliability |
Medium |
| Relevance |
High |
| Bias: Missing data |
Some concerns |
| Bias: Measurement |
Some concerns |
| Bias: Selective reporting |
Some concerns |
| Bias: Randomization |
N/A — not an RCT |
| Bias: Protocol deviation |
N/A — not an RCT |
| Bias: COI/Funding |
Low risk |
Rationale
| Dimension |
Rationale |
| Reliability |
Practitioner analysis with real-world examples, cites Anthropic guidance. Not peer-reviewed but grounded in observable behavior. |
| Relevance |
Directly tests the claim's core mechanism. |
| Bias flags |
Some concerns: may cherry-pick examples, measurement is anecdotal rather than systematic. |
| Evidence ID |
Summary |
| SRC01-E01 |
Negative instructions produce worse output in LLMs |
| SRC01-E02 |
Anthropic recommends positive framing |