C002 — Assessment¶


Research	R0054 — Prompt Claims v2
Run	2026-03-31
Claim	C002

BLUF¶

The claim is well-supported. Research and practitioner evidence consistently confirm that positive instructions alone are insufficient for complex multi-step AI processes, and that explicit negative constraints serve a complementary function that positive instructions cannot replicate. The recommended 80/20 ratio of positive to negative instructions reflects this complementary necessity.

Probability¶

Rating: Very likely / Highly probable (80-95%)

Confidence in assessment: Medium-High

Confidence rationale: Multiple independent sources converge on the same finding. The underlying mechanism (RLHF-trained helpfulness creating compliance bias) is well-documented in academic research. The gap is that no controlled experiment specifically tests this in the context of multi-step analytical processes — the evidence is extrapolated from general prompt engineering research and LLM behavioral studies.

Reasoning Chain¶

REPORTED: An empirical ratio of 80% positive to 20% negative instructions is recommended by practitioners, confirming that both instruction types are needed. [SRC01-E01, Medium reliability, High relevance]
REPORTED: Hard negatives (non-negotiable constraints) serve a distinct function from positive instructions. Positive instructions guide behavior; negative constraints establish boundaries and prevent specific failure modes. [SRC02-E01, Medium reliability, High relevance]
REPORTED: Research from KAIST found that larger LLMs perform worse on negated prompts, and LLMs have systematic weaknesses in processing negation. This explains why careful structuring of both instruction types is needed. [SRC03-E01, Medium reliability, High relevance]
JUDGMENT: The convergence of research findings, practitioner experience, and the documented LLM negation weakness provides strong support for the claim. The specific assertion that "detailed positive instructions produced inconsistent results until complemented with explicit constraints" is consistent with the literature, though it is framed as the researcher's personal experience rather than a published finding.

Evidence Base Summary¶

Source	Description	Reliability	Relevance	Key Finding
SRC01	VibeSparking prompt playbook	Medium	High	80/20 ratio; both types needed
SRC02	Virtualization Review guide	Medium	High	Hard negatives serve distinct function
SRC03	LLM negation research synthesis	Medium	High	LLMs have systematic negation weaknesses

Collection Synthesis¶

Dimension	Assessment
Evidence quality	Medium — practitioner guides and research syntheses, not primary controlled experiments
Source agreement	High — all sources converge on the same conclusion
Source independence	High — different authors, platforms, and perspectives all reach the same conclusion
Outliers	None — Claude documentation suggests general instructions over prescriptive steps, but this applies to simple tasks

Detail¶

All three sources independently confirm the claim's core assertion: positive instructions and negative constraints serve complementary functions, and both are needed for reliable AI behavior in complex tasks. The mechanism is clear — LLMs are trained to be helpful (positive bias), which creates a tendency to comply and agree rather than follow constraints. Negative constraints counter this by establishing boundaries the model cannot cross.

Gaps¶

Missing Evidence	Impact on Assessment
No controlled experiment specifically testing positive-only vs positive+negative for multi-step analytical tasks	Would strengthen the assessment from "very likely" to "almost certain"
Claim includes personal experience ("produced inconsistent results") not independently verifiable	The general principle is well-supported, but the specific personal experience is not testable

Researcher Bias Check¶

Declared biases: The researcher "believes strongly in structured methodology over ad-hoc approaches." This bias aligns with the claim — the researcher has an incentive to validate the structured approach.

Influence assessment: The bias creates a motivation to confirm that constraints are necessary. However, the independent evidence from multiple sources supports this conclusion regardless of the researcher's preferences.

Cross-References¶

Entity	ID	File
Hypotheses	H1, H2, H3	`hypotheses/`
Sources	SRC01, SRC02, SRC03	`sources/`
ACH Matrix	—	ach-matrix.md
Self-Audit	—	self-audit.md