R0054/2026-03-31/C002 — Claim Definition¶
Claim as Received¶
Descriptive guidance alone — telling the AI what to do — is not sufficient for complex, multi-step analytical processes. Detailed positive instructions produced inconsistent results until complemented with explicit constraints on what the AI could not do.
Claim as Clarified¶
This claim asserts: (1) positive instructions alone ("do X") are insufficient for complex multi-step AI processes, (2) adding negative constraints ("do not do Y") was necessary to achieve consistency, and (3) this complementary relationship between positive and negative instructions is a real and documented phenomenon. The claim contains an embedded assumption that complex multi-step processes require different prompting strategies than simple tasks.
BLUF¶
The claim is well-supported by current research on prompt engineering and LLM behavior. Multiple studies confirm that positive instructions alone produce inconsistent results for complex tasks, that LLMs have documented weaknesses in processing negation, and that the most effective approach combines positive guidance with carefully structured negative constraints. The 80/20 ratio (positive to negative) recommended by practitioners aligns with the claim's description of complementary instruction types.
Scope¶
- Domain: AI prompt engineering, LLM behavior
- Timeframe: 2024-2026
- Testability: Verifiable through prompt engineering research, practitioner experience, and LLM behavioral studies
Assessment Summary¶
Probability: Very likely / Highly probable (80-95%)
Confidence: Medium-High
Hypothesis outcome: H1 (claim accurate) prevailed, with strong support from both academic research and practitioner experience. The underlying mechanism (RLHF-trained helpfulness overriding instruction compliance) is well-documented.
[Full assessment in assessment.md.]
Status¶
| Field | Value |
|---|---|
| Date created | 2026-03-31 |
| Date completed | 2026-03-31 |
| Researcher profile | Phil Moore |
| Prompt version | ai-research-methodology v1 research.md |
| Revisit by | 2027-03-31 |
| Revisit trigger | Publication of systematic studies specifically comparing positive-only vs. positive+negative prompting for multi-step analytical tasks |