Chapter 1 Citation Evidence Index
This index maps Chapter 1 claims to the reference evidence package.
It is not a replacement for the final article. It is a citation review aid.
Current Published References
The final Chapter 1 Markdown currently lists 19 references:
- R1-R7: framing references for agent runtime capability, context engineering, multi-agent orchestration, OpenAI/Hugging Face agent docs, and long-running harnesses
- R8-R10: long-context degradation and long-task capability
- R11-R13: authority, support, delegation, and inter-agent risk
- R14-R15: tool calling and capability governance
- R16: lawful stop and agent autonomy
- R17-R19: closest-neighbor systems and positioning
The local reference matrix primarily supports the empirical sections R1-R16. R17-R19 are positioning references and are not the main focus of this matrix.
Claim Mapping
Opening / Framing
Supported by:
- Anthropic, Building effective agents
- Anthropic, Effective context engineering for AI agents
- OWASP GenAI Security Project, LLM Top 10
Use:
- Define the field-level vocabulary: workflows, agents, context engineering, and excessive agency.
- Keep these citations light. They establish context rather than proving the chapter.
Section 1.1: Task Ownership Drift
Supported by:
- Liu et al., Lost in the Middle, arXiv/ACL/TACL reference: 2307.03172
- Chroma, Context Rot
- METR, Measuring AI Ability to Complete Long Tasks, arXiv:2503.14499
- LOCA-bench, arXiv:2602.07962
- UltraHorizon, arXiv:2509.21766
Use:
- Support the point that larger context windows do not automatically preserve task identity.
- Avoid claiming these papers directly prove ownership drift; they support the weaker and more defensible claim that long-context reliability degrades and therefore cannot serve as the whole task-identity substrate.
Section 1.2: Support Quietly Turns Into Ownership
Supported by:
- production failure-mode taxonomies
- CrewAI hierarchical bug reports
- adjacent handoff and scope-creep reports
Use:
- This section is mostly structural reasoning. Use at most one light citation if needed.
- The matrix recommends not over-citing this section.
Section 1.3: Memory Is Not Evidence
Supported by:
- Anthropic, Effective context engineering for AI agents
- AI Hallucination Cases Database
- ACE context collapse, arXiv:2510.04618
- MIRAGE-Bench, arXiv:2507.21017
- When Refusals Fail, arXiv:2512.02445
Use:
- Support the distinction between recall and admissibility.
- The hallucination cases database is a strong concrete example, but should be framed carefully as legal-domain evidence misuse rather than a direct proof of runtime memory architecture.
Section 1.4: Persona Becomes Authority
Supported by:
- When Personas Override Payoffs, arXiv:2601.10102
- Belief in Authority, arXiv:2601.04790
- Persona-induced bias, arXiv:2511.11789
- Identity drift, arXiv:2412.00804
- CrewAI Issue #2838
Use:
- The strongest insertion is the role identity bias framing plus one engineering bug report.
- Avoid stacking all persona papers into the main article.
Section 1.5: Delegation Becomes Authorization
Supported by:
- CrewAI Issue #4783
- CrewAI Issue #2054
- CrewAI Issue #3925
Use:
- Treat these as public engineering examples of delegation and authority confusion.
- Do not claim that CrewAI as a project proves the general argument; use the issues as illustrative evidence.
Section 1.6: Global Chat Becomes Context Pollution
Supported by:
- Anthropic, How we built our multi-agent research system
- AutoGen GroupChat bug reports
Use:
- The Anthropic orchestrator-workers design is the positive counterexample.
- AutoGen issues can support the practical failure mode, but the final article can remain concise.
Supported by:
- OWASP LLM06: Excessive Agency
- MCPTox benchmark cluster
- Breaking MCP Protocol, arXiv:2601.17549
- Prompt Injection Attacks on Agentic Coding Assistants, arXiv:2601.17548
Use:
- This is one of Chapter 1’s strongest empirical sections.
- Use OWASP for taxonomy and MCP/tool-poisoning work for measured risk.
Section 1.8: Lawful Stop and Closure
Supported by:
- Anthropic, Measuring AI agent autonomy in practice
- Anthropic, Effective harnesses for long-running agents
- production failure-mode taxonomies
Use:
- This is another strong empirical section.
- Use Anthropic’s own engineering/research material to ground the need for agent-initiated stop, clarification, and non-premature closure.
Integration Guidance
The package README recommends citing only 8-10 references in the final published chapter, not every source in the evidence package.
Good default selection:
- Anthropic, Building effective agents
- Anthropic, Effective context engineering for AI agents
- Anthropic, How we built our multi-agent research system
- Liu et al., Lost in the Middle
- Chroma, Context Rot
- METR, Measuring AI Ability to Complete Long Tasks
- OWASP LLM06
- MCPTox / MCP tool-poisoning evidence
- Anthropic, Measuring AI agent autonomy in practice
- Anthropic, Effective harnesses for long-running agents
The final v3 article currently cites 19 references. That is acceptable for the PDF edition, but Substack may benefit from a lighter visible reference list with a link back to this evidence index.