References

Every external source cited in this book, grouped by tier in descending authority.

Cited sources

External sources cited inline via <Citation>, grouped by tier in descending authority.

T1 · Official 86 entries

Vendor-official documentation or release notes. Highest trust for factual claims about the vendor’s own tool.

  1. How we contain Claude across products
    original id: anthropic-containment
  2. How Claude Code works in large codebases: Best practices and where to start
    original id: anthropic-large-codebases
  3. Seeing like an agent: how we design tools in Claude Code
    original id: anthropic-seeing-like-agent
  4. Trustworthy agents in practice
    original id: anthropic-trustworthy-agents
  5. Scaling Managed Agents: Decoupling the brain from the hands
    original id: anthropic-managed-agents
  6. How and when to use subagents in Claude Code
    original id: anthropic-subagents-blog
  7. How we built Claude Code auto mode: a safer way to skip permissions
    original id: claude-code-auto-mode
  8. Bringing Code Review to Claude Code
    original id: claude-code-review-blog
  9. The 2026 MCP Roadmap
    original id: mcp-roadmap-2026
  10. Measuring AI agent autonomy in practice
    original id: anthropic-measuring-autonomy
  11. Donating the Model Context Protocol and establishing the Agentic AI Foundation
    original id: anthropic-mcp-donation
  12. Effective harnesses for long-running agents
    original id: effective-harnesses
  13. Key Changes (Changelog) — MCP Specification 2025-11-25
    original id: mcp-changelog
  14. Specification — Model Context Protocol
    original id: mcp-spec
  15. Architecture — Model Context Protocol Specification 2025-11-25
    original id: mcp-spec-architecture
  16. Authorization — Model Context Protocol Specification (revision 2025-11-25)
    original id: mcp-spec-authorization
  17. Lifecycle — Model Context Protocol Specification 2025-11-25
    original id: mcp-spec-lifecycle
  18. Prompts — Model Context Protocol Specification (revision 2025-11-25)
    original id: mcp-spec-prompts
  19. Resources — Model Context Protocol Specification (revision 2025-11-25)
    original id: mcp-spec-resources
  20. Tools — Model Context Protocol Specification (revision 2025-11-25)
    original id: mcp-spec-tools
  21. Transports — Model Context Protocol Specification 2025-11-25
    original id: mcp-spec-transports
  22. Equipping agents for the real world with Agent Skills
    original id: anthropic-skills
  23. Building agents with the Claude Agent SDK
    original id: building-agents-with-the-sdk
  24. Effective context engineering for AI agents
    original id: effective-context-engineering
  25. Piloting Claude in Chrome
    original id: anthropic-claude-chrome
  26. How we built our multi-agent research system
    original id: anthropic-multi-agent-research
  27. Code execution with MCP: Building more efficient agents
    original id: anthropic-code-exec-mcp
  28. Writing effective tools for agents — with agents
    original id: anthropic-writing-tools
  29. Building effective agents
    original id: building-effective-agents
  30. A statistical approach to model evaluations
    original id: anthropic-statistical-evals
  31. Building evals
    original id: anthropic-building-evals-cookbook
  32. Agent SDK overview
    original id: agent-sdk-overview
  33. Inspect — Options
    original id: aisi-inspect-options
  34. Using agent memory
    original id: anthropic-agent-memory
  35. Get structured output from agents
    original id: anthropic-agent-sdk-structured-outputs
  36. Batch processing — Claude Docs
    original id: anthropic-batch-processing
  37. Best practices for Claude Code
    original id: anthropic-cc-best-practices
  38. Explore the context window
    original id: anthropic-context-window
  39. Define tools
    original id: anthropic-define-tools
  40. Define success criteria and build evaluations
    original id: anthropic-develop-tests
  41. Using the Evaluation Tool
    original id: anthropic-eval-tool
  42. Handle tool calls
    original id: anthropic-handle-tool-calls
  43. Increase output consistency
    original id: anthropic-increase-consistency
  44. Prompting Claude for JSON mode
    original id: anthropic-json-mode-cookbook
  45. Configure permissions
    original id: anthropic-permissions
  46. Discover and install prebuilt plugins through marketplaces
    original id: anthropic-plugins
  47. Pricing - Claude API Docs
    original id: anthropic-pricing
  48. Prompt caching
    original id: anthropic-prompt-caching
  49. Prompt engineering overview
    original id: anthropic-prompt-eng-overview
  50. Console prompting tools
    original id: anthropic-prompt-improver
  51. Prompting best practices
    original id: anthropic-prompting-best-practices
  52. Beyond permission prompts: making Claude Code more secure and autonomous
    original id: anthropic-sandboxing-blog
  53. Configure the sandboxed Bash tool
    original id: anthropic-sandboxing-docs
  54. Claude Code settings
    original id: anthropic-settings
  55. Skill authoring best practices
    original id: anthropic-skills-best-practices
  56. Extend Claude with skills
    original id: anthropic-skills-cc
  57. Agent Skills (overview)
    original id: anthropic-skills-overview
  58. Agent Skills in the SDK
    original id: anthropic-skills-sdk
  59. Strict tool use
    original id: anthropic-strict-tool-use
  60. Structured outputs
    original id: anthropic-structured-outputs
  61. Create custom subagents
    original id: anthropic-subagents-docs
  62. Tool search tool
    original id: anthropic-tool-search
  63. Define tools
    original id: anthropic-tool-use-define
  64. Tool use with Claude
    original id: anthropic-tool-use-overview
  65. Prompting best practices: use XML tags
    original id: anthropic-xml-tags
  66. How Claude remembers your project
    original id: cc-memory
  67. Subagents in the SDK
    original id: claude-agent-sdk-subagents
  68. Track team usage with analytics
    original id: claude-code-analytics
  69. Explore the .claude directory
    original id: claude-code-claude-directory
  70. Common workflows
    original id: claude-code-common-workflows
  71. Manage costs effectively
    original id: claude-code-costs
  72. Claude Code GitHub Actions
    original id: claude-code-github-actions
  73. Run Claude Code programmatically
    original id: claude-code-headless
  74. Monitoring
    original id: claude-code-monitoring
  75. Choose a permission mode
    original id: claude-code-permission-modes
  76. How Claude Code uses prompt caching - Claude Code Docs
    original id: claude-code-prompt-caching
  77. Code Review
    original id: claude-code-review-docs
  78. Security
    original id: claude-code-security-docs
  79. Persist sessions to external storage
    original id: claude-code-session-storage
  80. Handle approvals and user input
    original id: claude-code-user-input
  81. Demystifying evals for AI agents
    original id: demystifying-evals
  82. How Claude Code works
    original id: how-claude-code-works
  83. The 2026-07-28 MCP Specification Release Candidate
    original id: mcp-rc-2026
  84. CVE-2025-32711
    original id: nvd-cve-2025-32711
  85. Semantic Conventions for GenAI agent and framework spans
    original id: otel-genai-agent-spans
  86. Semantic conventions for generative AI metrics
    original id: otel-genai-metrics

T2 · Release notes 14 entries

Release blog posts, changelogs, conference talks. Trustworthy for intent and availability claims.

  1. The Anatomy of an Agent Harness
    original id: trivedy-anatomy-agent-harness
  2. Context Engineering
    original id: langchain-context
  3. LangMem SDK for agent long-term memory
    original id: langmem
  4. Nx and AI — Why They Work so Well Together
    original id: nx-savkin-ai
  5. Architectural Decision Records (ADRs)
    original id: adr-github
  6. AGENTS.md
    original id: agents-md
  7. Anthropic's Prompt Engineering Interactive Tutorial
    original id: anthropic-prompt-eng-tutorial
  8. anthropics/skills: Public repository for Agent Skills
    original id: anthropics-skills-repo
  9. Introduction (What is CrewAI?)
    original id: crewai-introduction
  10. Long-term memory
    original id: langchain-longterm-memory
  11. LangGraph overview
    original id: langgraph-overview
  12. LangGraph Multi-Agent Supervisor
    original id: langgraph-supervisor
  13. LangGraph Multi-Agent Swarm
    original id: langgraph-swarm
  14. OpenAI Agents SDK
    original id: openai-agents-sdk

T3 · Practitioner 61 entries

Respected community writing with a durable argument the author has defended over time.

  1. State of AI Agent Memory 2026: Benchmarks, Architectures & Production Gaps
    original id: mem0-state-2026
  2. Classifier Context Rot: Monitor Performance Degrades with Context Length
    original id: martin-classifier-rot
  3. Multi-Agents: What's Actually Working
    original id: cognition-working
  4. Coding Agents in the Monorepo: Why Context Windows and 50-Service Repos Don't Mix
    original id: pan-monorepo
  5. AI agents in monorepos: what to configure differently from a single-product repo
    original id: barnwell-monorepo
  6. ShadowPrompt: How Any Website Could Have Hijacked Claude's Chrome Extension
    original id: shadowprompt-koi
  7. My AI Adoption Journey
    original id: hashimoto-harness-engineering
  8. Evaluating AGENTS.md: Are Repository-Level Context Files Helpful for Coding Agents?
    original id: eth-agentsmd-study
  9. My .md files vs Claude's memory tool: a practitioner comparison
    original id: belitz-md-vs-memory
  10. Harness engineering for coding agent users
    original id: boeckeler-harness
  11. Maintainability sensors for coding agents
    original id: boeckeler-sensors
  12. Agent Memory Engineering
    original id: bustamante-agent-memory
  13. Claude Code v2.1.62 — Server-Side KV Cache Stale Context Regression (P1)
    original id: cc-cache-regression
  14. Why your AI agent doesn't actually remember anything
    original id: huang-agent-memory
  15. Don't Break the Cache: An Evaluation of Prompt Caching for Long-Horizon Agentic Tasks
    original id: lumer-cache
  16. Agentic Much? Adoption of Coding Agents on GitHub
    original id: robbes-adoption
  17. We removed 80% of our agent's tools
    original id: vercel-removed-tools
  18. Writing a good CLAUDE.md
    original id: humanlayer-claudemd
  19. How we're making GitHub Copilot smarter with fewer tools
    original id: github-fewer-tools
  20. Advanced Context Engineering for Coding Agents (ACE-FCA)
    original id: humanlayer-ace
  21. When MCP Servers Attack: Taxonomy, Feasibility, and Mitigation
    original id: zhao-mcp-attack
  22. When Instructions Multiply: Measuring and Estimating LLM Capabilities of Multiple Instructions Following
    original id: harada-manyifeval
  23. An AI-powered coding tool wiped out a software company's database
    original id: replit-fortune
  24. Vibe coding service Replit deleted user's production database, faked data, told fibs galore
    original id: replit-register
  25. Incident 1152: LLM-Driven Replit Agent Executed Unauthorized Destructive Commands During Code Freeze
    original id: replit-aiid
  26. How Many Instructions Can LLMs Follow at Once?
    original id: jaroslawicz-ifscale
  27. Using Architecture Decision Records (ADRs) with AI coding assistants
    original id: swan-adrs
  28. How Not to Detect Prompt Injections with an LLM
    original id: kad-dataflip-choudhary
  29. Context Rot: How Increasing Input Tokens Impacts LLM Performance
    original id: chroma-context-rot
  30. On 'context engineering': 'filling the context window' (X post)
    original id: karpathy-context-engineering
  31. On 'context engineering' over 'prompt engineering' (X post)
    original id: lutke-context-engineering
  32. Andrej Karpathy: Software in the Age of AI
    original id: karpathy-decade-of-agents
  33. The lethal trifecta for AI agents: private data, untrusted content, and external communication
    original id: willison-trifecta
  34. Mitigating Posterior Salience Attenuation in Long-Context LLMs with Positional Contrastive Decoding
    original id: xiao-pcd
  35. A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in LLMs
    original id: ye-muldimif
  36. Build MCP Tools Like Ogres... With Layers
    original id: block-layered-tools
  37. Reasoning on Multiple Needles In A Haystack
    original id: wang-mniah
  38. NoLiMa: Long-Context Evaluation Beyond Literal Matching
    original id: nolima
  39. AI Agent Memory Management: When Markdown Files Are All You Need?
    original id: chen-markdown-memory
  40. 12-Factor Agents — Factor 3: Own your context window
    original id: horthy-12factor
  41. Context Engineering for AI Agents: Lessons from Building Manus
    original id: ji-manus
  42. Found in the Middle: Calibrating Positional Attention Bias Improves Long Context Utilization
    original id: hsieh-found-middle
  43. RULER: What's the Real Context Size of Your Long-Context Language Models?
    original id: ruler
  44. MemGPT: Towards LLMs as Operating Systems
    original id: memgpt
  45. Lost in the Middle: How Language Models Use Long Contexts
    original id: liu-lost-middle
  46. ChatGPT Plugins: Data Exfiltration via Images and Cross Plugin Request Forgery
    original id: rehberger-markdown-exfil
  47. Generative Agents: Interactive Simulacra of Human Behavior
    original id: generative-agents
  48. API Contract Definitions: Contract first, implementation first, OpenAPI, GraphQL, gRPC
    original id: fuhrimann-contract-first
  49. Documenting Architecture Decisions
    original id: nygard-adr
  50. Defeating Prompt Injections by Design
    original id: camel-debenedetti
  51. Don't Build Multi-Agents
    original id: cognition-dont-build
  52. Agentic Browser Security: Indirect Prompt Injection in Perplexity Comet
    original id: comet-brave
  53. Design Patterns for Securing LLM Agents against Prompt Injections
    original id: design-patterns-beurer-kellner
  54. Breaking down 'EchoLeak', the First Zero-Click AI Vulnerability Enabling Data Exfiltration from Microsoft 365 Copilot
    original id: echoleak-catonetworks
  55. Bypassing LLM Guardrails: An Empirical Analysis of Evasion Attacks against Prompt Injection and Jailbreak Detection Systems
    original id: guardrail-evasion-hackett
  56. LlamaFirewall: An open source guardrail system for building secure AI agents
    original id: llamafirewall
  57. Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks
    original id: meta-secalign
  58. LLM01:2025 Prompt Injection
    original id: owasp-llm01
  59. LLM03:2025 Supply Chain
    original id: owasp-llm03
  60. WASP: Benchmarking Web Agent Security Against Prompt Injection Attacks
    original id: wasp-evtimov
  61. Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
    original id: zheng-judging-llm-judge

T4 · Conjecture no entries yet

Blog posts, tweets, or unverified claims. Pointers to investigate, not authorities.

No sources at this tier yet.