Part 1 Chapter 2 Last verified 2026-06-13 Fresh

Beyond Autocomplete: The Environment & Context Discipline

The argued opener for this book. The discipline that turns a model into an agent is the engineering of the two layers around it — the environment it acts in and the context it reasons over — and it is the most underappreciated, highest-leverage thing an architect designs.

Volatility: architectural-pattern

Tools compared: claude-codecross-tool

On this page

What autocomplete cannot do
Two layers, one discipline
Why it is underappreciated
What is still settling
Quick reference
Practice

The introduction drew the frame and stated the thesis: an agent is a model plus a harness, and the leverage is in two of the harness’s layers — the environment and the context. This chapter makes the case for that thesis — why those two layers, and not the model, are where the discipline of building agents actually lives. It argues; it does not yet build.

What autocomplete cannot do

Start with the gap this book is about. Code completion suggests the next token from the surrounding text; it has no autonomy. An agent, in Anthropic’s framing, is a system where models “dynamically direct their own processes and tool usage.” [Official] Building effective agents · Erik Schluntz and Barry Zhang (2024)T1-official original The working shorthand is tighter: agents are “LLMs autonomously using tools in a loop.” Effective context engineering for AI agents · Prithvi Rajasekaran, Ethan Dixon, Carly Ryan, and Jeremy Hadfield (2025)T1-official original

The model supplies that autonomy. But a model alone, prompted in a vacuum, is still close to autocomplete — it can only act on what is in front of it. What makes it an agent is everything the harness arranges around the model: the substrate it can act in, and the window it gets to reason over. Claude Code, for instance, “provides the tools, context management, and execution environment that turn a language model into a capable coding agent.” [Official] How Claude Code works · AnthropicT1-official original

Two layers, one discipline

The frame named three layers; two of them are the agent’s whole relationship with the world.

Environment — the substrate the agent acts in: the repository, the filesystem, scripts, a running process. A long-running harness even “asks the model to set up the initial environment” before work begins. Effective harnesses for long-running agents · Justin Young (2025)T1-official original Shape the environment well and the agent reads high signal and gets honest feedback; shape it poorly and no amount of prompting recovers.
Context — the assembled window the model reasons over. It is “a critical but finite resource for AI agents” [Official] Effective context engineering for AI agents · Prithvi Rajasekaran, Ethan Dixon, Carly Ryan, and Jeremy Hadfield (2025)T1-official original — a budget, not free memory. What you spend it on, and what you leave out, decides what the agent can actually do on any given turn.

These read as two topics — “set up the repo” and “manage the window” — but they are one discipline seen from two ends. The environment is the large, durable store of everything the agent could use; the context is the small, finite slice it does use on a turn; and the harness’s defining job is owning the boundary between them, keeping “lightweight references” and using them to “dynamically load data into context at runtime using tools.” Effective context engineering for AI agents · Prithvi Rajasekaran, Ethan Dixon, Carly Ryan, and Jeremy Hadfield (2025)T1-official original

Why it is underappreciated

The model gets the headlines — new versions, benchmarks, prompting tricks. That is the inner loop, and it matters. But the model is, increasingly, a given: you choose a capable one and prompt it well. The lever you actually own as an architect is the outer loop — and most of the outer loop’s effect on output quality flows through these two layers.

This is underappreciated for a structural reason: the work is invisible when it succeeds. A well-designed environment shows up as the agent “just knowing” where things are; a well-budgeted context shows up as the agent staying coherent over a long task. Nobody points at the absence of a failure. So the discipline is easy to skip and easy to under-credit — right up until an agent re-reads the same file every turn, loses the thread across sessions, or confidently acts on stale memory, and the cause turns out to be the environment or the context, never the model.

What is still settling

Two honest limits travel with this book, and they shape how to read it.

Most of this discipline is converged craft, not measured effect. Independent practitioners, standards bodies, and vendors agree on the direction of most practices here, but controlled effect sizes are rare. Where a claim is craft consensus, the chapters say so; where there is a measured result, they name it — and they never launder a heuristic into a number.
The vocabulary is young. “Environment engineering” and “context engineering” are recent coinages over practices that predate the names. Treat the framing as a useful organizing lens, not a settled taxonomy, and re-check the landscape rather than the labels.

Convergence claude-codecross-tool

The coinage is young but independently converged-upon. Anthropic frames “context engineering” as curating what enters a finite window; [Official] Effective context engineering for AI agents · Prithvi Rajasekaran, Ethan Dixon, Carly Ryan, and Jeremy Hadfield (2025)T1-official original Shopify’s Tobi Lütke independently prefers the term for “the art of providing all the context for the task to be plausibly solvable by the LLM,” [Practitioner] On 'context engineering' over 'prompt engineering' (X post) · Tobi Lütke (Shopify) (2025)T3-practitioner original and Andrej Karpathy amplifies it as “the delicate art and science of filling the context window.” [Practitioner] On 'context engineering': 'filling the context window' (X post) · Andrej Karpathy (2025)T3-practitioner original A vendor and two practitioners name the same lever — not the prompt, not the model, but the context. [Convergence]

Quick reference

Autocomplete → agent: the model supplies autonomy; the environment and context supply what the autonomy acts on. Engineering those two layers is the discipline.
Environment = the durable substrate (repo, filesystem, processes); context = the finite window. The harness owns the boundary.
One subject, two ends: maximize signal in the environment; spend the context budget deliberately. Neither works alone.
Why underappreciated: the work is invisible when it succeeds, and the model gets the attention — but the leverage is here.
The arc: environment (substrate) → context rot (the problem) → context assembly (the response) → memory (persistence).

Practice

Exercise solutions

Solution ↑ Exercise

Stable, broadly-applicable facts (“tests live in /tests”, “use conventional commits”) usually belong in the context layer — specifically the always-loaded instruction file, because they apply every session and the agent cannot reliably infer them from a single view of the code. Volatile or large procedural knowledge belongs in the environment, loaded on demand, so it does not tax every turn. Moving an always-needed fact out to the environment risks the agent not loading it when it matters; moving a large procedure into the always-loaded layer burns context budget on every turn whether or not it is relevant. The split is the discipline — and it is exactly what the instruction-layer and skills chapters develop.