Context Engineering
Definition
Context Engineering is the systematic approach to designing and structuring the input context provided to Large Language Models (LLMs) to maximize their effectiveness, accuracy, and reliability in generating outputs.
The practice emerged from the recognition that LLMs operate on explicit information only—they cannot intuit missing business logic or infer unstated constraints. Context Engineering addresses this by making tacit knowledge explicit, machine-readable, and version-controlled.
The Requirements Gap
“Prompt Engineering” is often a misnomer. It is simply Requirements Engineering adapted for a probabilistic system. Unlike a human developer who asks clarifying questions when requirements are vague (“What happens if the payment fails?”), an LLM generates the statistically most likely continuation based on its training data. It does not “understand” the business domain; it predicts patterns. When explicit logic is missing, the model defaults to the average case found in its training set, leading to code that is syntactically correct but semantically misaligned with specific project needs.
The Cold Start Problem
Martin Fowler observes: “As I listen to people who are serious with AI-assisted programming, the crucial thing I hear is managing context.”
Anthropic’s research confirms this. Engineers cite the cold start problem as the biggest blocker:
“There is a lot of intrinsic information that I just have about how my team’s code base works that Claude will not have by default… I could spend time trying to iterate on the perfect prompt [but] I’m just going to go and do it myself.”
Context Engineering solves cold start by capturing this intrinsic information in files the agent can read.
Key Characteristics
- Version Controlled: Context exists as a software asset that lives in the repo, is diffed in PRs, and is subject to peer review.
- Standardized: Formatted to be readable by any agent (Cursor, Windsurf, Devin, GitHub Copilot).
- Iterative: Continuously refined based on agent failure modes and tacit information discovered by Human-in-the-loop (HITL) workflows.
- Schema-First: Data structures defined before requesting content generation to ensure type safety and validation.
- Hierarchical: Information organized by importance—critical instructions first, references second, examples last.
Applications
While ASDLC focuses on software development, Context Engineering is domain-agnostic:
- In Design: Design system tokens and Figma layer naming conventions fed to UI agents
- In Law: Briefs restricting paralegal agents to specific case law precedents
- In SDLC: The
AGENTS.mdfile steering agents toward implementation patterns
Screaming Architecture
Context Engineering extends to the filesystem itself. As Raf Lefever notes, “If your code-base doesn’t scream its domain, AI will whisper nonsense.”
A well-structured filesystem (e.g., src/features/checkout/core-logic) provides implicit context to the LLM about intent and boundaries. A generic filesystem (src/utils, src/managers) forces the LLM to guess. In ASDLC, we optimize directory structures to be “training wheels” for the agent.
Toolchain as Context Reduction
Context Engineering is typically framed as a question of what to put in context. Equally important is what to leave out.
Every constraint enforced deterministically by the toolchain is context that does not need to be in the prompt. A well-configured biome.json silently eliminates an entire class of style instructions. A strict tsconfig.json makes type safety rules unnecessary to state. Treat your linter, formatter, and type checker configurations as upstream context engineering — they narrow the solution space before the agent ever sees the prompt.
This principle has empirical support. Gloaguen et al. (2026) found that agents follow context file instructions faithfully, which means unnecessary instructions impose a real cost: broader exploration, more reasoning tokens, higher inference cost — without improving task outcomes. The implication is that bloated context files are not neutral; they are actively harmful.
Furthermore, agents are highly susceptible to Context Anchoring. Telling an LLM what not to do ensures that the concept is front-and-center in its attention mechanism. If your AGENTS.md says “do not use tRPC”, the agent might still reach for it because the token tRPC is highly active in the context window.
The decision hierarchy for any constraint:
- Can a runtime gate enforce it? → Use the gate
- Can a toolchain config enforce it? → Use the config
- Neither? → It belongs in context
Multi-Layer Action Spaces and Economics
The cost and latency of agent orchestration scale directly with context size. As agents take on larger tasks, explicit definition of massive MCP (Model Context Protocol) toolsets bloats the context window.
The Solution: Push actions from the tool-calling layer to the OS layer. By equipping agents with a basic “Virtual Computer” (shell and filesystem access), they can interact with command-line utilities implicitly rather than parsing dozens of explicit JSON schema definitions. This action space offloading dramatically improves the economics of “Prompt Caching,” making high-capacity agent loops viable.
The “Learned Context Management” Fallacy
Some theories suggest that “The Bitter Lesson” applies to context—that as foundational models scale, they will natively learn to manage their own memory streams, rendering explicit file-centric state and Context Gates obsolete.
In ASDLC, we dispute this. Relying on a probabilistic model’s native “attention mechanism” to remember a critical business constraint from 30 turns ago is a regression to “Vibe Coding.” Explicit, deterministically structured context ensures the system fulfills contracts, rather than drifting on the model’s statistical average.
Distinctions
Context vs Guardrails
A distinction exists between Guardrails (Safety) and Context (Utility). Currently, many AGENTS.md files contain defensive instructions like “Do not delete files outside this directory” or “Do not output raw secrets.” This is likely a transitional state. OpenAI, Anthropic, Google, and platform wrappers are racing to bake these safety constraints directly into the inference layer. Soon, telling an agent “Don’t leak API keys” will be as redundant as telling a compiler “Optimize for speed.”
ASDLC Usage
In ASDLC, context is treated as version-controlled code, not ephemeral prompts.
Applied in:
- AGENTS.md Specification — The practical application of context engineering in repositories.
- Model Context Protocol — The standard for serving context to agents.
Related Patterns:
- Specs — Specs are context engineering in document form.
- Context Gates — Checkpoints where context is validated.
- Context Map — The structural pattern for organizing context.
- Agent Optimization Loop — Verifying context quality.
- Context Offloading — Operational practice for preventing context rot.
[!NOTE] Research Validation (InfiAgent, 2026): File-centric state management outperforms compressed long-context prompts. Replacing persistent file state with accumulated conversation history dropped task completion from 80/80 to 27.7/80 average, even with Claude 4.5 Sonnet. This validates treating context as a reconstructed view of authoritative file state, not as conversation memory.
References
- (2024).
OpenAI Best Practices for Prompt Engineering
.
Accessed January 8, 2026.
Foundational guidance on structuring prompts and context for optimal LLM performance.
- .
Constitutional AI Documentation
.
Accessed January 8, 2026.
Documentation on Anthropic's approach to AI alignment and context-based safety constraints.
- (2025).
How AI is Transforming Work at Anthropic
.
Accessed January 9, 2026.
Research identifying the cold start problem as the primary blocker in AI-assisted development.
- (2026).
InfiAgent: An Infinite-Horizon Framework for General-Purpose Autonomous Agents
.
Accessed January 10, 2026.
Empirical validation of file-centric state management for long-horizon agent stability.
- (2026).
Agile in the AI Era: Why 'Boring' Architecture Is Your Secret Weapon
.
Accessed February 12, 2026.
Argues that domain-driven structure ('Screaming Architecture') is critical context for AI agents.
- .
AGENTS.md outperforms skills in our agent evals
.
Accessed February 16, 2026.
Validation of passive context mapping over active tool retrieval for static knowledge.
- (2026).
Evaluating AGENTS.md: Are Repository-Level Context Files Helpful for Coding Agents?
.
ETH Zurich / LogicStar.ai.
Empirical study showing that unnecessary context file instructions increase agent reasoning cost and reduce task success rates. Agents follow instructions faithfully — making constraint minimalism a performance concern, not just an aesthetic one.