Loading...
Loading...
Prompt engineering optimizes a single message. Context engineering optimizes the entire information pipeline — what gets loaded, when, in what form, and at what cost.
This is a preview — the full guide is in our Ebook Bundle.
The bundle includes the complete Context Engineering playbook, implementation templates, worked examples with Redis and Pinecone, and all 143 agent skills.
An AI model's output is bounded by its context window — everything it can "see" at once. Context engineering is the discipline of managing that window deliberately: deciding what information to include, how to compress it, how to retrieve it on demand, and how to cache it efficiently across calls.
Where prompt engineering asks "how do I phrase this request?", context engineering asks "what should the model know right now, and how did that information get here?". It spans memory architecture, retrieval systems, caching layers, and token budget allocation.
A framework for thinking about information management in AI systems.
Context engineering is baked into the AI Starter Package at every layer — not bolted on after the fact.
Our memory system maps the four cognitive memory types to concrete file-based tiers. Working memory (Tier 1–2), episodic memory (Tier 6), semantic memory (Tier 3 + 5), and procedural memory (Tier 4) are all present, bounded, and auto-maintained.
Read the Memory System Guide →The Auditor agent reviews candidate learnings in knowledge-nominations.md before promoting them to knowledge-base.md. Every entry requires a [Source:] tag. The knowledge base stays high-signal because noise is blocked at the gate, not cleaned up after the fact.
Context pruning, selective file reads, and model routing are pre-configured. Sessions load only the relevant agent and current memory — not the full agent library. Typical cost reduction: 40–60%.
Read the Token Optimization Guide →12 automated hooks fire on git, file, and session events. The PreCompact hook saves state before auto-compaction. The SessionStart hook restores context and surfaces degradation signals. Use /safe-clear proactively when tool call count exceeds ~30 or output quality degrades.
These patterns are common, easy to fall into, and quietly expensive.
In an agent swarm, context does not live in one place — it flows between agents as tasks are handed off. Each agent should receive only the context slice it needs, not a full dump of everything every parent agent knows.
The orchestrator is the only agent that holds the full task context. Subagents receive purpose-built slices. This keeps each agent's window lean, reduces cost, and prevents cross-contamination of concerns between agents.
| Tool | Role in context engineering | Category |
|---|---|---|
Redis | Semantic cache + multi-tier L2 store | Caching |
Pinecone | Managed vector database for embeddings at scale | RAG |
ChromaDB | Local-first vector store, zero-infra for dev | RAG |
LangChain | Chunking pipelines, retrieval chains, document loaders | Orchestration |
Claude Code MCP | Structured entity + relation memory via knowledge graph | Memory |
The AI Starter Package ships with the full 4-pillar context engineering framework implemented: 7-tier memory, auditor-gated knowledge, token-optimised loading, health monitoring hooks, and multi-agent context routing out of the box.
View Pricing