Karpathy LLM Wiki vs Three-System Knowledge Architecture

decisions architecture wiki knowledge-management

Comparison of Andrej Karpathy’s “LLM Wiki” pattern (published April 2026) against a three-system knowledge architecture: structured operational data (e.g., expertise.yaml), cross-session behavioral memory, and a wiki. Research conducted 2026-04-06.

Sources: Karpathy’s gist, Extended Brain Substack analysis, VentureBeat coverage.

Part 1: Karpathy’s Architecture

Three-Layer Structure

project/
  raw/                  # Layer 1: Immutable source documents
    assets/             # Downloaded images
    *.md, *.pdf, etc.   # Articles, papers, data files -- LLM reads, never writes
  wiki/                 # Layer 2: LLM-generated and LLM-maintained
    index.md            # Content catalog -- every page with one-line summary, by category
    log.md              # Append-only chronological record of ingests, queries, lint passes
    [entity pages]      # e.g. people, organizations, technologies
    [concept pages]     # e.g. theories, patterns, frameworks
    [summary pages]     # Source summaries, comparisons, analyses
  CLAUDE.md             # Layer 3: Schema -- tells LLM how wiki is structured, conventions, workflows

Processing Pipeline

Ingest (raw → wiki):

User drops source document into raw/
User tells LLM to process it
LLM reads source, discusses key takeaways with user
LLM writes a summary page in wiki/
LLM updates index.md with new page listing
LLM updates relevant entity and concept pages across wiki (10-15 pages per source)
LLM appends entry to log.md
User reviews in Obsidian in real-time

Query (wiki → answer → wiki):

User asks a question
LLM reads index.md first to locate relevant pages
LLM drills into those pages, follows links
LLM synthesizes answer with citations
Good answers get filed back into wiki as new pages — this is the compounding mechanism

Lint (wiki → wiki):

Periodic health check, user-initiated
LLM scans for contradictions between pages
Flags stale claims superseded by newer sources
Identifies orphan pages with no inbound links
Suggests missing pages for important concepts

What Makes It Different From RAG

Aspect	RAG	Karpathy Wiki
Knowledge state	Re-derived on every query	Compiled once, kept current
Accumulation	Nothing compounds	Everything compounds
Cross-references	Rebuilt each time via retrieval	Already exist in wiki pages
Contradictions	Discovered per-query (maybe)	Flagged during lint passes
Infrastructure	Vector DB, embeddings, retrieval pipeline	Plain markdown files + index
Transparency	Embeddings are black box	Every claim traceable to a .md file
Scale ceiling	Scales well	Works to ~100 sources / ~400k words, then needs search tooling

Part 2: The Three-System Architecture

System 1: Structured Operational Data (e.g., expertise.yaml)

Structured YAML or equivalent with defined sections: project state, known issues, filing results, implementation patterns, unvalidated observations. Machine-readable. Commands can parse it programmatically. Capped at a defined size. Must always be valid (enforced by tooling).

Best for: Operational data that automation reads. Live state, API results, known issues, counts, versions.

Limitations: Rigid schema, no cross-references, per-project silo.

System 2: Behavioral Memory (.claude/memory/ or equivalent)

Per-project folder of short markdown files with frontmatter. Each file captures a user preference, critical guardrail, or process rule. Injected into every session automatically. Created by the LLM when important patterns emerge in conversation.

Best for: How to work (user preferences, guardrails, process rules). Things the LLM needs to remember about behavior, not content.

Limitations: User does not control what gets remembered, no links between entries, can go stale.

System 3: Wiki (Obsidian-Compatible)

Markdown files with [[wiki links]] and #tags. Organized into categories: platform patterns, decisions, people, concepts. Index file with per-page summaries. Append-only log. Obsidian-compatible for graph view and Dataview queries.

Best for: Synthesized knowledge, architectural decisions with rationale, cross-cutting concepts, relationship-rich knowledge.

Limitations: Requires discipline to maintain. No automated ingest or lint pipeline by default.

Part 3: Gaps vs Karpathy

What Karpathy Does That The Three-System Architecture Doesn’t (By Default)

Automated ingest pipeline — drop a file in raw/, LLM processes and updates 10-15 pages. The three-system architecture has a raw/ folder concept but no default automation.
Query-to-wiki filing — good answers compound into new wiki pages. Without this convention, synthesis disappears into chat history.
Lint/health-check — periodic scan for contradictions, orphans, stale pages. No equivalent for the wiki layer by default.
LLM-navigable index — Karpathy’s index.md lists every page with a one-line summary. A folder-listing index is not useful as an LLM navigation aid.
Source traceability — every wiki page traces back to a raw/ source. The three-system wiki pages often cite “session notes” without linking to originals.

What The Three-System Architecture Does That Karpathy Doesn’t

Machine-readable structured knowledge — Karpathy’s wiki is for humans and LLMs. Structured YAML is for programmatic parsing by commands and automation.
Validation against live state — a self-improve command can validate observations against actual live systems. Karpathy’s lint checks internal consistency only.
Command-driven integration — knowledge is updated as a side effect of doing work, not as a separate activity.
Behavioral memory — user preferences and critical guardrails injected into every session. Karpathy has no equivalent.
Multi-project architecture — per-project knowledge isolation plus cross-project memory.

Part 4: Bridge Plan

Priority 1: LLM-Navigable Index

Change index.md from a folder listing to a per-page catalog with one-line summaries. The LLM reads this first on every query. Effort: 30 min.

Priority 2: Wiki Operations Schema

Add a “Wiki Operations” section to CLAUDE.md (or equivalent schema file) defining ingest, query, lint, and filing workflows step by step. Effort: 1 hour.

Priority 3: Ingest Pipeline

Build a /wiki-ingest command (or hook). Drop a file in raw/, run the command, LLM creates/updates pages, updates index, appends to log, moves file to raw/processed/. Effort: 2 hours.

Priority 4: Query-to-Wiki Filing

When an LLM synthesis is valuable, file it as a new wiki page with source attribution. Can be a /wiki-file command or just a documented convention. Effort: 30 min.

Priority 5: Wiki Lint

Periodic scan: orphans, contradictions, stale pages, missing index entries. Effort: 1 hour.

Part 5: System Comparison Table

System	Best For	Karpathy Equivalent
Structured operational data	Machine-parseable live state	None
Behavioral memory	User preferences, guardrails	None
Wiki	Synthesized knowledge, relationships	Core of Karpathy’s pattern

Part 6: Key Decision

Keep all three systems. They serve distinct purposes.

Karpathy has one system because he has one use case (research). An engineering operations framework has three use cases: operational data, behavioral rules, and synthesized knowledge. The right move is to connect the systems, not merge them.

Connection points:

When structured data is updated with significant findings, create/update corresponding wiki pages.
When behavioral memory captures an architectural decision, it should also exist as a wiki decision page.
When wiki lint finds a contradiction with structured data, flag for resolution.

The single most important thing to adopt from Karpathy: the compounding loop. Every good answer becomes a wiki page. The system gets smarter by being used.

config-driven-routing — example of a well-structured pattern page
multi-format-ingest-strategy — example of a decision page with rationale

Clarity Wiki

Explorer

karpathy-wiki-comparison

Karpathy LLM Wiki vs Three-System Knowledge Architecture

Part 1: Karpathy’s Architecture

Three-Layer Structure

Processing Pipeline

What Makes It Different From RAG

Part 2: The Three-System Architecture

System 1: Structured Operational Data (e.g., expertise.yaml)

System 2: Behavioral Memory (.claude/memory/ or equivalent)

System 3: Wiki (Obsidian-Compatible)

Part 3: Gaps vs Karpathy

What Karpathy Does That The Three-System Architecture Doesn’t (By Default)

What The Three-System Architecture Does That Karpathy Doesn’t

Part 4: Bridge Plan

Priority 1: LLM-Navigable Index

Priority 2: Wiki Operations Schema

Priority 3: Ingest Pipeline

Priority 4: Query-to-Wiki Filing

Priority 5: Wiki Lint

Part 5: System Comparison Table

Part 6: Key Decision

Graph View

Table of Contents

Backlinks

Clarity Wiki

Explorer

karpathy-wiki-comparison

Karpathy LLM Wiki vs Three-System Knowledge Architecture

Part 1: Karpathy’s Architecture

Three-Layer Structure

Processing Pipeline

What Makes It Different From RAG

Part 2: The Three-System Architecture

System 1: Structured Operational Data (e.g., expertise.yaml)

System 2: Behavioral Memory (.claude/memory/ or equivalent)

System 3: Wiki (Obsidian-Compatible)

Part 3: Gaps vs Karpathy

What Karpathy Does That The Three-System Architecture Doesn’t (By Default)

What The Three-System Architecture Does That Karpathy Doesn’t

Part 4: Bridge Plan

Priority 1: LLM-Navigable Index

Priority 2: Wiki Operations Schema

Priority 3: Ingest Pipeline

Priority 4: Query-to-Wiki Filing

Priority 5: Wiki Lint

Part 5: System Comparison Table

Part 6: Key Decision

Related

Graph View

Table of Contents

Backlinks