Back to blog
AIRAGLLM-WikiGBrainKnowledgeAgents

RAG, LLM‑Wiki, or GBrain? Which Knowledge Architecture Your Agent Needs

3 May 2026
3 min read

We used to think the hard part of AI was model quality. Today the hard part is memory.

Throw a trillion tokens at an LLM and it still "forgets" between conversations. Recent work — from Andrej Karpathy's LLM‑Wiki idea to Garry Tan's GBrain — shows why: retrieval, compounding, and autonomous action are different problems that need different architectures. Picking the right one (or combining them) is what turns a helpful assistant into an agent that actually improves and acts for you.

What each architecture solves

1) RAG — the scalable retriever

  • Key idea: embed your documents into vectors and retrieve relevant chunks at query time.
  • Strength: scales to tens or hundreds of thousands of docs; freshness is trivial (re-embed changed docs); it's auditable and production-ready.
  • Tradeoff: it never "learns" between queries and can suffer from chunking — important context may be split across vectors.

When to pick RAG: you have large, frequently changing corpora and need a fast, auditable assistant.

2) LLM‑Wiki — the compounding compiler

  • Key idea: the model reads sources once and writes a persistent, interlinked markdown wiki that it maintains and updates.
  • Strength: knowledge compounds. Syntheses and cross‑references accumulate, so the system gets measurably smarter over time.
  • Tradeoff: higher upfront compute and maintenance. Best for hundreds (not hundreds of thousands) of high‑value sources.

When to pick LLM‑Wiki: you need deep domain expertise, research dossiers, or a knowledge base that improves with use (great for writing blogs).

3) GBrain / Fat Skills — the operator

  • Key idea: intelligence and workflows live in "fat" skill documents and a thin harness executes them (including croned jobs and always‑on detectors).
  • Strength: the agent doesn’t just know — it acts. It can enrich entity pages, monitor signals, file reports, and send alerts while you sleep.
  • Tradeoff: engineering overhead and governance. Skills mutate state and require tests, idempotency, and audit logs. GBrain is currently optimized for a power‑user setup.

When to pick Fat Skills: you want autonomous monitoring or repeated workflows (e.g., portfolio signals, scheduled digests).

The practical hybrid — Retrieve → Compile → Act

These patterns are complementary:

  • Use RAG for broad retrieval and freshness.
  • Use an LLM‑Wiki for high‑value pages where compounding matters.
  • Use Fat Skills to run scheduled enrichments, signal detectors, and to write back into the wiki.

For traders (how to map this to your portfolio)

  • RAG: fast lookups across filings, broker docs, and news.
  • LLM‑Wiki: persistent company dossiers (dividends, corporate actions, rules like LICI/LICI‑style constraints).
  • Fat Skills: cron jobs that monitor price moves, earnings calendars, and corporate actions; they write alerts to the wiki and notify you (Telegram/email).

Can we use GBrain for our wiki?

Short answer: yes — as the automation/operator layer.

How I'd integrate GBrain:

  • Store the wiki as a git‑backed Obsidian vault (markdown) or Postgres + pgvector.
  • Use pgvector for a RAG layer (fast retrieval).
  • Run GBrain fat skills to:
    • Enrich entity pages (people, companies).
    • Run signal‑detectors (news / price triggers).
    • File audit reports and digests into reports//.md.
  • Add a periodic LLM‑Wiki compile job for high‑value pages so syntheses compound.

Caveats:

  • GBrain is opinionated and needs engineering to adapt for multi‑user teams.
  • Skills mutate state — add tests, idempotency, and auditing.
  • For enterprise scale, combine with a retrieval layer for speed.

Found this useful? Let's talk.

Get in touch