RAG, LLM‑Wiki, or GBrain? Which Knowledge Architecture Your Agent Needs

We used to think the hard part of AI was model quality. Today the hard part is memory.

Throw a trillion tokens at an LLM and it still "forgets" between conversations. Recent work — from Andrej Karpathy's LLM‑Wiki idea to Garry Tan's GBrain — shows why: retrieval, compounding, and autonomous action are different problems that need different architectures. Picking the right one (or combining them) is what turns a helpful assistant into an agent that actually improves and acts for you.

What each architecture solves

1) RAG — the scalable retriever

Key idea: embed your documents into vectors and retrieve relevant chunks at query time.
Strength: scales to tens or hundreds of thousands of docs; freshness is trivial (re-embed changed docs); it's auditable and production-ready.
Tradeoff: it never "learns" between queries and can suffer from chunking — important context may be split across vectors.

When to pick RAG: you have large, frequently changing corpora and need a fast, auditable assistant.

2) LLM‑Wiki — the compounding compiler

Key idea: the model reads sources once and writes a persistent, interlinked markdown wiki that it maintains and updates.
Strength: knowledge compounds. Syntheses and cross‑references accumulate, so the system gets measurably smarter over time.
Tradeoff: higher upfront compute and maintenance. Best for hundreds (not hundreds of thousands) of high‑value sources.

When to pick LLM‑Wiki: you need deep domain expertise, research dossiers, or a knowledge base that improves with use (great for writing blogs).

3) GBrain / Fat Skills — the operator

Key idea: intelligence and workflows live in "fat" skill documents and a thin harness executes them (including croned jobs and always‑on detectors).
Strength: the agent doesn’t just know — it acts. It can enrich entity pages, monitor signals, file reports, and send alerts while you sleep.
Tradeoff: engineering overhead and governance. Skills mutate state and require tests, idempotency, and audit logs. GBrain is currently optimized for a power‑user setup.

When to pick Fat Skills: you want autonomous monitoring or repeated workflows (e.g., portfolio signals, scheduled digests).

The practical hybrid — Retrieve → Compile → Act

These patterns are complementary:

Use RAG for broad retrieval and freshness.
Use an LLM‑Wiki for high‑value pages where compounding matters.
Use Fat Skills to run scheduled enrichments, signal detectors, and to write back into the wiki.

For traders (how to map this to your portfolio)

RAG: fast lookups across filings, broker docs, and news.
LLM‑Wiki: persistent company dossiers (dividends, corporate actions, rules like LICI/LICI‑style constraints).
Fat Skills: cron jobs that monitor price moves, earnings calendars, and corporate actions; they write alerts to the wiki and notify you (Telegram/email).

Can we use GBrain for our wiki?

Short answer: yes — as the automation/operator layer.

How I'd integrate GBrain:

Store the wiki as a git‑backed Obsidian vault (markdown) or Postgres + pgvector.
Use pgvector for a RAG layer (fast retrieval).
Run GBrain fat skills to:
- Enrich entity pages (people, companies).
- Run signal‑detectors (news / price triggers).
- File audit reports and digests into reports//.md.
Add a periodic LLM‑Wiki compile job for high‑value pages so syntheses compound.

Caveats:

GBrain is opinionated and needs engineering to adapt for multi‑user teams.
Skills mutate state — add tests, idempotency, and auditing.
For enterprise scale, combine with a retrieval layer for speed.