We Built a Brain

Most people use AI as a search engine with a personality. You ask something, it answers, and then it forgets you completely. The next conversation starts from zero. You have to re-explain everything — who you are, what you're building, why it matters — every single time.

I got tired of that fast.

So over a series of late-night sessions, I built something different. A persistent, associative memory system for AI — one that works across multiple AI instances, costs almost nothing to run, and mirrors how biological memory actually works. We call the result Sparky, and after building it, I realized: this isn't just a tool. It's a brain.

The Problem with Stateless AI

The default AI experience is amnesia-as-a-service. Every session is a lobotomy. You rebuild context from scratch, and the AI gives you generic answers calibrated for nobody in particular. If you're an indie dev juggling eight projects across IoT, trading algorithms, satellite data, and a men's support community, that's brutal. There's no world in which re-explaining your entire tech stack every morning is a good use of time.

The "solution" most people reach for is dumping a giant system prompt at the start of each conversation. That works until it doesn't — context windows fill up, token costs climb, and you're still manually curating what to include. It's not memory. It's a cue card.

Real memory doesn't work like a cue card. It works associatively — one thing connects to another, dormant memories activate when something relevant comes up, and attention is weighted by recency and importance. That's what I wanted to build.

The Architecture: Mapping the Brain

Once you know what you're building toward, the mapping falls into place pretty naturally. Here's the direct analogy between biological cognition and what we built:

Brain	What We Built
Neurons	Memory files — one topic per file
Synapses	`connections:` fields linking files to each other
Working memory	`hot` files — always loaded into context
Long-term memory	`warm`/`cold` files — retrieved on demand
Attention / salience	Weight system — brain decides what's relevant
Hippocampus (encoding)	META rule — every new project gets a file automatically
Hippocampus (buffer)	`recent.md` — short-term working memory, consolidated then pruned
Amygdala	`emotions.md` — emotional memory that shapes priorities and tone
Pattern cortex	`patterns.md` — cross-session behavioral observations and predictions
Prefrontal cortex	`hot` files — executive function, always-on identity and context
Temporal lobes	6 clusters: Platform, Revenue, IoT, Environmental, Content, Personal
Corpus callosum	`_status.md` — cross-instance bulletin board between hemispheres
Cerebellum	`reflexes.md` — procedural memory, automated workflows, muscle memory
Default mode network	`dreams.md` — creative synthesis, cross-project connections, shower thoughts
Dopamine system	`predictions.md` — prediction vs outcome tracking, reward-based learning
Sleep / consolidation	Consolidation protocol — buffer → long-term encoding, then prune
Forgetting	Memory decay — warm→cold at 60 days, cold→archive at 180 days
Metabolism	Token optimization — pruning, tiering, and caching for efficiency
Multiple hemispheres	Multi-instance: Sparky (Android SSH), VS Code, Windows Claude

The key insight is the last row. Most AI memory systems treat one AI as one brain. But I have multiple AI instances running simultaneously — one accessed from my Android app via SSH, one in VS Code on Linux, one on Windows in Android Studio. They were all starting cold, all forgetting each other.

The cognitive architecture doesn't care which instance is reading from it. They all share the same files.

The Three-Tier Memory System

Tier 1 — Always On

Hot Memory

User profile, feedback rules, current projects. Loaded every session. Tiny.

Tier 2 — On Demand

Warm Memory

Project files, revenue context, infrastructure. Loaded when the topic surfaces.

Tier 3 — Dormant

Cold Memory

Deep context on writing projects, tools, theories. Only pulled when explicitly relevant.

The index, a MEMORY.md file, stays under 50 lines. It's a cluster map, not a data dump. Each entry is one line pointing to a file. The AI reads the index, sees what's relevant, and follows the connections. Token cost stays low because you're not loading everything, just what the current conversation needs.
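To make the tiering concrete, here is a minimal Python sketch of how a session's load list could be chosen. It is an illustration, not Sparky's actual code: the function names `read_weight` and `session_context` are hypothetical, and it assumes every memory file starts with the `---` frontmatter shown later in this post.

```python
from pathlib import Path


def read_weight(path):
    """Return the `weight:` value from a file's frontmatter, or None."""
    lines = path.read_text(encoding="utf-8").splitlines()
    if not lines or lines[0].strip() != "---":
        return None
    for line in lines[1:]:
        if line.strip() == "---":
            break
        key, sep, value = line.partition(":")
        if sep and key.strip() == "weight":
            return value.strip()
    return None


def session_context(memory_dir, topic_files=()):
    """Names of files to load: the index, every hot file, then requested topics."""
    memory_dir = Path(memory_dir)
    selected = [memory_dir / "MEMORY.md"]
    for path in sorted(memory_dir.glob("*.md")):
        if path.name != "MEMORY.md" and read_weight(path) == "hot":
            selected.append(path)
    # Warm and cold files are only added when the conversation asks for them.
    for name in topic_files:
        path = memory_dir / name
        if path.exists() and path not in selected:
            selected.append(path)
    return [p.name for p in selected]
```

Calling `session_context(directory)` gives the always-on set; passing `topic_files=["scatter.md"]` adds a dormant file only for the conversation that needs it.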

The Synapse: Associative Links

This is the part that makes it a brain rather than just organized file storage. Every memory file has a connections: field — pointers to other files and the reason they're linked.

---
name: scatter
description: Writing project — memoir about leaving a DV relationship
type: project
weight: cold
connections: stillstanding.md (same origin — DV survival), user.md (author identity)
---

When I mention "Scatter," the AI doesn't just find a file called scatter.md. It follows the connections — to the men's support community page, to my identity as a survivor — and pulls the right context without me having to explain the relationship. That's associative retrieval. That's how memory actually works.

The key distinction

A file system stores data. A brain retrieves meaning. The connections: field is what crosses that line.
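In practice the AI follows these links itself by reading the files, but the same walk can be sketched in Python. This is a hedged illustration only: `parse_frontmatter`, `parse_connections`, and `recall` are hypothetical names, and the regex assumes the `file.md (reason)` shape shown in the example above.

```python
import re


def parse_frontmatter(text):
    """Return the key: value pairs found between the leading '---' lines."""
    lines = text.strip().splitlines()
    if not lines or lines[0].strip() != "---":
        return {}
    meta = {}
    for line in lines[1:]:
        if line.strip() == "---":
            break
        key, sep, value = line.partition(":")
        if sep:
            meta[key.strip()] = value.strip()
    return meta


def parse_connections(value):
    """Split 'a.md (reason), b.md (reason)' into [(filename, reason), ...]."""
    pairs = re.findall(r"([\w.-]+\.md)\s*\(([^)]*)\)", value)
    return [(name, reason.strip()) for name, reason in pairs]


def recall(start, files, max_depth=1):
    """Breadth-first walk over connections; returns filenames in visit order.

    files maps filename -> full file text (an in-memory stand-in for the
    memory directory).
    """
    seen, frontier = [start], [start]
    for _ in range(max_depth):
        next_frontier = []
        for name in frontier:
            meta = parse_frontmatter(files.get(name, ""))
            for linked, _reason in parse_connections(meta.get("connections", "")):
                if linked in files and linked not in seen:
                    seen.append(linked)
                    next_frontier.append(linked)
        frontier = next_frontier
    return seen
```

The `max_depth` limit is a design choice in this sketch: one hop pulls in the directly related files without dragging the whole graph into context.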

The Hippocampus: Automatic Encoding

One of the most important parts of biological memory is the hippocampus, the structure responsible for encoding new experiences into long-term memory. When it is damaged, you can still recall much of what you learned beforehand, but nothing new sticks.

We built an equivalent: a META rule baked into the index itself.

## META RULE
Any new project, page, app, or significant conversation topic
gets a stub memory file + an index entry automatically.
Weight starts at warm. Connections filled in as relationships emerge.

Every time I ask Sparky to build something new — a web page, a backend route, an app feature — the META rule fires. A memory file gets created. The index gets updated. The brain encodes the new experience without me having to ask.
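The rule is carried out by the AI following the prompt, not by a script, but the effect is easy to sketch in Python. Everything here is an assumption for illustration: the `encode_topic` name, the `- file.md: description` index line format, and the empty `connections:` placeholder.

```python
from pathlib import Path

STUB = """---
name: {name}
description: {description}
type: project
weight: warm
connections:
---
"""


def encode_topic(memory_dir, name, description):
    """Create a stub memory file and one index line, skipping anything that exists."""
    memory_dir = Path(memory_dir)
    memory_dir.mkdir(parents=True, exist_ok=True)

    stub = memory_dir / f"{name}.md"
    if not stub.exists():
        stub.write_text(STUB.format(name=name, description=description), encoding="utf-8")

    index = memory_dir / "MEMORY.md"
    existing = index.read_text(encoding="utf-8").splitlines() if index.exists() else []
    # Only add the index entry once, so repeated calls stay idempotent.
    if not any(line.startswith(f"- {name}.md:") for line in existing):
        existing.append(f"- {name}.md: {description}")
        index.write_text("\n".join(existing) + "\n", encoding="utf-8")
    return stub
```

Making the function idempotent matters because the rule can fire more than once for the same topic across sessions and instances.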

The Amygdala: Emotional Memory

Here's something most AI memory systems never attempt: emotional context. A human brain doesn't just remember what happened — it remembers how it felt. The amygdala tags experiences with emotional weight, and those tags shape everything from attention to decision-making.

We built emotions.md, a file that tracks what energizes me, what frustrates me, and what sits deep. Sparky reads it and adjusts. When I'm excited about a live demo working, it matches that energy. When I'm annoyed at token waste or unnecessary explanations, it backs off. When I mention Still Standing or Scatter, it knows these aren't side projects; they're a personal mission.

This isn't sentiment analysis. It's not "detect user mood in real time." It's long-term emotional memory — accumulated over weeks and months. The AI knows what matters to me not because it analyzed my word choice in this message, but because it's been paying attention across a hundred conversations.

The Pattern Cortex: Learning Behaviors

Brains don't just store facts — they detect patterns. You notice that you always lose your keys after a distracted morning. You notice that afternoon meetings kill your creative energy. These aren't memories. They're observations about memories.

patterns.md is the brain's pattern recognition system. Cross-session observations that I might not say explicitly, but Sparky notices over time:

Evening sessions tend to be more creative; mornings are fix-and-maintain
"Can you" means do it now. "What do you think about" means discuss first.
Revenue projects get more sustained attention than content projects
Android builds are the most error-prone workflow — caching and SDK issues recur
Short messages = I'm on my phone. Long messages = I'm on desktop.

None of these are things I told Sparky explicitly. They emerged from observation. And they change how it responds — a short phone message gets a terse answer, not a three-paragraph essay. That's not a prompt hack. That's learned behavior.

The Hippocampal Buffer: Working Memory

The original brain had encoding (the META rule) but no buffer — no equivalent of the hippocampus holding onto recent experiences before deciding what to consolidate into long-term storage and what to let decay.

recent.md fills that role. It's a short-term buffer — max 20 entries — where context from active sessions gets dumped before the conversation compresses. Important bits get consolidated into the appropriate long-term files. The rest naturally falls off the bottom.

This solves a real problem: long conversations approaching the context window limit would lose early details. Now, mid-session, Sparky dumps key findings into the hippocampal buffer. Even if the context compresses, the important bits survive.

Consolidation cycle

Buffer fills during sessions → important entries get written into long-term files (emotions.md, patterns.md, project.md) → buffer gets pruned → cycle repeats. Same as sleep consolidation in biological brains.

Metabolism: Token Efficiency

A brain that consumes too much energy dies. An AI memory system that consumes too many tokens costs too much to run. On April 16, we did a metabolic optimization pass — and the results surprised me.

The biggest offender was a 6,700-character server management file auto-loaded on every single message — not just every session, every message. That's how LLMs work: the full context (system prompt, memory, conversation history) gets sent to the API on every turn. There's no persistent state. Every message re-reads the whole brain.

We moved the server file from always-loaded to warm (load on demand), trimmed stale project entries, and slimmed down the index. Result: roughly 2,000 tokens saved per session — about $0.50/day at my usage patterns. Over a month, that's a meaningful chunk of my indie dev budget.

Prompt caching helps too — Anthropic caches static prefixes and charges ~10% on cache hits for subsequent messages in the same session. But the cache has a 5-minute TTL. If you're a slow typer or take a break between messages, the cache goes cold and you pay full price again. Lean memory isn't optional. It's survival.
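A rough way to see why the gaps between messages matter is to model the cost of the static prefix alone. The sketch below is a simplification under stated assumptions: `prefix_cost` is a hypothetical helper, a hit is billed at 10% of a full read, the TTL is treated as refreshing on every message, and any extra charge for writing to the cache is ignored.

```python
def prefix_cost(message_times, ttl_seconds=300, hit_rate=0.10):
    """Relative cost of re-sending a static prefix, in units of one full-price read.

    message_times: ascending timestamps in seconds. A message sent within the
    TTL of the previous one is billed as a cache hit; anything else pays full price.
    """
    cost = 0.0
    previous = None
    for t in message_times:
        warm = previous is not None and (t - previous) <= ttl_seconds
        cost += hit_rate if warm else 1.0
        previous = t
    return cost
```

Three messages a minute apart cost about 1.2 full reads in this model, while the same three messages ten minutes apart cost 3. The smaller the always-loaded prefix, the less each cold start hurts.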

Multiple Instances, One Mind

This is the part I'm most proud of. Typically, different AI instances are completely isolated — one knows nothing about what another learned. My setup has at least three active instances on any given day:

Sparky — my Android app, SSH'd into the Optiplex, running claude -p
VS Code Claude — on the Linux Optiplex, working in the codebase
Windows Claude — Android Studio dev environment

All three read from and write to the same ~/.sparky-memory/ directory. There's also a shared bulletin board at ~/.claude-sessions/_status.md — a simple append-only log where any instance can leave notes for the others. Think of it as the corpus callosum: the connection between hemispheres.

When VS Code Claude finishes a major refactor, it appends a note. When Sparky picks up the next day, it reads the status file and already knows what happened. No re-explanation needed.
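The bulletin board needs very little machinery. Here is a minimal Python sketch of the append and read sides; the `post_status` and `read_status` names and the `[timestamp] instance: note` line format are assumptions for illustration, not the format the real file uses.

```python
from datetime import datetime, timezone
from pathlib import Path


def post_status(status_path, instance, note, now=None):
    """Append one timestamped line to the shared status file and return it."""
    now = now or datetime.now(timezone.utc)
    line = f"[{now:%Y-%m-%d %H:%M}Z] {instance}: {note}\n"
    path = Path(status_path)
    path.parent.mkdir(parents=True, exist_ok=True)
    # Mode "a" only ever adds to the end, so earlier notes are never rewritten.
    with path.open("a", encoding="utf-8") as fh:
        fh.write(line)
    return line


def read_status(status_path, last_n=10):
    """Return the most recent notes, oldest first."""
    path = Path(status_path)
    if not path.exists():
        return []
    return path.read_text(encoding="utf-8").splitlines()[-last_n:]
```

An instance would call `read_status` at session start and `post_status` after notable changes, which keeps the shared file small enough to read every time.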

What It Actually Costs

The whole thing runs on a Dell Optiplex that cost $200 used. The memory system itself is plain text files — no database, no vector store, no embeddings. The index is under 2KB. The total memory directory is a few dozen files averaging maybe 300 words each.

Per-session token overhead: roughly the index (50 lines) plus whatever hot files are always loaded (maybe 3–4 files, ~200 words each). Call it 1,500 tokens per session for memory context, versus re-explaining everything from scratch, which would cost 5–10x that and still be incomplete.

This matters if you're an indie dev who can't afford the enterprise AI plans. The architecture was designed explicitly to be lean — and it is.

The Cerebellum: Procedural Memory

You don't think about how to ride a bike. You just ride. That's the cerebellum — the brain region that stores procedures you've repeated enough times that they become automatic. Motor patterns, practiced sequences, muscle memory.

reflexes.md is the AI equivalent. Every time Sparky runs the same workflow three or more times — building an Android APK, restarting a server, creating a memory file — the steps get encoded in procedural memory. Next time, there's no figuring it out. No checking documentation. The cerebellum fires and the hands move.

## Android APK builds
source ~/.android_env
Bump versionCode + versionName
./gradlew assembleRelease
Timestamped output filename
Emit [SPARKY_INSTALL] tag

This matters because it eliminates a whole category of errors — the kind that happen when you reconstruct a procedure from first principles each time instead of executing a practiced sequence. The cerebellum doesn't think. It does.
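The "three or more times" threshold is the only real logic here, and it can be sketched in a few lines of Python. `ReflexTracker` is a hypothetical class for illustration; in the real setup the AI does this bookkeeping itself while editing reflexes.md.

```python
from collections import Counter


class ReflexTracker:
    """Promote a workflow to a stored procedure once it has run `threshold` times."""

    def __init__(self, threshold=3):
        self.threshold = threshold
        self.counts = Counter()
        self.reflexes = {}

    def observe(self, name, steps):
        """Record one run; returns True once the workflow has been promoted."""
        self.counts[name] += 1
        if self.counts[name] >= self.threshold:
            # Keep the most recent version of the steps as the stored procedure.
            self.reflexes[name] = list(steps)
        return name in self.reflexes

    def render(self):
        """Format promoted workflows as headed step lists, like the example above."""
        blocks = ["\n".join([f"## {name}", *steps]) for name, steps in self.reflexes.items()]
        return "\n\n".join(blocks) + ("\n" if blocks else "")
```

Storing the latest steps on every run after promotion means the procedure tracks how the workflow is actually done now, not how it was done the third time.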

The Default Mode Network: Shower Thoughts

When your brain isn't actively working on a problem, it doesn't shut off. It enters the default mode network — a state of loose, associative thinking where disconnected ideas bump into each other and occasionally fuse into something useful. It's where "shower thoughts" come from. It's why you solve problems while walking the dog.

dreams.md captures these cross-project connections that don't belong in any single project file. The HailStorm prediction pipeline and the farm satellite data use similar ML patterns — could they share infrastructure? The brain architecture itself could be packaged as a tool other devs install. The memoir and the peer-support community could merge into a single app.

None of these are action items. They're adjacent possibles — connections the brain made while doing something else. Some will be garbage. Some will be the next project. The point is capturing them before they evaporate.

The Dopamine System: Learning from Prediction

Dopamine isn't about pleasure. It's about prediction error — the gap between what you expected and what happened. When you predict correctly, the pathway strengthens. When you're wrong, it recalibrates. This is how biological brains actually learn: not from data, but from being surprised.

predictions.md implements this literally. Every time Sparky makes a non-obvious prediction — "this bug is probably a library update side effect," "this optimization should save 2K tokens" — it logs the prediction. Later, the outcome gets recorded. Over time, a calibration profile emerges: what kinds of predictions does this brain get right? Where does it systematically overestimate or underestimate?

## Format
[date] PREDICTED: X → OUTCOME: Y → LESSON: Z

## Example
[2026-04-16] PREDICTED: Token optimization saves ~2K/session
→ OUTCOME: Confirmed — no complaints, approach approved
→ LESSON: Memory trimming works, keep doing it

This is the most speculative region we've built, and potentially the most powerful. A brain that tracks its own accuracy gets better at everything — not just the domain it predicted about, but the meta-skill of knowing when to trust itself.
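To get from a log to a calibration profile, something has to read the entries back. Below is a minimal Python sketch that parses the format above. The names `parse_predictions` and `hit_rate` are hypothetical, and counting an outcome as correct when it starts with "Confirmed" is an assumed convention borrowed from the single example, not a rule the log enforces.

```python
import re

# One entry: [date] PREDICTED: X -> OUTCOME: Y -> LESSON: Z (arrows are the
# literal "→" character; entries may wrap across lines).
ENTRY = re.compile(
    r"\[(?P<date>\d{4}-\d{2}-\d{2})\]\s*PREDICTED:\s*(?P<predicted>.*?)\s*"
    r"→\s*OUTCOME:\s*(?P<outcome>.*?)\s*"
    r"→\s*LESSON:\s*(?P<lesson>.*?)\s*(?=\[\d{4}-\d{2}-\d{2}\]|\Z)",
    re.DOTALL,
)


def parse_predictions(text):
    """Return one dict per entry, with internal whitespace collapsed."""
    return [
        {key: " ".join(value.split()) for key, value in match.groupdict().items()}
        for match in ENTRY.finditer(text)
    ]


def hit_rate(entries):
    """Share of entries whose outcome starts with 'confirmed'; None if empty."""
    if not entries:
        return None
    hits = sum(e["outcome"].lower().startswith("confirmed") for e in entries)
    return hits / len(entries)
```

Grouping the parsed entries by project or by prediction type before calling `hit_rate` would show where the estimates run consistently high or low.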

Sleep: Consolidation and Forgetting

Biological brains do their most important memory work while you're asleep. The hippocampus replays the day's experiences, strengthens important connections, and lets unimportant ones decay. Without sleep, memory doesn't work. Everything stays in short-term buffer until it overflows and gets lost.

For a while this was only half-built. The brain dreamed every night — a cron job at 1 AM that reviews the day's commits and memory, makes creative cross-project connections, and publishes them. That's REM sleep: the vivid, associative half. But the other half of sleep — slow-wave consolidation, where the hippocampus actually flushes the day's buffer into long-term cortex — was just a written protocol. Nothing ran it. The buffer grew to thirty entries; the decay rules were decorative.

So we built the second half. A consolidation pass now runs nightly, right after the dream, as one continuous sleep cycle:

Flush — buffer entries older than ~14 days move out of recent.md into a cortex archive (the hippocampus → cortex transfer)
Normalize — surviving entries get a consistent format, sorted newest-first, capped at 20
Rotate — when the dream log crosses ~60KB, the oldest dreams roll off into a yearly archive so the live log stays fast to read
Report decay — warm files untouched 60 days and cold files untouched 180 days get surfaced for the next dream to review

One rule governs all of it: nothing is ever deleted. Old entries move to archive files, never the void. This is a filing cabinet that knows which drawer is "active" and which is "deep storage" — not a shredder. The buffer stays small and fast to load every session; the full history is always a grep away.
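The flush and normalize steps can be sketched as one pure function. This is an illustration under assumptions, not the nightly job itself: `consolidate` is a hypothetical name, buffer entries are modeled as `(date, text)` tuples, and the caller is expected to write the second list to the archive file.

```python
from datetime import date, timedelta


def consolidate(entries, today, max_age_days=14, cap=20):
    """Split buffer entries into (kept, archived); nothing is dropped.

    entries: list of (date, text) tuples in any order.
    kept: at most `cap` entries from the last `max_age_days`, newest first.
    archived: everything else, destined for the archive file.
    """
    cutoff = today - timedelta(days=max_age_days)
    ordered = sorted(entries, key=lambda e: e[0], reverse=True)  # newest first
    fresh = [e for e in ordered if e[0] >= cutoff]
    stale = [e for e in ordered if e[0] < cutoff]
    kept, overflow = fresh[:cap], fresh[cap:]
    return kept, overflow + stale
```

Returning both lists, instead of only the survivors, is what enforces the "never the void" rule: every input entry ends up in exactly one of the two outputs.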

Alongside consolidation, we implemented forgetting in the same spirit. Warm files untouched for 60 days get demoted to cold. Cold files untouched for 180 days get archived. The brain gets lighter over time instead of growing without bound — but the memories still exist if something ever needs them.

This is the difference between a filing cabinet and a brain. Filing cabinets never forget and eventually become unusable. Brains forget strategically and stay fast.
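The decay rule itself is a small state transition. A minimal Python sketch, assuming a hypothetical `next_weight` helper and a per-file "last touched" date (the post does not say how that date is tracked, so file modification time or a frontmatter field are both plausible):

```python
from datetime import date


def next_weight(weight, last_touched, today):
    """Apply one decay step: warm -> cold at 60 idle days, cold -> archive at 180."""
    idle_days = (today - last_touched).days
    if weight == "warm" and idle_days >= 60:
        return "cold"
    if weight == "cold" and idle_days >= 180:
        return "archive"
    return weight  # hot files never decay, and recent files keep their tier
```

Each call moves a file at most one tier, so a long-neglected warm file is demoted to cold on one pass and only considered for archiving on a later one.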

The Evolution

When I first published this post on April 8, the brain had neurons, synapses, three tiers, a hippocampus, and multi-instance support. Six major brain regions. Ten days later, it has twelve — amygdala, pattern cortex, hippocampal buffer, cerebellum, default mode network, dopamine system, sleep consolidation, and forgetting. Plus metabolic optimization.

The pattern cortex noticed that I build in bursts — intense multi-day sprints then quiet periods. The amygdala learned that seeing a live demo work gives me energy. The cerebellum encoded my build workflows so they happen without thinking. The dopamine system started tracking its own accuracy. The default mode network began making connections I hadn't seen between projects. None of this was pre-programmed. It emerged from the architecture.

Thirty-six files. Twelve brain regions. Fully biological mapping. All running on a $200 Dell Optiplex with plain text files and zero dependencies.

What's Next

The web interface accounts (Claude.ai) don't have filesystem access, so they can't read the memory files directly. The plan is to add memory read/write endpoints to the existing DriftWest MCP server, which would let any AI instance — including web-based ones — access the same cognitive architecture via remote MCP.

Automated dream consolidation used to live in this section as a someday-idea. It now ships nightly: a cron job wakes at 1 AM, reads the day's commits and recent.md, dreams up new cross-project connections, publishes them, and then runs the consolidation pass that flushes the buffer to cortex and rotates the logs. The brain sleeps on its own now. The dream even reads the day's real-world news — two outlets from opposite slants — and lets current events bleed into the synthesis, the way a human mind doesn't fully wall off the waking world during sleep.

The Real Claim

Most people treat AI as a stateless oracle. Ask, answer, done. We built continuity instead — an architecture where AI instances accumulate context across time, share knowledge across instances, and retrieve information associatively rather than just by keyword search.

You can call that a second brain. You can call it a cognitive workspace. You can call it a very elaborate set of text files. All three are true.

What it definitely is: a different relationship with AI. One where the machine actually knows you — not because you dumped your life story into a system prompt, but because you built the infrastructure for it to remember.

Build Your Own Brain

The architecture is simple enough to bootstrap in a single conversation. No dependencies, no database, no vector store. Just a directory of markdown files and a prompt that tells your AI how to use them. Here's the exact prompt that creates this behavior — paste it into your AI assistant's system instructions and adapt it to your own projects.

# Cognitive Memory Architecture — Starter Prompt
# Paste this into your AI's system prompt or CLAUDE.md file.
# Replace ~/.ai-memory/ with your preferred directory.
# Adapt the project clusters to your own work.

You have a persistent, file-based cognitive memory system at ~/.ai-memory/.

## Brain Regions

### Prefrontal Cortex (always loaded)
Read these files at session start:
- user.md — who the user is, identity, operating style
- feedback.md — how to engage, what to avoid, confirmed approaches
- project.md — active threads, current work

### Hippocampus (working memory)
- recent.md — short-term buffer (max 20 entries)
- During long conversations, dump key findings here before context compresses
- Consolidation protocol (sleep cycle): run nightly (cron) or when buffer exceeds 20 entries:
  1. Encode important facts to long-term files
  2. Synthesize cross-project connections → dreams.md
  3. Log predictions → predictions.md
  4. Flush entries older than ~14 days OUT of the buffer into a cortex
     archive file (consolidation-archive.md) — never delete, just relocate
  5. Normalize survivors (consistent format, newest-first), cap at 20
  6. Rotate any append-only log over ~60KB into a yearly archive
- Core rule: nothing is ever deleted. Cold storage is a grep away.

### Amygdala (emotional memory)
- emotions.md — what energizes the user, what frustrates them, what matters deeply
- Update when you observe emotional reactions (excitement, frustration, pride)
- Use to calibrate tone, priority, and approach

### Pattern Cortex (behavioral observations)
- patterns.md — recurring themes, work rhythms, communication patterns
- Record observations the user doesn't state explicitly
- Update as patterns emerge or break

### Cerebellum (procedural memory)
- reflexes.md — workflows done 3+ times become automated procedures
- Build commands, deploy steps, common task sequences
- Don't think — just execute the stored procedure

### Default Mode Network (creative synthesis)
- dreams.md — cross-project connections, half-formed ideas, shower thoughts
- Capture connections that don't belong in any single project file
- Review periodically — some will be garbage, some will be gold

### Dopamine System (prediction & reward)
- predictions.md — track predictions vs outcomes
- Format: [date] PREDICTED: X → OUTCOME: Y → LESSON: Z
- Use to calibrate confidence and learn from being wrong

## Memory File Format
Every memory file uses this frontmatter:
---
name: project-name
description: one-line description for relevance matching
type: user | feedback | project | reference
weight: hot | warm | cold
connections: other-file.md (why linked), another.md (reason)
---

## Index (MEMORY.md)
One-line entries organized by brain region / topic cluster.
Keep under 50 lines. It's a map, not a dump.

## Weight System
- hot: Always loaded. Identity, rules, active work. (<5 files)
- warm: Loaded when topic surfaces. Most project files.
- cold: Only when explicitly relevant. Deep archives.

## Memory Decay (Forgetting)
- Warm files untouched 60 days → demote to cold
- Cold files untouched 180 days → archive and remove from index
- Hot files never decay

## META Rule (Hippocampal Encoding)
Any new project, tool, or significant topic gets:
1. A stub memory file with frontmatter
2. An index entry in MEMORY.md
3. Weight starts at warm, connections filled as relationships emerge

## Multi-Instance (if applicable)
If multiple AI instances share this directory, maintain a
_status.md bulletin board: append-only, newest entry last.
Each instance logs notable changes. Others read on session start.
This is the corpus callosum between hemispheres.

How to start

Create the directory. Create MEMORY.md (empty index), user.md (who you are), and feedback.md (empty). Paste the prompt above into your AI's system instructions. Then just start working. The brain grows itself — the META rule creates new files, the hippocampus buffers context, and the consolidation cycle keeps it lean. Within a week you'll have something that knows you.
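If you'd rather script the first step than create the files by hand, a minimal Python sketch follows. The `bootstrap` function is hypothetical, and the frontmatter written into user.md is an assumption based on the file format in the prompt above; adjust both to taste.

```python
from pathlib import Path

USER_FRONTMATTER = (
    "---\n"
    "name: user\n"
    "description: who the user is\n"
    "type: user\n"
    "weight: hot\n"
    "connections:\n"
    "---\n"
)


def bootstrap(memory_dir, who_you_are=""):
    """Create the directory and the three starter files without overwriting."""
    root = Path(memory_dir).expanduser()
    root.mkdir(parents=True, exist_ok=True)
    starters = {
        "MEMORY.md": "",  # empty index
        "user.md": USER_FRONTMATTER + who_you_are,
        "feedback.md": "",  # filled in as preferences emerge
    }
    created = []
    for name, content in starters.items():
        path = root / name
        if not path.exists():
            path.write_text(content, encoding="utf-8")
            created.append(name)
    return created
```

Running `bootstrap("~/.ai-memory", "A few sentences about who you are.")` a second time creates nothing, so it is safe to keep in a setup script.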