arscontexta: Building an OS for Thinking with AI
Six-part series by heinrich (@arscontexta) on using Claude Code as an operating layer for an Obsidian vault — treating the vault not as a note-taking app but as a living knowledge graph that an agent operates. Directly relevant to how we run our own vault + Claude Code setup.
Article 1: The Foundational Vault Concept
Tweet 2013045749580259680 — “obsidian + claude code 101”
The core argument: vibe coding changed how we write software; vibe note-taking changes how we think. A vault is just markdown files that link to each other, but it gives LLMs (which have no persistent memory) something to work against.
Key ideas:
- Every note is essentially a skill — curated knowledge that gets injected when relevant. The agent uses a vault index the same way Claude Code decides which skills to load.
- The vault encodes how you think, not just what you thought. Methodology becomes part of the system.
- Good notes are composable: each stands alone and makes sense when linked from elsewhere. Name notes as claims (“quality is the hard part”), not topics. The title becomes part of the sentence when linked.
- Link weaving: embed [[wiki links]] inside sentences, not at the bottom as footnotes. The agent follows your reasoning by following your links.
- Network > nodes: a note with many incoming links is more valuable than an isolated note. The network is the knowledge.
Navigation layers (how the agent orients without reading everything):
- Folder structure visible at session start
- Index file with one-sentence descriptions per note
- Topic pages (MOCs) that the agent uses as tables of contents and leaves breadcrumbs on for future sessions
Vault types: different purposes need different philosophies. Heinrich runs separate vaults for thinking/AI work and for client/project work. Same underlying patterns (markdown, claude.md, index), different rules.
Human role evolution: writer → editor; creator → curator. Your job becomes judgment.
Article 2: Yapping to PRDs
Tweet 2013718955576250466 — “Yapping to PRDs: Claude Code & Obsidian”
Recorded conversations (meetings, brainstorms) get transcript-mined into structured vault documents — not summarized, but deeply extracted. This externalizes tacit knowledge that you can’t easily write down because you don’t know you’re doing it.
The mining mindset: a 1-hour meeting should yield 10+ idea notes, multiple framework notes, several decisions with reasoning, state updates across multiple project hubs — not a 3-bullet summary. If you’re getting a short summary, you’re leaving knowledge on the table.
A well-structured transcript extraction for one meeting might produce:
- 1 archived transcript + 1 meeting summary
- 7+ feature idea notes
- 2 framework notes
- 4 philosophy additions
- 3 project status updates
- 20+ files created or modified
Why transcripts work better than writing: you naturally include reasoning paths, uncertainties, alternatives considered, and explanation depth — all the tacit context that never makes it into written docs.
The PARA parallel: the folder structure maps to Tiago Forte’s PARA system (Projects, Areas, Resources, Archives), repurposed for team knowledge with agent navigation in mind.
Context engineering: everything is defined in CLAUDE.md — vault philosophy, folder structure, navigation rules. Each folder has its own README for granular context. Without structure, you have a pile of transcripts. With structure, you have a knowledge system Claude can build on.
Article 3: Build Claude a Tool for Thought
Tweet 2015201046469943660 — “Build Claude a Tool for Thought”
The meta-move: use the vault system to research how humans built tools for thought, then apply those findings to agent architecture. The system builds itself a tool for thought.
Historical lineage: Llull’s rotating wheels, Bruno’s memory palaces, Luhmann’s Zettelkasten, Evergreen Notes, MOCs — all were tools to think with, not just store in. The shift here: the operator is now an agent, not a human.
Technical primitives:
- Graph database from markdown: files = nodes, wiki links = edges, YAML frontmatter = queryable metadata
- Hooks and subagents for automation and specialization
- grep/git/bash/MCP for tooling
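The "graph database from markdown" primitive is simple enough to sketch. The following is a minimal illustration (not the arscontexta implementation): a naive frontmatter reader plus a wikilink regex turn one note into a node with metadata and outgoing edges. The sample note text is hypothetical.

```python
import re

# Capture the link target, ignoring any |alias or #heading suffix.
WIKILINK = re.compile(r"\[\[([^\]|#]+)")

def parse_note(text):
    """Split a note into (frontmatter_dict, outgoing_links).

    Frontmatter parsing here is a naive key: value reader, not full YAML —
    enough to pull a description field for pre-read scanning.
    """
    meta, body = {}, text
    if text.startswith("---\n"):
        head, _, body = text[4:].partition("\n---\n")
        for line in head.splitlines():
            key, sep, value = line.partition(":")
            if sep:
                meta[key.strip()] = value.strip()
    links = [m.strip() for m in WIKILINK.findall(body)]
    return meta, links

note = """---
description: quality is the hard part of note systems
---
Good systems fail on [[quality is the hard part]], not on tooling
like [[graph databases from markdown|graph tools]].
"""
meta, links = parse_note(note)
print(meta["description"])  # scannable one-sentence summary
print(links)                # edges: note titles this file points at
```

Run over every file in the vault, this yields exactly the structure described: files as nodes, wikilinks as edges, frontmatter as queryable metadata.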
Discovery layer: every note has a YAML description field. Before loading any file, the agent grabs descriptions and decides if the content is worth the context budget. Most decisions can be made at description level without opening files — this is the key curation move.
Filenames as claims: before opening anything, the file tree already tells you what each note argues. “quality is the hard part” tells you more than “quality notes”.
The self-engineering loop: the system logs observations across sessions, reflects on learnings, and proposes changes to its own rules. Every rule starts as a hypothesis.
The Cornell Notes adaptation: Claude found the Cornell 5R framework while researching, adapted it for agents, and added a 6th phase for self-improvement. The system can request deep research to learn more about specific topics.
Article 4: Context Engineering (Progressive Disclosure)
Tweet 2015585363318743071 — “Obsidian & Claude Code 101: Context Engineering”
The core context engineering technique: progressive disclosure — force the agent to earn each level of detail before loading more. Four layers:
- File tree — injected at session start via hook. Descriptive filenames give first-impression signal without opening anything. “queries evolve during search so agents should checkpoint.md” > “search notes.md”.
- YAML descriptions — every note has a one-sentence description in frontmatter. If something looks interesting, query it with ripgrep before loading.
- Outline — if the description passes, check the note’s heading structure. Often only one section is needed; loading the full file adds noise.
- Full content — only for notes that passed all three filters. Most notes never get here, and that’s the point.
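The four-layer filter can be sketched as a small pipeline. This is an illustrative reduction, not heinrich's code: `is_relevant` is a stand-in predicate for however the agent judges signal (keyword match, embedding score, an LLM call), and the sample note is hypothetical.

```python
def outline(body):
    """Heading structure only — the third layer of the disclosure stack."""
    return [line.lstrip("# ").strip() for line in body.splitlines()
            if line.startswith("#")]

def progressive_load(description, body, is_relevant):
    """Earn each level of detail: description, then outline, then full content.

    Returns (deepest_layer_loaded, payload). Most notes should exit at the
    description layer without the file ever being opened.
    """
    if not is_relevant(description):
        return "description", None      # rejected cheaply
    heads = outline(body)
    if not any(is_relevant(h) for h in heads):
        return "outline", heads         # headings seen, full text skipped
    return "full", body                 # passed every filter

desc = "queries evolve during search so agents should checkpoint"
body = "# checkpointing\nSave state mid-search.\n# unrelated aside\nMisc."
layer, payload = progressive_load(desc, body, lambda s: "checkpoint" in s)
print(layer)  # -> "full"
```

The point of the structure is the early exits: a query about, say, zettelkasten history never advances past the description layer for this note, so its body never costs any context budget.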
The MCP parallel: this mirrors how Claude handles 50+ tools — tool specs are available but deferred until actually searched. Same structure: lazy loading, progressive commitment.
Implementation: a SessionStart hook that runs tree, YAML frontmatter with a description field, and CLAUDE.md instructions telling Claude to check descriptions before reading. Low-code, high-leverage.
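A sketch of what the tree-injection hook might look like in `.claude/settings.json` — this assumes Claude Code's hooks schema as currently documented, so verify the event name and fields against the official docs before relying on it:

```json
{
  "hooks": {
    "SessionStart": [
      {
        "hooks": [
          {
            "type": "command",
            "command": "tree -L 2 --dirsfirst -I '.git|.obsidian'"
          }
        ]
      }
    ]
  }
}
```

A SessionStart command hook's stdout is added to the session context, so the agent sees the file tree before the first prompt; the `-I` pattern keeps `.git` and Obsidian config noise out of the context budget.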
Article 5: Editing Workflow (Spatial Comments)
Tweet 2015909609999941965 — “Vibe Note-Taking 101: Editing Workflow”
The problem: editing long content with Claude Code requires constant copy-paste — pull text out, give it context, wait for edits, repeat. This breaks flow.
The spatial editing solution: leave {edit instructions} inline, embedded in the text where they apply. Position IS context. The agent knows what the comment refers to because of where it sits.
Workflow:
- Write draft without stopping
- Do a pass and drop {thoughts} wherever something needs work
- Run /edit
- Review changes — the command outputs a summary of what changed
If run with no file open, /edit searches the vault for all pending {thoughts} and lets you pick which files to edit — useful for cross-file consistency changes.
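The vault-wide scan is easy to picture. A minimal sketch of what such a pending-comments search might do (this is an illustration, not the actual /edit implementation — the `{...}` pattern and file layout are assumptions):

```python
import re
from pathlib import Path

# Inline edit comments: anything in single braces on one line.
COMMENT = re.compile(r"\{([^{}\n]+)\}")

def pending_edits(vault_dir):
    """Find every inline {comment} across markdown files, with position context.

    Position IS context, so each hit carries its file, line number, and the
    surrounding line — enough for an agent to know what the comment refers to.
    """
    hits = []
    for path in Path(vault_dir).rglob("*.md"):
        for lineno, line in enumerate(path.read_text().splitlines(), 1):
            for m in COMMENT.finditer(line):
                hits.append((str(path), lineno, m.group(1), line.strip()))
    return hits
```

Grouping the hits by file gives exactly the pick-list described: which files have pending edits, and what each comment asks for.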
Article 6: Async Hooks for Note History
Tweet 2016587691505164749 — “Obsidian & Claude Code: Async Hooks for Note History”
Auto-commit every edit to git using Claude Code’s async hooks, then add an interpretation layer that reads diffs conceptually, not just syntactically.
The insight: notes are living documents. The history of how a note changed is itself valuable — it’s a journal of how thinking evolved, written automatically.
Technical setup:
- Git repo in the vault (local, GitHub, or GitLab)
- A SessionStop or post-edit hook with async: true that commits changes silently in the background
- async: true is the key — without it, Claude waits for each commit to finish (annoying)
- A /note-history skill that reads diffs and explains conceptual changes, not just line-level deltas
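The commit step itself is a few lines. A minimal sketch of the script such a hook could invoke — the async: true wiring lives in Claude Code's hook settings, and this only covers the git side (commit message format is an assumption):

```python
import subprocess
import datetime

def autocommit(vault_dir):
    """Silently commit any pending vault changes; a no-op when the tree is clean.

    Returns True if a commit was made. Runs git via -C so the hook can live
    anywhere, and uses --porcelain for a stable dirty-check.
    """
    dirty = subprocess.run(
        ["git", "-C", vault_dir, "status", "--porcelain"],
        capture_output=True, text=True, check=True,
    ).stdout.strip()
    if not dirty:
        return False
    subprocess.run(["git", "-C", vault_dir, "add", "-A"], check=True)
    stamp = datetime.datetime.now().isoformat(timespec="seconds")
    subprocess.run(
        ["git", "-C", vault_dir, "commit", "-q", "-m", f"vault autocommit {stamp}"],
        check=True,
    )
    return True
```

Because it exits early on a clean tree, the hook can fire after every edit without producing empty commits.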
The outcome: every note has a complete, interpretable history. The vault becomes a timeline of how thinking evolved, reconstructable at any point.
Alignment with Our Setup
Where we’re doing the same thing:
| arscontexta | RDCO |
|---|---|
| Vault index for agent orientation | QMD hybrid search (BM25 + vector) over 561 docs |
| CLAUDE.md teaches vault philosophy | SOUL.md + project-level CLAUDE.md files |
| Every note is a skill (composable, injectable) | skills-as-building-blocks in ~/.claude/skills/ |
| Folder structure as navigation signal | rdco-vault/ directory convention (01-projects, 02-sops, etc.) |
| PARA for project knowledge structure | Same PARA influence in our folder architecture |
| MOCs as topic hubs with agent breadcrumbs | Index files per project directory |
| Recording + transcript mining → vault | Not yet systematic — currently ad-hoc |
Where we differ:
| Difference | Notes |
|---|---|
| arscontexta uses Obsidian as the agent’s IDE; we use it as human UI | We interface with the vault through Claude Code + QMD MCP, not direct Obsidian file access |
| File tree injected via SessionStart hook | We have QMD semantic search instead; worth considering whether a tree hook adds signal |
| Description-level filtering (YAML + grep before loading) | QMD abstracts this with its snippet/scoring layer — similar effect, different mechanism |
| Async auto-commit hook for note history | We don’t do this — see “steal” below |
| Spatial {comment} editing pattern | Not in our workflow — this one is directly applicable |
| Self-engineering loop (system researches tools for thought to improve itself) | Adjacent to compile-vault skill; not yet self-directed |
Where we’re ahead:
- QMD gives us vector + BM25 + LLM reranking, which is richer than ripgrep + tree for retrieval
- Discord + iMessage channels as live intake pipelines — arscontexta doesn’t mention this
- Skills ecosystem via plugins (27+ skills) — more mature than his /edit + /note-history examples
- Karpathy’s compounding loop already maps to our intake SOP
Ideas to Steal
High priority:
- Async auto-commit hook — every vault edit silently committed to git. Zero friction, complete history. This is a one-afternoon build. See 04-tooling/2026-03-29-infrastructure-decisions for git context.
- Spatial {comment} editing pattern — embed edit instructions inline, position as context. Would improve how we draft and iterate on reference docs and SOPs. Could build as a skill.
- YAML description field on every note — one-sentence summary in frontmatter, queryable before loading full content. Our QMD snippets serve a similar role but aren’t in the files themselves. Adding a description: frontmatter field would make docs more self-describing.
Medium priority:
- Transcript mining → vault — we record calls but don’t systematically extract them into structured notes. Heinrich’s extraction depth (20+ files per meeting) is the target. This connects directly to the compound engineering compounding step.
- Agent breadcrumbs on MOCs — have Claude leave navigation notes on index/topic pages during sessions. Builds session-persistent memory about what it learned while traversing the graph.
- Descriptive filenames as claim-titles — we mostly use descriptive names but not always as full claims. The discipline of “the filename IS the argument” forces clearer note design.
Low priority / already addressed:
- Progressive disclosure (file tree → description → outline → full content): QMD’s snippet + scoring layer handles this differently but achieves similar goals.
- Separate vault philosophies per domain: we use CLAUDE.md at project level to achieve similar effect without separate vaults.
Article 7: Skill Graphs > SKILL.md
Tweet 2023957499183829467 — “Skill Graphs > SKILL.md” — February 18, 2026 · 8,756 likes · 25,731 bookmarks · 4M impressions
The argument: single skill files are fine for simple tasks but real depth requires something structurally different. A skill for summarizing is one file. But a therapy skill that covers cognitive behavioral patterns, attachment theory, active listening, and emotional regulation frameworks can’t live in one file — the scope is too large and the interconnections too important.
Skill graphs are the answer: a network of skill files connected by wikilinks. Instead of one monolithic skill, many small composable pieces that reference each other. Each file is one complete thought, technique, or skill. Wikilinks between them create a traversable graph. The same skill discovery pattern applies recursively inside the graph itself.
The progressive disclosure stack: Index → descriptions → links → sections → full content
Most decisions happen before reading a single full file. Every node has YAML frontmatter with a description the agent can scan. Every wikilink carries meaning because it’s woven into prose — the agent follows relevant paths and skips what doesn’t matter.
The primitives (you already have them):
- Wikilinks written as prose in sentences, carrying semantic meaning not just reference
- YAML frontmatter with descriptions for pre-read scanning
- MOCs (Maps of Content) organizing clusters of related skills into navigable sub-topics
- Skill files that link to other skill files, which link to other skill files — the graph goes as deep as the domain requires
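These primitives compose into a traversal: start at an index node, scan each linked node's description, and only follow links that look relevant. A sketch under assumed data shapes (the note titles, descriptions, and relevance predicate are all hypothetical — `notes` maps title to `(description, body)`):

```python
import re
from collections import deque

WIKILINK = re.compile(r"\[\[([^\]|#]+)")

def traverse(notes, entry, follow, max_depth=2):
    """Walk a skill graph breadth-first from an entry point.

    `follow` is a stand-in relevance predicate applied to a node's
    description BEFORE its body is ever read — the description-level
    filtering the stack describes. Returns titles in visit order.
    """
    seen, order = {entry}, []
    queue = deque([(entry, 0)])
    while queue:
        title, depth = queue.popleft()
        desc, body = notes[title]
        order.append(title)
        if depth == max_depth:
            continue
        for target in (t.strip() for t in WIKILINK.findall(body)):
            if target in notes and target not in seen and follow(notes[target][0]):
                seen.add(target)
                queue.append((target, depth + 1))
    return order

notes = {
    "trading index": ("entry point", "Start with [[position sizing]] and [[market psychology]]."),
    "position sizing": ("how much to risk per trade", "Depends on [[risk management]]."),
    "market psychology": ("crowd behavior", ""),
    "risk management": ("capping downside", ""),
}
print(traverse(notes, "trading index", lambda d: "crowd" not in d))
# -> ['trading index', 'position sizing', 'risk management']
```

The rejected branch (market psychology) illustrates the payoff: its body never costs context, because the decision was made at description level.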
The arscontexta plugin itself is a skill graph — ~250 connected markdown files teaching an agent how to build a knowledge base. The files cover cognitive science, zettelkasten, graph theory, and agent architecture, each piece linking to others. One skill file couldn’t hold that. A graph can.
What this enables:
- A trading skill graph: risk management, market psychology, position sizing, technical analysis — each piece linked so context flows between concepts
- A legal skill graph: contract patterns, compliance requirements, jurisdiction specifics, precedent chains — all traversable from one entry point
- A company skill graph: org structure, product knowledge, processes, onboarding context, culture, competitive landscape
How to build one:
- Easy way: install the arscontexta Claude Code plugin, pick the research preset, point it at a topic, fill with /learn and /reduce
- Manual way: create an index file as an entry point (not a lookup table — an entry point that directs attention), individual node files as standalone methodology claims with wikilinks woven into prose, and MOCs for sub-topic organization when the graph grows
The evolution: individual skills are context engineering — curated knowledge injected where it matters. Skill graphs are the next step: instead of one injection, the agent navigates a knowledge structure, pulling in exactly what the current situation requires. The difference between an agent that follows instructions and an agent that understands a domain.
Alignment with our setup:
This directly extends what Article 1-6 established for vault design. The skill graph pattern is exactly how our ~/.claude/skills/ directory should be architected — individual skills that reference each other, navigated by the agent via description scanning before full file loading. Our current skills are mostly flat (one file per skill). Adding wikilink cross-references between related skills and an index with YAML descriptions per skill would convert the skills directory into a traversable skill graph.
The vault itself already functions as a knowledge graph (files = nodes, wikilinks = edges, YAML frontmatter = queryable metadata). The insight here is that the same architecture applies to the skills layer, not just the reference layer.
Ideas to steal:
- Build a skill graph for our data consulting domain: context engineering, analytics engineering, agent architecture, Snowflake/Cortex, dbt — each as connected skill nodes traversable from a central index
- Add a description: field to every skill’s YAML frontmatter so the agent can decide relevance without loading full content (the QMD snippet layer already does this for vault docs — bringing it to skills closes the loop)
- Create a MOC for the phData consulting skills cluster (Snowflake, Cortex AI, agent patterns, enterprise data) to give the agent a navigable entry point into that domain
Connections
- 06-reference/2026-04-01-karpathy-llm-knowledge-bases — same knowledge-as-flywheel pattern; arscontexta adds the agent-operated angle
- 06-reference/2026-04-04-compound-engineering — the “compound step” is exactly what arscontexta is building into the vault: every intake refines the system
- 06-reference/concepts/skills-as-building-blocks — arscontexta’s “every note is a skill” maps directly to our skills-as-building-blocks pattern
- 06-reference/2026-04-07-claude-code-architecture-teardown — arscontexta implements the SessionStart hooks and skills layers described in that teardown
- 04-tooling/2026-03-29-infrastructure-decisions — our current stack vs. his stack; key difference is QMD vs. ripgrep+tree
- 06-reference/2026-04-04-anthropic-skills-internally — his /edit and /note-history skills are amateur versions of what Anthropic builds internally; we have a richer skills foundation to build on