07-source-material

readme

Wed Apr 29 2026 20:00:00 GMT-0400 (Eastern Daylight Time) ·project ·status: active

07-source-material/ — RDCO bookshelf

The canonical home for raw source material that grounds Ray’s reasoning + the founder’s research.

This directory is the answer to the founder’s 2026-04-30 question: “Where is our ‘bookshelf’ if we need to go back to the source to rediscover the details for a specific situation?”

See ../06-reference/2026-04-30-bookshelf-source-material-architecture-gap for the architectural concept and ../06-reference/2026-04-30-bookshelf-discovery-source-material-inventory for the discovery inventory that produced this initial migration.

Architecture

07-source-material/
├── books/                      ← canonical books (1 dir per title)
│   └── <slug>/
│       ├── source.pdf          (or source.epub, source.txt, source.docx)
│       ├── extracted.txt       (or extracted.md — text version for retrieval)
│       └── metadata.yaml       (title, author, ISBN, source, format, copyright)
├── masterclass/                ← purchased course materials (multi-file)
│   └── <slug>/
│       └── (mixed PDFs + xlsx + docx)
├── articles/                   ← long-form articles saved raw
├── papers/                     ← arxiv / academic papers
├── transcripts/                ← video/audio transcripts
│   └── youtube-and-podcast/    (symlink → 06-reference/transcripts)
├── newsletter-bodies/          ← raw newsletter HTML/text (extension to /process-newsletter)
└── web-archives/               ← saved web articles + blog posts
    └── <author-slug>-<title-slug>/
        ├── source.html         (or source.md if natively markdown)
        ├── extracted.md
        └── metadata.yaml

Per-source slugging

Per-source metadata.yaml schema

title: "Understanding Variation: The Key to Managing Chaos"
authors: ["Donald J. Wheeler"]
publisher: "SPC Press"
edition: "2nd"
year: 1993
isbn: "9780945320807"
pages: 161
source_format: "VitalSource ebook → screencap+OCR text extraction"
acquired: "2026-04-21"
acquired_by: "founder (purchased SPC Press → VitalSource)"
copyright: "Personal-use only. NEVER quote at length. NEVER index in QMD without explicit founder OK."
extraction_quality_notes: "Italics sometimes render with stray slashes. Page-number footers reliable."
canonical_citation_format: "Wheeler, *Understanding Variation*, Ch. N, p. M"

Retrieval patterns

Pattern 1 — grep for a concept

grep -n "operational definition" ~/rdco-vault/07-source-material/books/wheeler-understanding-spc/extracted.txt | head -5

Returns line numbers + matching lines. Use Read with offset/limit on those line numbers to fetch the surrounding context.

Pattern 2 — citation chain

When making a strong claim that benefits from grounding:

  1. grep the relevant slug in 07-source-material/books/
  2. Read the matching passage with surrounding context
  3. Cite as: Wheeler, *Understanding SPC*, Ch. N, p. M ([07-source-material/books/wheeler-understanding-spc/extracted.txt:LINE](path))

Pattern 3 — Phase-2 (deferred): semantic search via separate QMD collection

Pending founder green-light: ingest 07-source-material/ into a SECOND QMD collection named source-material (distinct from the rdco-vault collection). Query via mcp__qmd__query with collection: "source-material". Keeps synthesis search and source search independent.

Current inventory (2026-04-30 initial migration)

books/ (17 total, ~488MB)

Founder-purchased / pre-existing:

Free graduate ML curation (sub-agent download 2026-04-30, per ../06-reference/2026-04-30-dami-defi-12-graduate-ml-textbooks-curation):

Free curation result: 10/12 success, 1 partial (Goodfellow front-matter only), 1 failed (Bellemare). ~408MB added.

masterclass/

web-archives/

transcripts/

Pending acquisitions (per ../06-reference/2026-04-30-bookshelf-discovery-source-material-inventory decision queue)

Free downloads (queued, sub-agent dispatched 2026-04-30):

Web archive scrape candidates (deferred, founder call):

Skill integrations (Phase 2)

Migration history