01-projects / graph-db-eval

vertex edge dictionary

Mon Apr 13 2026 20:00:00 GMT-0400 (Eastern Daylight Time) ·project ·status: active

Graph DB — Vertex and Edge Type Dictionary

Running data dictionary for the typed knowledge graph we’re prototyping on DuckDB. Maintained incrementally — add new types as they emerge from real query needs.

Vertex types (nodes)

TypePurposeExamples
DocumentVault entries (source of most edges)any 06-reference/*.md file
PersonPerson who wrote source materialBen Thompson, João Moura, Jaya Gupta, Joe Reis, Ben Wilson
PublicationNewsletter or outletStratechery, Every, Semi-Structured, Practical Data Modeling
TopicConcept/area the doc is about”harness thesis”, “data quality”, “state ownership”, “BEAM methodology”
FrameworkNamed framework we reference or buildMAC, BEAM, Scope×Basis, Seven Powers, Entangled Software
ProjectRDCO work streamMG Progress, Sanity Check, autoinv, MAC content series, Graph DB eval
SponsorEntity sponsoring contentAnthropic, OpenAI, Box, Estuary
OrganizationCompany/institutionphData, MG, Mammoth Growth, Anthropic, Foundation Capital
SkillRDCO skill fileprocess-newsletter, audit-model, postgrid, morning-prep
ClusterThematic grouping of docs”harness-thesis dissent”, “data quality sources”, “moat debate”
DecisionSignificant decision docphData vs MG, Mode A vs Mode B, SQL guardrail adoption
SourceOriginal external documentbook PDFs, podcast transcripts, X articles

Edge types (typed relations)

Semantic relations between documents

EdgeDirectionSemantics
validatesA → BA provides evidence supporting B’s claim
contradictsA → BA disputes B’s central thesis
disputes-claim-inA → BA disputes a specific claim in B (not whole doc)
refinesA → BA sharpens B’s framework without contradicting
extendsA → BA builds on B, takes further
synthesizesA → (B, C, D…)A combines multiple sources into a view
echoesA → BA independently reaches B’s conclusion
applies-framework-fromA → BA uses the framework defined in B
citesA → BA explicitly references B
mentions-in-passingA → BWeaker than cites

Personship and identity

EdgeDirectionSemantics
authored-byDocument → Person
published-inDocument → Publication
sponsored-byDocument → SponsorDisclosed sponsorship
affiliated-withPerson → Organization
role-ofPerson → Role (CEO, founder, analyst)

Topical

EdgeDirectionSemantics
about-topicDocument → TopicPrimary subject
mentions-entityDocument → EntityNamed entity reference
part-of-seriesDocument → Seriese.g., ADWD ch1 → “Agile Data Warehouse Design” book
part-of-clusterDocument → Clustere.g., Moura → “harness-thesis dissent”

Strategic / operational

EdgeDirectionSemantics
supports-positionDocument → RDCO positionEvidence for a strategic stance
informs-decisionDocument → DecisionInput that shaped a decision outcome
test-case-forCase study → FrameworkConcrete example of framework application
example-ofDocument → Pattern/ConceptIllustration of a pattern

Provenance

EdgeDirectionSemantics
generated-by-skillDocument → SkillWhich skill produced this vault entry
derived-from-sourceDocument → SourceWhich external doc this note is assessing
produced-by-projectDocument → ProjectWhich RDCO project created this artifact

Edge metadata (properties on edges)

Every edge should carry:

Query catalog (what this graph must answer)

These are the queries that justify the graph over QMD-only retrieval:

  1. Cascading contradictions — multi-hop contradiction chains
  2. Dissent cluster aggregation — which authors appear in ≥N dissent docs
  3. Source authority ranking — validates/contradicts edges received per author
  4. Framework adoption trajectory — which docs apply which framework, over time
  5. Positioning evidence compilation — docs that support-position a given stance
  6. Derivation path — shortest path between two conceptually linked docs
  7. Gap detection in a cluster — subtopics with insufficient sources
  8. Person citation network — inbound + outbound citation traversal
  9. Skill improvement provenance — findings → improvements → skill changes (two-hop)
  10. Decision-evidence audit — all docs that informed a given decision

Prototype scope (first 3 queries to prove out)

  1. Positioning evidence compilation (query #5) — “Find all vault entries that support the state-ownership architecture”
  2. Dissent cluster aggregation (query #2) — “Which authors appear in ≥3 dissent docs?”
  3. Decision-evidence audit (query #10) — “What evidence informed the phData vs MG decision?”

These three cover the most immediate operational use cases.

Versioning

This dictionary is iterative. When a query surfaces a need for a new edge type, add it here with the date. When an edge type turns out to be redundant or never-used after 2+ months, deprecate it here.

DateChangeReason
2026-04-14Initial dictionaryFounder asked for running data dictionary during graph DB eval scoping
2026-04-14Renamed AuthorPersonAuthor is a role relative to a doc. Person is the entity. Covers non-authors too (colleagues, clients, friend-of-founder contacts like the brother-in-law or vertical-farm friend). The authored-by edge still exists and connects Document → Person.