06-reference

practical data modeling mma ch8 grain

Mon Mar 16 2026 20:00:00 GMT-0400 (Eastern Daylight Time) ·reference ·source: Practical Data Modeling (Substack) ·by Joe Reis
data-modelinggraingranularityfan-outaggregationmixed-model-artschapter-8

Grain: Getting the Level Right (Ch 8)

Chapter 8 on grain — the fundamental level of detail captured in a dataset. Core question: what does one row represent?

Key rules:

Common failure modes:

Grain across five camps: Relational (atomic row-level), Analytical (aggregated to answerable level), Application (event or document), ML/AI (feature grain per entity per snapshot date), Knowledge (the triple as atomic unit).

Introduces “grain routing” for AI agents: LLMs need a semantic layer that tags each dataset with its grain so the agent knows where to route queries before touching data.

The Four Questions for grain decisions and a Grain Audit Checklist for production deployment.

RDCO relevance

Grain is the #1 source of bugs in dbt projects. The fan-out section and audit checklist are directly deployable in client code reviews. The “grain routing” concept for AI agents is forward-looking and connects to semantic layer work.