06-reference

book adwd ch4 modeling business processes 2026 04 13

Sun Apr 12 2026 20:00:00 GMT-0400 (Eastern Daylight Time) ·reference ·source: Agile Data Warehouse Design (book) ·by Lawrence Corr, Jim Stagnitto

Chapter 4: Modeling Business Processes — Corr & Stagnitto

Why this is in the vault

This chapter is the operational core of BEAM (Business Event Analysis & Modeling). It bridges the gap between modeling a single event table (Ch 2-3) and planning an entire data warehouse. The event matrix it introduces is the single most practical planning artifact for dimensional warehouse design — it forces conformance decisions early, exposes process sequences, and produces a prioritized product backlog. For Ray Data Co, every data modeling engagement should start with something like this matrix rather than jumping straight to dbt models.

Key concepts

Conformed dimensions

A conformed dimension is a single physical dimension shared by multiple fact tables, or separate dimensions that contain identical attributes with identical business meanings. They are what makes cross-process analysis possible. Without them, you get the silo data mart anti-pattern: each department gets its own mart, each defines PRODUCT or CUSTOMER slightly differently, and cross-department reporting becomes impossible. Rebuilding siloed marts is extremely expensive because you are refactoring historical data, not just code.

Three types exist at the attribute conformance level:

Conformance is fundamentally a political problem, not a technical one. Homonyms (same term, different meaning across departments) and synonyms (different terms, same meaning) are the usual blockers. Modelstorming with examples from both sides is the most effective way to surface and resolve these conflicts.

The data warehouse bus

Conformed dimensions define a data warehouse bus — an integration standard that lets independently built data marts plug in and interoperate, analogous to USB. The bus requires more upfront work (modeling multiple processes, confronting political conformance challenges, building MDM-aware ETL) but the payback is dramatically less technical debt and the ability for teams to work in parallel once the bus is defined. Local (non-conformed) dimensions are always permitted in addition to bus dimensions, never as replacements.

The event matrix

The event matrix is a grid with business processes as rows and dimensions as columns, tick marks at intersections. It is the central planning artifact for the bus. Two ways to read it:

The enhanced version (the BEAM event matrix) adds: event sequences with main clauses, dimensions ordered by 7W type (who/what/where/why/how), stakeholder group columns for ownership, and importance/estimate rows for Scrum-style prioritization. Events that create new dimension values get a star rather than a tick — a significant dependency signal.

Event sequences

Events on the matrix are ordered by value chain (increasing business value output), not alphabetically. This roughly maps to time sequence at a macro level. Within a value chain, stricter process sequences exist — chains of milestone events that must occur in order (e.g. purchase order, then delivery, then payment). These are denoted by indentation on the matrix.

Key discovery technique: ask “why does X happen?” — if the answer points to a prior event, you have found a process sequence linked by a conformed degenerate dimension (like ORDER ID appearing on both orders and shipments). Process sequences can be modeled as a single evolving event for process performance measurement (late deliveries, average fulfillment time).

Gaps in the matrix between events often signal missing business processes. Asking “does anything costly happen between X and Y?” reliably surfaces them.

Role-playing dimensions

When a new event introduces what looks like a new dimension (e.g. WAREHOUSE WORKER), check whether it is actually a role of an existing conformed dimension (EMPLOYEE). The W-type helps: if there are already two “who” dimensions, ask stakeholders whether the new one shares attributes. If yes, rename to the generalized form (EMPLOYEE [RP]) and document the specific role in the event detail using bracket notation ([employee]).

Generalization is powerful but carries BI usability risks. If stakeholders see no analytical benefit to combining entities, postpone generalization until star schema design — the modeling stage should produce dimensions stakeholders recognize and want to query.

The event rating game

A structured prioritization exercise for deciding what to build first:

This produces a Scrum-ready backlog directly from the modelstorming session. The product owner adjusts ratings later using source data profiling and ETL estimates.

Mapping against Ray Data Co