06-reference

the data warehouse toolkit

Thu Apr 02 2026 20:00:00 GMT-0400 (Eastern Daylight Time) ·book ·by Ralph Kimball, Margy Ross

The Data Warehouse Toolkit — Ralph Kimball & Margy Ross

Summary

The bible of dimensional modeling, still foundational decades later. Kimball’s framework defines how to structure data for analytical consumption. The highlights capture the timeless design principles. Core mental models:

  1. Two Worlds: Operational vs. Analytical. Operational systems are where you put data in — processing one transaction at a time, optimized for speed, reflecting current state only. Analytical (DW/BI) systems are where you get data out — processing hundreds of thousands of records at once, optimized for queries, demanding historical context. These are fundamentally different systems serving fundamentally different users.

  2. The Seven Requirements of a DW/BI System. (a) Make information easily accessible — intuitive and obvious to business users, not just developers. (b) Present information consistently — same labels mean same things across sources. (c) Adapt to change gracefully. (d) Present information in a timely way. (e) Be a secure bastion. (f) Serve as the authoritative foundation for decision-making. (g) The business community must accept and use it — an elegant unused system is a failure.

  3. The Recurring Data Themes. Three decades of the same complaints: “We collect tons of data but can’t access it.” “We spend meetings arguing about whose numbers are right instead of making decisions.” “Just show me what is important.” These are design problems, not technology problems.

  4. Core Dimensional Concepts. Conformed dimensions (shared across fact tables for consistent analysis), slowly changing dimensions (tracking historical changes to attributes), factless fact tables (recording events that have no measurable facts), and the enterprise data warehouse bus matrix (mapping business processes to dimensions). These concepts solve the recurring themes above.

Relevance

Open Questions