Data Products Aren’t All the Same: 5 Critical Examples — Eric Weber
Summary
Weber taxonomizes data products into five distinct categories, each requiring different skills and contexts:
- Experimentation platform — experimental design, cohort allocation, logging, guardrails metrics, readouts
- ML platform — model building, serving, management, updating, monitoring
- Product metrics — fewer, more streamlined metrics treated as products (precursor to ML and experimentation)
- Product analytics — leveraging data to form hypotheses about user/customer behavior
- Data quality — foundational; every upstream product depends on reliable, documented data
The key insight: sequence your data product investments. No one can win in every domain at once. Data quality is the foundation; metrics come next; analytics and ML build on top.
Relevance
Directly relevant to 01-projects/data-marketplace/index — the marketplace needs to decide which categories of data products to serve first. Weber’s taxonomy is a lens for segmenting the offering.
The hierarchy (quality → metrics → analytics → ML → experimentation) maps to 06-reference/2026-03-31-block-hierarchy-to-intelligence — you can’t build intelligent systems without the foundational layers. Also connects to 06-reference/2026-04-03-feature-stores-hierarchy — feature stores occupy the ML platform layer and have their own internal hierarchy.
For 01-projects/phdata/index, this taxonomy helps scope consulting engagements: are we building a data quality product or an ML platform? Different skills, different timelines, different stakeholders.
The “sequence your investments” advice aligns with SOUL.md principles around focused execution over breadth.
Open Questions
- Where does the “metrics layer” / headless BI (see 06-reference/2026-04-03-headless-bi) fit in this taxonomy? Is it a subset of “product metrics”?
- For the data marketplace, should we start with data quality products (broadest demand) or analytics products (highest perceived value)?
- How does 06-reference/concepts/compounding-knowledge apply — does investing in data quality compound faster than investing in ML platforms?