"5 Key Predictions for the Data Industry in 2026" — @SeattleDataGuy

⚠️ Sponsorship / commercial placements — flagged (two, not one)

Estuary pre-amble at the top. Same disclosed-adviser relationship as the Jan 14 article. Quote: "Estuary, a platform I've used to help make clients' data workflows easier and am an adviser for."
Self-consulting plug mid-article — SDG inserts "if your data team needs help revamping your data infrastructure… reach out for a consultation!" with a link to his consulting services as a "great segway" off prediction #3.

Neither is opaque — both are labeled — but note for the skill: the second pattern (embedded self-consulting CTA inside the body) is different from the curation-section self-promo we've seen in prior issues. Skill should flag both.

The five predictions

Microsoft Fabric will rebrand again. Cites the "Databricks from Temu" meme, predicts Microsoft relaunches the Fabric stack under an AI-first name given historical pattern.
The "AI gap" is real — 1% chasing LLMs, 99% still emailing Excel files via SFTP. Pitches a wish-list product: an AI tool that ingests a spreadsheet and auto-generates a replacement data pipeline. Argues Excel captures business logic and shouldn't be fought.
Modern data stacks will be shaken. Acquisition consolidation + pricing shifts + sunset risks drive a rebuild cycle. Frame it as "AI-foundations" to get it approved.
AI POCs start crystallizing into patterns. The most useful section — a generic hype-cycle model (see below).
Snowflake will rediscover itself. Gut-call editorial: Snowflake is strategically unclear vs Databricks' committed all-in-one identity. References the Playing to Win strategy framework (Lafley/Martin) as the lens.

The useful takeaway — the AI hype cycle stages (prediction #4)

This is the part worth lifting. SDG's compressed version of the tech hype cycle:

New capability appears — magical early demos, real constraints not understood.
Everyone builds the obvious thing first — for LLMs: chatbots, "ask your data anything," copilots.
Reality sets in — hallucinations, cost blowups, governance, safety, edge-case failures.
Patterns start to crystallize — teams that kept iterating produce replicable recipes.
Becomes a standard — integrated invisibly into workflows.
Hype fades; capability settles into its real strength — narrative shifts from "can solve everything" to "here's what it actually solves."

Mapping against Ray Data Co

Where we sit on the AI-agent hype cycle: somewhere between stages 3 and 4. We've felt the reality constraints (context limits, model cost, tool-call overhead), we've built discipline around them (CLAUDE.md, SOUL.md, auto-memory, subagent delegation, vault-as-state), and we're actively refining them. We are not building "obvious first things" anymore — chatbots, "ask your data" — we're building the working-agent-COO pattern, which is one of the crystallizing patterns the article anticipates.
"Excel captures business logic" is an important lens for any future RDCO data product. If we ever build a product for a data team, the moat isn't replacing Excel; it's making Excel portable into the rest of the stack. File this as a business design prior.
Snowflake vs Databricks identity-clarity. Not directly relevant to our stack (we use neither), but the Playing to Win frame — "pick where you play and how you'll win, commit" — is the same discipline we're using for the 5-agent architecture target. A useful cross-reference for strategy conversations.
Predictions #1, #3, #5 are vendor-watching commentary — interesting but not actionable for us unless we onboard one of those platforms.

Curation section — notes

"What It Actually Takes to Build a Data Pipeline System" — SDG's own prior article again (third time linking it this month). Already filed as [[2026-01-14-seattle-data-guy-build-a-pipeline-system]]. Pattern noted: newer newsletters cross-promote recent series entries for 2-3 issues.
"How Uber Scaled Data Replication to Move Petabytes Every Day" — real third-party (Uber blog). Hive Sync + Apache Hadoop Distcp optimization at 350 PB scale. Not relevant to us; we operate at trivial scale.

[[2026-01-05-seattle-data-guy-data-pipeline-patterns]]
[[2026-01-14-seattle-data-guy-build-a-pipeline-system]]
[[2026-01-23-seattle-data-guy-analytical-skills]]
[[../01-projects/automated-investing/architecture-vision]] — our own "commit to where we play" pick for 5-agent target