06-reference

analytics engineering roundup jagged frontier dispatch

Sat Apr 25 2026 20:00:00 GMT-0400 (Eastern Daylight Time) ·reference ·source: Analytics Engineering Roundup (Substack) ·by Jason Ganz (dbt Labs)

“A Dispatch from the Jagged Frontier of Analytics Engineering” — Jason Ganz

Why this is in the vault

Direct hands-on field report from inside dbt Labs about exactly where coding agents succeed and fail in real analytics-engineering work — the most concrete “what is actually happening on the ground in April 2026” data point AER has filed in months. This is also the first vault doc that explicitly imports Ethan Mollick’s “jagged frontier” framing into the analytics-engineering domain — which makes it a primary reference for any Sanity Check piece, MAC marketing copy, or phData conversation that needs to talk about where AI helps an AE and where it ships clean SQL with wrong joins.

The core argument

Ganz applies Mollick’s jagged frontier — the observation that LLM capability is unevenly distributed across tasks and the demarcation line is invisible because output looks confident on both sides — to analytics engineering specifically. The shape of the AE frontier is different from the SWE frontier: the peaks and troughs sit in different places.

Where today’s models excel (peaks):

Where today’s models fail (troughs):

The load-bearing line: “The SQL being correct is just one part of the data being correct. Agents are extremely good at the first axis and variable at the second, and the gap between ‘the code runs’ and ‘the data is right’ is where data incidents tend to live.”

Where AEs go next — shift left, shift right. Benoit’s framing: the left part of the DAG requires more knowledge of the company’s data systems (the tacit-knowledge work agents can’t do); the right part requires more business understanding (what should ARR be, when is a user “active”). The middle compresses. AEs add value at the edges.

Six-month wishlist: agents that (1) check their own assumptions about data — or surface them to a human — before acting, (2) know which tool calls are expensive and ask first, (3) connect to more of the data stack with lower friction.

Mapping against Ray Data Co