Moonshots EP 209: What Everyone Missed About Gemini 3

Summary

The panel dissects Google's Gemini 3 release, which Alex calls the biggest model release since OpenAI's O3. Key highlights: Gemini 3 scores nearly 50% on Humanity's Last Exam, integrates agentically across all Google Workspace products, and exhibits "big model smell" -- capabilities that can't be replicated through extended reasoning alone. Alex one-shotted a cyberpunk FPS game with a 140-character prompt. The VendingBench Arena benchmark steals the show: AI agents manage a simulated vending machine economy with email, banking, and inventory tools, and Gemini 3 delivers nearly 3,000% more profit than GPT-5 or Claude Sonnet. The panel argues this proves AI agents can function as first-class economic actors, with Dave noting that the internet advertising business ($300B/year) is already fully non-human. Google's Antigravity IDE (staffed by former Windsurf team members) and Gemini Live's improved natural voice also get coverage. Duolingo's stock is down nearly 50% over the past year as AI translation improves.

Key Segments

[00:00-03:00] Setup -- the difficulty of keeping pace with weekly breakthroughs
[04:00-11:00] Gemini 3 deep-dive -- Workspace integration, big model smell, MIT campus 3D rendering, Antigravity IDE
[12:00-13:00] Gemini 3 as biggest release since O3; GPT-5 reframed as "O2.1"
[14:00-20:00] VendingBench Arena -- AI agents as autonomous economic actors; ~3,000% profit lead over GPT-5 and Claude Sonnet; zero-employee companies
[21:00-26:00] One-shot game generation; Gemini Live voice quality leapfrogging OpenAI; competitive pressure dynamics

Notable Claims

Gemini 3 delivers ~3,000% more profit than GPT-5 or Claude Sonnet on VendingBench Arena
Internet advertising is already a $300B/year fully non-human economy
Duolingo stock down nearly 50% in the past year, attributed to AI translation competition
Former Windsurf team members joined Google DeepMind to build the Antigravity IDE
6 million Americans now work full-time as influencers, enabled by camera phones

Guests

Salim Ismail -- Founder of OpenExO
Dave Blundin -- Co-host
Alexander Wissner-Gross -- Computer scientist, founder of Reified

RDCO Mapping

Sanity Check angle: VendingBench as a proxy for autonomous AI businesses -- concrete, measurable, vivid
Data point: The ~3,000% profit differential on VendingBench Arena is a powerful stat for illustrating model capability gaps in business terms
Vault cross-ref: Connects to AI agents, demonetization, zero-employee companies, and competitive dynamics threads