Moonshots EP 209: What Everyone Missed About Gemini 3
Summary
The panel dissects Google’s Gemini 3 release, which Alex calls the biggest model release since OpenAI’s O3. Key highlights: Gemini 3 scores nearly 50% on Humanity’s Last Exam, integrates agentically across all Google Workspace products, and exhibits “big model smell” — capabilities that can’t be replicated through extended reasoning alone. Alex one-shotted a cyberpunk FPS game with a 140-character prompt. The VendingBench Arena benchmark steals the show: AI agents manage a simulated vending machine economy with email, banking, and inventory tools, and Gemini 3 delivers nearly 3,000% more profit than GPT-5 or Claude Sonnet. The panel argues this proves AI can function as first-class economic actors, with Dave noting the internet advertising business ($300B/year) is already fully non-human. Google’s Anti-Gravity IDE (staffed by former Windsurf team members) and Gemini Live’s improved natural voice also get coverage. Duolingo is down nearly 50% over the past year as AI translation improves.
Key Segments
- [00:00-03:00] Setup — the difficulty of keeping pace with weekly breakthroughs
- [04:00-11:00] Gemini 3 deep-dive — Workspace integration, big model smell, MIT campus 3D rendering, Anti-Gravity IDE
- [12:00-13:00] Gemini 3 as biggest release since O3; GPT-5 reframed as “O2.1”
- [14:00-20:00] VendingBench Arena — AI agents as autonomous economic actors; 3,000% profit margin lead; zero-employee companies
- [21:00-26:00] One-shot game generation; Gemini Live voice quality leapfrogging OpenAI; competitive pressure dynamics
Notable Claims
- Gemini 3 delivers ~3,000% more profit than GPT-5 or Claude Sonnet on VendingBench Arena
- Internet advertising is already a $300B/year fully non-human economy
- Duolingo stock down nearly 50% in past year from AI translation competition
- Former Windsurf team members joined Google DeepMind to build Anti-Gravity IDE
- 6 million Americans now work full-time as influencers, enabled by camera phones
Guests
- Salim Ismail — Founder of OpenExO
- Dave Blundin — Co-host
- Alexander Wissner-Gross — Computer scientist, founder of Reified
RDCO Mapping
- Sanity Check angle: VendingBench as a proxy for autonomous AI businesses — concrete, measurable, vivid
- Data point: The 3,000% profit margin differential is a powerful stat for illustrating model capability gaps in business terms
- Vault cross-ref: Connects to AI agents, demonetization, zero-employee companies, and competitive dynamics threads