Moonshots EP 161: The Future of AI Avatars and DeepFake Technology
Summary
Peter Diamandis interviews Joshua Xu, founder/CEO of HeyGen, at what appears to be an Abundance 360 event, followed by a presentation from Steve Brown on agent training. HeyGen builds “human-centric video models” — from just two minutes of training footage, their AI learns facial muscle movements, expressions, and body language to create a photorealistic digital twin that can speak in 175 languages with the user’s own voice. A live demo shows an attendee’s avatar created in four minutes the night before, speaking fluently in multiple languages.
Xu frames HeyGen’s mission as “visual storytelling for all” — less than 1% of people can create great video content today, and HeyGen aims to make video as easy as sending an email. He envisions every email being replaceable with a personalized video. On trust and safety, Xu emphasizes that every avatar on HeyGen requires first-party video consent (the person must appear on camera saying “I am [name] and I consent”), and they have never compromised on this requirement. A moderation team combining AI and human reviewers blocks hate speech, fraud, and political campaign content.
The avatar vs. agent distinction gets clarified: avatars are the audiovisual layer (AV = avatar), while agents provide the intelligence and agency behind them. The future is “agentizing yourself” — having three meetings simultaneously through digital twins, or building a team of 20 marketing agents that show up on Slack, Zoom, and email with unique appearances. Steve Brown’s segment covers training agents with life story, values, purpose, and personal anecdotes — he built agents specifically designed to interview users and build the biographical corpus needed to make a meaningful digital clone. He also demos a medical agent connected to his own health records. Diamandis advises the audience to record elderly family members’ stories now for future avatar recreation, and to establish a family password against deepfake phone/video scams.
Key Segments
- [00:00-10:00] HeyGen introduction, Joshua Xu’s journey from Snap to founding HeyGen in 2020
- [10:00-16:00] Avatar creation demo (2 min training footage, 175 languages), live attendee demo
- [16:00-22:00] Trust and safety (first-party consent, moderation), deepfake defense (family password)
- [22:00-28:00] Avatar vs agent distinction, “agentizing yourself,” Steve Brown on training agent corpus
Notable Claims
- HeyGen creates photorealistic avatars from just 2 minutes of training footage
- Supports 175 languages with voice cloning
- Less than 1% of people can currently create great video content
- HeyGen has never compromised on first-party consent in its entire history
- Steve Brown predicts the next 12 months will be “the year of agents” after 2024 being “the year of avatars”
Bias/Sponsor Notes
This is a live product demo/interview at an Abundance event — essentially a promotional segment for HeyGen. Diamandis is clearly an enthusiast/partner. Extended mid-episode ad reads for Fountain Life, Viome, and OneSkin. No critical examination of deepfake risks beyond the family password suggestion.