05 Dec 2025
1h 1m

What Comes After ChatGPT? The Mother of ImageNet Predicts The Future

The a16z Show

In this episode of the Latent Space Podcast, Fei-Fei Li and Justin Johnson of WorldLabs discuss spatial intelligence and their new model, Marble, which generates explorable 3D worlds from text or images. They explore the differences between spatial and language intelligence, the importance of structure in 3D world modeling, and the role of open science versus proprietary models in AI development. They also touch on the challenges of resourcing academic AI research, the potential of physics engines to enhance world models, and the shift in academia's role in AI toward wacky ideas and theoretical underpinnings. The conversation then turns to Marble's capabilities and potential use cases, including gaming, VFX, film, robotics training, and interior design, and to the future of world models and the integration of physics and dynamics.

Outlines

Part 1: Genesis of WorldLabs and Marble

Part 2: Vision-Language Modeling and Real-Time Captioning

Part 3: World Models and Spatial Intelligence

Part 4: Conclusion and Future
