28 Oct 2025
54m

Google DeepMind Developers: How Nano Banana Was Made

Podcast cover

The a16z Show

In this episode of the A16Z podcast, Oliver Wang and Nicole Brichtova from Google DeepMind discuss Gemini 2.5 image, also known as Nano Banana. They delve into the model's architecture, its integration of image generation and editing within Gemini's multimodal framework, and the challenges of achieving character consistency, compositional control, and conversational editing at scale. They also touch on open questions and model evaluation, safety and latency optimization, and how visual reasoning connects to broader advances in multimodal systems. The conversation explores the potential impact of AI on creative arts, the evolution of user interfaces, and the future of image representation, as well as the balance between control and intent in AI-driven art creation.

Outlines

Part 1: Introduction and Vision

Part 2: Development, Diversity, and Modality

Part 3: Evaluation, Data, and Interfaces

Part 4: Capabilities, Reasoning, and Taste

Sign in to continue reading, translating and more.

Continue
 
mindmap screenshot
Preview
preview episode cover
How to Get Rich: Every EpisodeNaval