18 Dec 2025
1h 15m

SAM 3: The Eyes for AI — Nikhila & Pengchuan (Meta Superintelligence), ft. Joseph Nelson (Roboflow)

Podcast cover

Latent Space: The AI Engineer Podcast

This podcast episode features a discussion about the launch of SAM 3, a new model for segmenting and tracking objects in images and videos using concept prompts. The speakers, including researchers from Meta and the co-founder of Roboflow, discuss the model's capabilities, architecture, and data engine, as well as its potential applications in various fields such as robotics, medical imaging, and video editing. They also explore the integration of SAM 3 with large language models (LLMs) and its role in the broader AI ecosystem, emphasizing the importance of open-source contributions and community feedback for future development. The conversation touches on the challenges of video annotation, the need for efficient models, and the goal of achieving human-level performance in computer vision tasks.

Outlines

Part 1: Introduction and SAM 3 Overview

Part 2: Technical Deep Dive

Part 3: Future Directions and Community Engagement

Sign in to continue reading, translating and more.

Continue
 
mindmap screenshot
Preview
preview episode cover
How to Get Rich: Every EpisodeNaval