YouTube08 Dec 2023
1h 18m

Stanford CS25: V3 I Low-level Embodied Intelligence w/ Foundation Models

Podcast cover

Stanford Online

This episode explores the application of foundation models to low-level embodied intelligence in robotics. Against the backdrop of current AI's limitations in interacting with complex real-world environments, Fei Xia, a Senior Research Scientist at Google DeepMind, discusses the challenges and progress in building robots capable of nuanced physical interactions. More significantly, the conversation highlights the development of models like PALM-E and RT-2, which integrate high-level planning with low-level control, leveraging large language models and extensive datasets of robot interactions. For instance, RT-2 demonstrates the ability to perform complex tasks like sorting objects and responding to unexpected changes in the environment, showcasing the potential of foundation models to accelerate robotics development. The discussion also delves into the challenges of data scarcity in robotics and explores alternative approaches, such as using reward functions as an interface between language models and robot control, enabling more flexible and adaptable robot behavior. In contrast to traditional methods, this approach allows for more intuitive instruction and potentially greater robustness. What this means for the future of robotics is a significant shift towards more capable and adaptable robots, potentially leading to widespread adoption of robots in various real-world applications.

Outlines

Part 1: Introduction and Challenges

Part 2: Foundation Models in Robotics

Part 3: Interfaces and Applications

Part 4: Q&A and Future Directions

Sign in to continue reading, translating and more.

Continue
 
mindmap screenshot
Preview
preview episode cover
How to Get Rich: Every EpisodeNaval