11 Jan 2024
1h 25m

RLHF 201 - with Nathan Lambert of AI2 and Interconnects

Latent Space: The AI Engineer Podcast

Reinforcement Learning from Human Feedback (RLHF) is a technique for training language models in which human preferences guide the reinforcement learning process. It raises several challenges, including data collection, reward optimization, and preference aggregation. RLHF has potential applications in language model fine-tuning, decision-making, and dialogue system development.
