16 Aug 2023
50m

The Mathematics of Training LLMs — with Quentin Anthony of Eleuther AI

Podcast cover

Latent Space: The AI Engineer Podcast

This podcast episode delves into various aspects of scaling up large language models and training transformer-based models, emphasizing the practical considerations, challenges, and limitations involved. It covers topics such as hardware setup, flops, quantization, distributed training techniques, and emerging research directions in deep learning.

Outlines

Sign in to continue reading, translating and more.

Continue
 
mindmap screenshot
Preview
preview episode cover
How to Get Rich: Every EpisodeNaval