YouTube18 Aug 2025
32m

Muon Optimizer for Dense Linear Layer Explained | Newton-Schulz + Momentum

Podcast cover

Deep Learning with Yacine

Deep Learning with Yacine - Muon Optimizer for Dense Linear Layer Explained | Newton-Schulz + Momentum

Sign in to continue reading, translating and more.

Continue
 
mindmap screenshot
Preview
preview episode cover
How to Get Rich: Every EpisodeNaval