03 Nov 2025

How Zyphra went all-in on AMD + Why Devs feel faster with AI but are slower — with Quentin Anthony

Podcast cover

Latent Space: The AI Engineer Podcast

In this episode of the Latent Space Podcast, Alessio interviews Quentin Anthony, Head of Model Training at Zyphra and advisor at Eleuther AI, about Zyphra's work on foundation models for edge deployment and their recent move to AMD training clusters. Quentin shares insights on optimizing kernels for AMD GPUs, the role of open source in AMD development, and the use of coding agents. They discuss the METR study on AI's impact on software engineering productivity, Quentin's coding workflow with AI, and the challenges of evaluating AI-generated kernels. The conversation also covers Zyphra's model development roadmap, the potential of ASICs for inference, edge deployment strategies, and the future of open source AI with Eleuther AI.

Outlines

Part 1: Zyphra, AMD, and Open Source

Part 2: Kernel Development and Hardware Considerations

Part 3: Edge Deployment and Future Roadmap

Part 4: AI Coding Productivity and Workflow

Part 5: Team Building, Interviewing, and Open Source

Sign in to continue reading, translating and more.

Continue
 
mindmap screenshot
Preview
preview episode cover
How to Get Rich: Every EpisodeNaval