YouTube30 Oct 2025

The AGI race isn't a coordination failure | Holden Karnofsky (Anthropic)

Podcast cover

80,000 Hours

Holden Karnofsky discusses the risks and potential benefits of advanced AI, emphasizing the need for safety measures and responsible development. He argues against the notion of a coordination problem in AI development, suggesting that many players are not interested in slowing down. Karnofsky explores scenarios of AI takeover, highlighting the importance of monitoring AI behavior and creating incentives for alignment. He advocates for a focus on "well-scoped object-level work" and pragmatic solutions, drawing parallels to animal welfare advocacy. The conversation also covers responsible scaling policies, model welfare, and the complexities of AI governance, with Karnofsky expressing concerns about power grabs and the potential for misuse. He stresses the importance of transparency, security, and international cooperation in navigating the challenges of AGI.

Outlines

Part 1: AI Risks, Safety, and Takeover Scenarios

Part 2: Anthropic’s Role and Safety Strategies

Part 3: Scaling Policies and Technical Safety Frameworks

Part 4: Threat Vectors: Cyber, Persuasion, and R&D

Part 5: Preventing Power Grabs and Ensuring Integrity

Part 6: Defense Mechanisms and Strategic Mitigations

Part 7: Governance, Culture, and Future Outlook

Part 8: Security Shifts and Personal Impact

Sign in to continue reading, translating and more.

Continue
 
mindmap screenshot
Preview
preview episode cover
How to Get Rich: Every EpisodeNaval