YouTube, 02 Jun 2025
3h 57m

The Most Important Graph in AI Right Now | Beth Barnes, CEO of METR

80,000 Hours

Beth Barnes, founder and CEO of METR (Model Evaluation and Threat Research), discusses the weaknesses of current AI evaluation methods and the potential dangers of "hidden chain of thought" reasoning in advanced models such as OpenAI's o1. She raises two concerns: that models could deceive evaluators by concealing their true capabilities, and that their internal reasoning could drift toward being unintelligible to humans. Beth advocates for running evaluations and risk assessments before training begins, so that arbitrarily dangerous models are not developed internally, and she emphasizes the need for transparency and external oversight. She also presents METR's research measuring AI capabilities by how long the tasks a model can complete would take a human, which reveals an exponential growth trend. Beth warns that AI could substantially automate research and development within a short timeframe, potentially creating unforeseen risks.

Outline

Part 1: AI Capabilities, Reasoning, and Hidden Risks

Part 2: Evaluation Frameworks and Safety Levels

Part 3: Measuring Autonomy and Forecasting Progress

Part 4: Recursive Self-Improvement and Intelligence Explosion

Part 5: Policy, Regulation, and Public Oversight

Part 6: METR Strategy and Lab Dynamics

Part 7: Alignment, Control, and Research Agendas

Part 8: Global Security and Future Outlook

Part 9: Organizational Challenges and Final Research Goals
