YouTube23 Apr 2025
15m

Ensure AI Agents Work: Evaluation Frameworks for Scaling Success — Aparna Dhinkaran, CEO Arize

Podcast cover

AI Engineer

Aparna Dhinkaran, one of the founders of Arise, discusses the importance of evaluating AI agents and assistants, especially as they move into production and multimodal applications like voice. She breaks down the components of an agent—router, skills, and memory—explaining how each functions and can be evaluated. Using examples, including the Priceline PennyBot and her own company's co-pilot, she emphasizes the need for evaluations at every level of the agent's operation, including the audio component in voice applications, to ensure accuracy, efficiency, and the correct execution of skills.

Outlines

Sign in to continue reading, translating and more.

Continue
 
mindmap screenshot
Preview
preview episode cover
How to Get Rich: Every EpisodeNaval