Practical Evaluation Strategies for AI Agents

Hosted by Hamza Farooq and Gabriela de Queiroz

Thu, Jan 22, 2026

5:00 PM UTC (45 minutes)

Virtual (Zoom)

Free to join

52 students

What you'll learn

Identify what “good” looks like for an AI agent

Define success metrics for agent reasoning, actions, and outcomes, beyond simple accuracy.

Design practical evals for agent workflows

Build task-based, behavioral, and regression-style evaluations that reflect real-world usage.

Use evals to iterate and improve agent systems

Apply evaluation results to debug failures, compare agent versions, and confidently ship changes.
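As a rough illustration of the three points above, here is a minimal sketch of a task-based eval suite with a regression check between two agent versions. It is not material from the lesson: the types `AgentResult` and `EvalCase`, the toy agents `agent_v1` and `agent_v2`, the tool names, and the three checks (outcome, actions, efficiency) are all hypothetical, chosen only to show how metrics beyond answer accuracy might be scored and compared.

```python
from dataclasses import dataclass
from typing import Callable


# Hypothetical types for illustration; none of these names come from the lesson.
@dataclass
class AgentResult:
    answer: str
    tool_calls: list[str]  # names of the tools the agent invoked, in order


@dataclass
class EvalCase:
    name: str
    prompt: str
    expected_answer: str       # substring the final answer should contain
    required_tools: list[str]  # tools the agent is expected to call
    max_tool_calls: int        # budget for the number of tool calls


Agent = Callable[[str], AgentResult]


def score_case(agent: Agent, case: EvalCase) -> dict[str, bool]:
    """Score one task on outcome, actions, and efficiency, not just accuracy."""
    result = agent(case.prompt)
    return {
        "outcome": case.expected_answer.lower() in result.answer.lower(),
        "actions": all(t in result.tool_calls for t in case.required_tools),
        "efficiency": len(result.tool_calls) <= case.max_tool_calls,
    }


def run_suite(agent: Agent, cases: list[EvalCase]) -> dict[str, dict[str, bool]]:
    """Run every case and return the per-check results keyed by case name."""
    return {case.name: score_case(agent, case) for case in cases}


def find_regressions(baseline, candidate) -> list[tuple[str, str]]:
    """Return (case, check) pairs that pass on the baseline but fail on the candidate."""
    return [
        (name, check)
        for name, checks in baseline.items()
        for check, passed in checks.items()
        if passed and not candidate[name][check]
    ]


# Toy stand-ins for two agent versions (assumed behavior, not real agents).
def agent_v1(prompt: str) -> AgentResult:
    return AgentResult("The refund was issued for order 123.",
                       ["lookup_order", "issue_refund"])


def agent_v2(prompt: str) -> AgentResult:
    # Same final answer, but this version skips the order lookup.
    return AgentResult("The refund was issued for order 123.", ["issue_refund"])


CASES = [
    EvalCase(
        name="refund_happy_path",
        prompt="Refund order 123",
        expected_answer="refund was issued",
        required_tools=["lookup_order", "issue_refund"],
        max_tool_calls=3,
    )
]

baseline = run_suite(agent_v1, CASES)
candidate = run_suite(agent_v2, CASES)
regressions = find_regressions(baseline, candidate)
print(regressions)
```

In this toy setup both versions return the same answer, so an accuracy-only check would treat them as equivalent; the behavioral "actions" check is what flags the second version for skipping the lookup step.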

Why this topic matters

AI agents often appear to work until they meet real users, edge cases, and scale. Without proper evaluation, teams ship systems they can neither trust nor improve. Evals turn agents from impressive demos into reliable products by making their behavior measurable, debuggable, and safe to deploy in production.

You'll learn from

Hamza Farooq

Founder | Ex-Google | Adjunct UCLA & UMN, SCU | Venture Partner

I am a founder by day and a professor by night. My work centers on LLMs and multimodal systems.

My startup, traversaal.ai, was built with one vision: to provide scalable AI products for startups and enterprises that integrate seamlessly into their existing ecosystems while remaining customizable and cost-efficient.

Gabriela de Queiroz

Ex-Microsoft & IBM AI leader | AI Advisor for Startups

Gabriela de Queiroz is the Founder of f02 labs, where she delivers AI Strategy and Developer Advocacy as a Service to help startups accelerate visibility, product adoption, and market awareness. Previously Director of AI at Microsoft, she advised hundreds of startups on building with AI and driving product adoption, and earlier led AI strategy and open-source initiatives at IBM.


In addition to her industry leadership, Gabriela has taught for Coursera, edX, and DataCamp, where her courses have reached over 300,000 learners worldwide. She also founded R-Ladies and AI Inclusive, global communities empowering over 200,000 members.

Previous attendees from:

Google
Apple
Airbnb
Amazon
Microsoft
