Making LLM Agents Observable & Debuggable

Hosted by Hugo Bowne-Anderson and Vincent Koc

Thu, Jul 3, 2025

4:00 PM UTC (30 minutes)

Virtual (Zoom)

Free to join

Invite your network

Go deeper with a course

Building LLM Applications for Data Scientists and Software Engineers
Hugo Bowne-Anderson and Stefan Krawczyk
View syllabus

What you'll learn

How to debug and monitor agent behaviour in real-time

LLM agents fail silently, hallucinate, and drift: learn to catch issues early with output checks, trace logs, & metrics.

Work with human annotations and LLM's-as-a-judge

Use humans and LLMs to evaluate outputs with real-world workflows and practical examples you can apply immediately.

Using MCPs to level-up your vibe coding with telemetry

Give your IDE eyes and ears using Opik MCP to add telemetry and metrics, so you can spot and fix AI issues fast.

Start building today with open-source cookbooks

Get hands-on examples that work across LLMs and agent frameworks—apply these methods in your stack right away.

Why this topic matters

As LLM agents take on complex tasks—long chats, memory, multi-step tools—traditional model evals fall short. Failures go undetected, costing time, trust, and money. Opik is an open-source platform that brings observability to agents: test behavior, trace actions, and improve performance continuously. Learn how to debug smarter and ship more reliable AI systems.

You'll learn from

Hugo Bowne-Anderson

Podcaster, Educator, DS & ML expert

Hugo Bowne-Anderson is an independent data and AI consultant with extensive experience in the tech industry. He is the host of the industry Vanishing Gradients, where he explores cutting-edge developments in data science and artificial intelligence. As a data scientist, educator, evangelist, content marketer, and strategist, Hugo has worked with leading companies in the field. His past roles include Head of Developer Relations at Outerbounds, a company committed to building infrastructure for machine learning applications, and positions at Coiled and DataCamp, where he focused on scaling data science and online education respectively. Hugo's teaching experience spans from institutions like Yale University and Cold Spring Harbor Laboratory to conferences such as SciPy, PyCon, and ODSC. He has also worked with organizations like Data Carpentry to promote data literacy. His impact on data science education is significant, having developed over 30 courses on the DataCamp platform that have reached more than 3 million learners worldwide. Hugo also created and hosted the popular weekly data industry podcast DataFramed for two years. Committed to democratizing data skills and access to data science tools, Hugo advocates for open source software both for individuals and enterprises.

Vincent Koc

Comet, ex-Microsoft, Qantas

Vincent Koc is an AI research engineer at Comet, focused on model and agentic system evaluation as well as telemetry aware agentic systems. With nearly two decades of experience from shipping AI systems for publicly traded companies, contributing to leading open-source projects and working from statistics to deep learning. Vincent is also an adjunct lecturer at leading universities teaching applied AI and generative AI focusing on NLP. Vincent brings a very hands-on, cross-industry lens to his work with an academic grounding. He is passionate about making AI accessible to everyone and is currently authoring a book and has published for several publications and papers in this space. His work is available on github.com/vincentkoc ⁠ ⁠



Previously at

Yale University
Microsoft
Qantas

Learn directly from Hugo Bowne-Anderson and Vincent Koc

By continuing, you agree to Maven's Terms and Privacy Policy.

© 2025 Maven Learning, Inc.