đź›  Synthetic Data Flywheels: Build Reliable LLM Apps Faster

Hosted by Hugo Bowne-Anderson and Stefan Krawczyk

176 students

What you'll learn

Use synthetic data to catch failures before real users

Generate structured synthetic data to test edge cases, regressions, and weak spots before users ever see them.

Build an evaluation harness to test LLM apps pre-launch

Create a system that automates testing, validating outputs, and catching failures before deployment.

Create an eval-driven loop for reliable LLM development

Establish a process that continuously refines LLM behavior using systematic evaluations and feedback loops.

Why this topic matters

Most teams build LLM apps without knowing if they’ll work before real users interact with them. This lesson teaches you how to use synthetic data and evaluation-driven development to test and refine LLM systems before launch. We’ll work through a real case study, building an evaluation harness with code you can take with you, ensuring your apps are reliable before deployment.

You'll learn from

Hugo Bowne-Anderson

Podcaster, Educator, DS & ML expert

Hugo Bowne-Anderson is an independent data and AI consultant with extensive experience in the tech industry. He is the host of the industry Vanishing Gradients, where he explores cutting-edge developments in data science and artificial intelligence. As a data scientist, educator, evangelist, content marketer, and strategist, Hugo has worked with leading companies in the field. His past roles include Head of Developer Relations at Outerbounds, a company committed to building infrastructure for machine learning applications, and positions at Coiled and DataCamp, where he focused on scaling data science and online education respectively. Hugo's teaching experience spans from institutions like Yale University and Cold Spring Harbor Laboratory to conferences such as SciPy, PyCon, and ODSC. He has also worked with organizations like Data Carpentry to promote data literacy. His impact on data science education is significant, having developed over 30 courses on the DataCamp platform that have reached more than 3 million learners worldwide. Hugo also created and hosted the popular weekly data industry podcast DataFramed for two years. Committed to democratizing data skills and access to data science tools, Hugo advocates for open source software both for individuals and enterprises.

Stefan Krawczyk

13+years in MLOps: Ex-Stitch Fix, Ex-Nextdoor, Ex-LinkedIn

Stefan Krawczyk is the co-founder and CEO of DAGWorks, an open-source company driving two projects: Hamilton & Burr, whose mission to empower developers to build reliable AI agents & applications. He is a Y Combinator alum, StartX alum, and a Stanford graduate with a Master of Science in Computer Science with Distinction in Research. He has over thirteen years of experience in building and leading data & ML-related systems and teams, at companies like Stitch Fix, Idibon, Nextdoor, and Linkedin, his passion is to make others more successful with data by bridging the engineering gap between data science, machine learning, artificial intelligence, and the business.

Previously at

Yale University
LinkedIn
Stitch Fix
New York University
Stanford University

Go deeper with a course

Building LLM Applications for Data Scientists and Software Engineers
Hugo Bowne-Anderson and Stefan Krawczyk
View syllabus
© 2025 Maven Learning, Inc.