Cohort-based courses
Guided programs to get real results.
AI Evals For Engineers & PMs
4.7
·4 weeks·Sep 5 – Oct 3
Hamel Husain ML Engineer with 25 years of experience
Shreya Shankar ML Systems & Applied AI Evals Researcher
Production-Ready Systems with LLMs and Agents: An Intensive for Engineers
NEW·6 weeks·Jul 13 – Aug 23

Ehsan Gazar Principal Engineer | AI & System Design
Hardcore Agentic Engineering for builders who ship
NEW·3 weeks·Aug 3 – Aug 21


Greg Ceccarelli Founder, SpecStory • Ex-CPO, Pluralsight
+ Sean Johnson and Jake Levirne
AI Evals and Analytics Playbook
5.0
·3 weeks·Jun 22 – Jul 13

Stella Liu Head of AI Applied Science
Amy Chen Cofounder, AI Evals & Analytics
Beyond Evals: Designing Improvement Flywheels for AI Products
NEW·3 weeks·Sep 19 – Oct 10

Aishwarya Naresh Reganti AI Founder & Advisor to F500s | Ex-AWS
Kiriti Badam Applied AI @ OpenAI Codex | Ex-Google
Build a Software Factory: Hands-off agentic coding for experienced engineers
3 weeks·Jul 14 – Aug 6


Matt Wynne Cucumber co-founder, BDD pioneer
+ Aldric Giacomoni, Jeremy Lightsmith, and David Laing
1-day workshops
Short, focused sessions to build specific skills.
Building Multi-Agent Forecasting Systems
NEW·7 hours·Jun 27

Stefan Jansen Author, ML for Trading · Applied AI
Debug Your AI Product: Private Team Workshop
Jun 30


Hamel Husain ML Engineer with 20+ years experience
Shreya Shankar ML Systems Researcher
AI Trust at Scale. From Evals to Governance
NEW·3 hours·Jul 11

Subha Shetty Fractional Chief Product and AI Officer
Free Lightning Lessons
Interactive sessions to explore new topics.
Mastering Agentic RAG & AI Evals
·60 minutes·1,146 Students·Watch
Dr. Ryan Ahmed, Ph.D., MBA and Kukesh Kodess
Ship a Production Cursor Agent System in 30 Minutes
·Jun 24·30 minutes·164 Students·Live
Carmelo Iaria
How to Setup Evals For Agents
·30 minutes·1,848 Students·Watch
Harrison Chase, Hamel Husain, and
Pressure-test any AI analysis
·Jun 24·60 minutes·781 Students·Live
Shane Butler, Sravya Madipalli, and Hai Guan
Raise Your Technical Bar as an AI-Native PM
·30 minutes·15,903 Students·Watch
Jason P. Yoong and Gayathri Keerthana (GK)
Build Multi-Agent Systems You Can Audit
·Jun 24·30 minutes·37 Students·Live
Stefan Jansen
From trading idea to validated strategy
·Jun 24·30 minutes·21 Students·Live
Stefan Jansen
The New Frontier of AI Search
·Jun 30·75 minutes·16 Students·Live
Trey Grainger and Doug Turnbull
AI Evals for Product Managers
·60 minutes·2,042 Students·Watch
Anshumani Ruddra
Modern Information Retrieval Evaluation In The RAG Era
·45 minutes·5,365 Students·Watch
Nandan Thakur, Hamel Husain, and Shreya Shankar
Build Your AI Evals & Analytics Playbook
·30 minutes·532 Students·Watch
Stella Liu and Amy Chen
Design Evals Users Will Trust
·45 minutes·781 Students·Watch
Aishwarya Naresh Reganti
Evals in Action With Arize
·45 minutes·210 Students·Watch
Laurie Voss
Setting Eval for AI Agents & Scaling with Auto-Evaluation
·30 minutes·866 Students·Watch
Mahesh Yadav
Evaluate AI agents with Confidence
·45 minutes·803 Students·Watch
Mahesh Yadav
Production Grade AI Evals by Braintrust.dev
·30 minutes·509 Students·Watch
Mengying Li
Scale Evals Without the Chaos
·45 minutes·255 Students·Watch
Aishwarya Naresh Reganti
Collaborative AI Evals with Human Feedback
·30 minutes·120 Students·Watch
Rogério Chaves
Debug the weird stuff your AI does (in less than 1 hour)
·45 minutes·5,171 Students·Watch
Marily Nika and Hamel Husain
Automating Evals With Claude Code + Phoenix
·60 minutes·2,362 Students·Watch
Mikyo King and Hamel Husain
Evals for Everyone
·3 lessons·2,202 Students·Watch
Aishwarya & Kiriti
Evaluating AI Agents
·45 minutes·1,434 Students·Watch
Amir Feizpour and Samuel Dion-Girardeau
From Automation to Multi-Agent Architectures
·3 lessons·1,361 Students·Watch
Hamza Farooq
Build Your Own Eval Tools With Notebooks!
·45 minutes·615 Students·Watch
Vincent D. Warmerdam, Hamel Husain, and Shreya Shankar
Evaluation Driven Development for Agentic AI Systems
·45 minutes·591 Students·Watch
Aurimas Griciūnas
Strategies for building self-improving document processing
·60 minutes·431 Students·Watch
Jason Liu and Eli Badgio
How to Drive AI Evals Adoption
·30 minutes·333 Students·Watch
Dr Sebastian Fox
Stay Ahead in AI: Evaluate Any New LLM in 15 Minutes
·30 minutes·95 Students·Watch
Sherveen Mashayekhi
Evaluating AI Agents before Users Break Them
·60 minutes·89 Students·Watch
Aki Wijesundara, PhD, Marc Klingen, and Lotte Verheyden
Run Eval Loops and Guardrails for Cursor Agents
·30 minutes·88 Students·Watch
Carmelo Iaria
Setting up your first AI eval with a LLM-as-judge
·45 minutes·68 Students·Watch
Madalina Turlea and Catalina Turlea
Go Beyond AI Evals: Diagnose and Decide
·45 minutes·55 Students·Watch
Rajiv Shah
Debug Cursor Agent Failures Before Production
·30 minutes·46 Students·Watch
Carmelo Iaria
How to test AI when you don't have any data yet
·45 minutes·26 Students·Watch
Madalina Turlea and Catalina Turlea
Error Analysis: The AI Engineer’s Best ROI
·60 minutes·1,518 Students·Watch
Hamel Husain and Shreya Shankar
Evaluating Agentic AI Applications Beyond Vibe Checks
·45 minutes·1,255 Students·Watch
Aishwarya Naresh Reganti, Kiriti Badam, and Claire Longo
Understanding Embedding Performance through Generative Evals
·60 minutes·1,181 Students·Watch
Jason Liu and Kelly Hong
How OpenAI Customers Use Evals To Build Better AI Products
·30 minutes·1,083 Students·Watch
Jim Blomo and Hamel Husain
How Evals Made GitHub Copilot Happen
·30 minutes·893 Students·Watch
John Berryman, Shawn Simister, and Hamel Husain
Learn Agentic AI: Setting agents metrics and evaluations
·45 minutes·861 Students·Watch
Mahesh Yadav
Optimize Structured Data Retrieval With Evals
·45 minutes·843 Students·Watch
Daniel Svonava and Hamel Husain
Online Evals and Production Monitoring
·60 minutes·831 Students·Watch
Jason Liu, Ben Hylak, and Sidhant Bendre
AI Systems Under Pressure: Red-Team Before You Ship
·60 minutes·803 Students·Watch
Krystal Jackson
Improve reliability of your AI applications
·30 minutes·747 Students·Watch
Shreya Rajpal
Optimize Your Dev Setup For Evals w/ Cursor Rules & MCP
·30 minutes·689 Students·Watch
Isaac Flath, Hamel Husain, and Shreya Shankar
How You Catch Production Hallucinations in Real Time
·60 minutes·505 Students·Watch
Jason Liu and Julia Neagu
Scaling Judge-Time Compute for Robust Auto LLM Evaluation
·60 minutes·489 Students·Watch
Jason Liu and Leonard Tang
Practical Evaluation Strategies for AI Agents
·45 minutes·476 Students·Watch
Hamza Farooq and Gabriela de Queiroz
Master Evaluation Techniques for LLM Apps
·30 minutes·413 Students·Watch
Haroon Choudery
Understand SHAP (SHapley Additive exPlanations)
·30 minutes·311 Students·Watch
Patrick Hall
Reliable RAG Agents: Intent-Driven Failure Detection
·60 minutes·298 Students·Watch
Jason Liu and Ben Hylak
Create MCP Tool Evals Before You Ship
·45 minutes·283 Students·Watch
Emmanuel Paraskakis
Don't Tweak Prompts. Engineer Agents.
·30 minutes·274 Students·Watch
Hugo Bowne-Anderson and Skylar Payne
Evals for Voice AI: Learnings from Google Evals Team
·30 minutes·246 Students·Watch
Ravin Kumar
Mastering LLM Application Testing
·30 minutes·240 Students·Watch
Hugo Bowne-Anderson and Stefan Krawczyk
Synthetic RAG evaluation
·60 minutes·214 Students·Watch
Alexey Grigorev and Doug Turnbull
Calibrate LLM-as-a-judge for Real-world Impact
·45 minutes·209 Students·Watch
Eddie Landesberg
🛠 Synthetic Data Flywheels: Build Reliable LLM Apps Faster
·30 minutes·187 Students·Watch
Hugo Bowne-Anderson and Stefan Krawczyk
The Hidden Signal in Production AI Logs
·60 minutes·173 Students·Watch
Jason Liu and Scott Clark
How to test and improve your AI agents
·45 minutes·167 Students·Watch
Jacob Bank
De-Risking LLM Model Switches w Evals & Prompt Optimization
·45 minutes·145 Students·Watch
Amir Feizpour and Hugo Mailhot
Part 3: Building Robust Evaluations for AI Agents
·60 minutes·144 Students·Watch
Hamza Farooq and Gabriela de Queiroz