AI Evals

Cohort-based courses

Guided programs to get real results.

AI Evals For Engineers & PMs

4.7
·4 weeks·Sep 7 – Oct 2
Hamel Husain, ML Engineer with 20 years of experience
Shreya Shankar, ML Systems & Applied AI Evals Researcher

Beyond Evals: Designing Improvement Flywheels for AI Products

NEW·3 weeks·Jun 6 – Jun 27
Aishwarya Naresh Reganti, AI Founder & Advisor to F500s | Ex-AWS

1-day workshops

Short, focused sessions to build specific skills.

AI Trust at Scale. From Evals to Governance

NEW·3 hours·Jun 30
Subha Shetty, Fractional Chief Product and AI Officer

Free Lightning Lessons

Interactive sessions to explore new topics.
  • How to Setup Evals For Agents

    Watch
    ·30 minutes·1,611 Students
    Harrison Chase, Hamel Husain, and
  • Raise Your Technical Bar as an AI-Native PM

    Watch
    ·30 minutes·15,831 Students
    Jason P. Yoong and Gayathri Keerthana (GK)
  • Debug the weird stuff your AI does (in less than 1 hour)

    Watch
    ·45 minutes·5,165 Students
    Marily Nika and Hamel Husain
  • From Automation to Multi-Agent Architectures

    Watch
    ·3 lessons·1,351 Students
    Hamza Farooq
  • Vibe Code Annotation UIs for AI Analytics Evals

    Live
    ·Jun 24·60 minutes·633 Students
    Shane Butler
  • Build Your AI Evals & Analytics Playbook

    Watch
    ·30 minutes·509 Students
    Stella Liu and Amy Chen
  • AI Evals for Product Managers

    Watch
    ·60 minutes·2,011 Students
    Anshumani Ruddra
  • Debug Cursor Agent Failures Before Production

    Live
    ·Jun 10·30 minutes·39 Students
    Carmelo Iaria
  • Evaluation Driven Development for Agentic AI Systems

    Watch
    ·45 minutes·586 Students
    Aurimas Griciūnas
  • Production Grade AI Evals by Braintrust.dev

    Watch
    ·30 minutes·490 Students
    Mengying Li
  • Practical Evaluation Strategies for AI Agents

    Watch
    ·45 minutes·471 Students
    Hamza Farooq and Gabriela de Queiroz
  • Ship a Production Cursor Agent System in 30 Minutes

    Live
    ·Jun 24·30 minutes·93 Students
    Carmelo Iaria
  • Learn Agentic AI: Setting agents metrics and evaluations

    Watch
    ·45 minutes·855 Students
    Mahesh Yadav
  • How to Drive AI Evals Adoption

    Watch
    ·30 minutes·325 Students
    Dr Sebastian Fox
  • Evals for Voice AI: Learnings from Google Evals Team

    Watch
    ·30 minutes·242 Students
    Ravin Kumar
  • Design Evals Users Will Trust

    Watch
    ·45 minutes·770 Students
    Aishwarya Naresh Reganti
  • Evals for Everyone

    Watch
    ·3 lessons·2,195 Students
  • Setting Eval for AI Agents & Scaling with Auto-Evaluation

    Watch
    ·30 minutes·862 Students
    Mahesh Yadav
  • Modern Information Retrieval Evaluation In The RAG Era

    Watch
    ·45 minutes·5,281 Students
    Nandan Thakur, Hamel Husain, and Shreya Shankar
  • Build Your Own Eval Tools With Notebooks!

    Watch
    ·45 minutes·612 Students
    Vincent D. Warmerdam, Hamel Husain, and Shreya Shankar
  • Part 3: Building Robust Evaluations for AI Agents

    Watch
    ·60 minutes·140 Students
    Hamza Farooq and Gabriela de Queiroz
  • Setting up your first AI eval with a LLM-as-judge

    Watch
    ·45 minutes·62 Students
    Madalina Turlea and Catalina Turlea
  • Collaborative AI Evals with Human Feedback

    Watch
    ·30 minutes·113 Students
    Rogério Chaves
  • Run Eval Loops and Guardrails for Cursor Agents

    Live
    ·May 27·30 minutes·79 Students
    Carmelo Iaria
  • Master Evaluation Techniques for LLM Apps

    Watch
    ·30 minutes·412 Students
    Haroon Choudery
  • Mastering LLM Application Testing

    Watch
    ·30 minutes·240 Students
    Hugo Bowne-Anderson and Stefan Krawczyk
  • Evaluating Agentic AI Applications Beyond Vibe Checks

    Watch
    ·45 minutes·1,250 Students
    Aishwarya Naresh Reganti, Kiriti Badam, and Claire Longo
  • How Evals Made GitHub Copilot Happen

    Watch
    ·30 minutes·892 Students
    John Berryman, Shawn Simister, and Hamel Husain
  • Reliable RAG Agents: Intent-Driven Failure Detection

    Watch
    ·60 minutes·298 Students
    Jason Liu and Ben Hylak
  • Strategies for building self-improving document processing

    Watch
    ·60 minutes·429 Students
    Jason Liu and Eli Badgio
  • The Hidden Signal in Production AI Logs

    Watch
    ·60 minutes·172 Students
    Jason Liu and Scott Clark
  • Calibrate LLM-as-a-judge for Real-world Impact

    Watch
    ·45 minutes·205 Students
    Eddie Landesberg
  • Evaluating AI Agents before Users Break Them

    Watch
    ·60 minutes·88 Students
    Aki Wijesundara, PhD, Marc Klingen, and Lotte Verheyden
  • Automating Evals With Claude Code + Phoenix

    Watch
    ·60 minutes·2,354 Students
    Mikyo King and Hamel Husain
  • How to test and improve your AI agents

    Watch
    ·45 minutes·167 Students
    Jacob Bank
  • Go Beyond AI Evals: Diagnose and Decide

    Watch
    ·45 minutes·52 Students
    Rajiv Shah
  • Understand SHAP (SHapley Additive exPlanations)

    Watch
    ·30 minutes·310 Students
    Patrick Hall
  • Improve reliability of your AI applications

    Watch
    ·30 minutes·747 Students
    Shreya Rajpal
  • Evaluating AI Agents

    Watch
    ·45 minutes·1,428 Students
    Amir Feizpour and Samuel Dion-Girardeau
  • De-Risking LLM Model Switches w Evals & Prompt Optimization

    Watch
    ·45 minutes·145 Students
    Amir Feizpour and Hugo Mailhot
  • Evaluate AI agents with Confidence

    Watch
    ·45 minutes·800 Students
    Mahesh Yadav
  • Error Analysis: The AI Engineer’s Best ROI

    Watch
    ·60 minutes·1,514 Students
    Hamel Husain and Shreya Shankar
  • 🛠 Synthetic Data Flywheels: Build Reliable LLM Apps Faster

    Watch
    ·30 minutes·187 Students
    Hugo Bowne-Anderson and Stefan Krawczyk
  • Understanding Embedding Performance through Generative Evals

    Watch
    ·60 minutes·1,181 Students
    Jason Liu and Kelly Hong
  • Online Evals and Production Monitoring

    Watch
    ·60 minutes·831 Students
    Jason Liu, Ben Hylak, and Sidhant Bendre
  • Optimize Structured Data Retrieval With Evals

    Watch
    ·45 minutes·843 Students
    Daniel Svonava and Hamel Husain
  • Scaling Judge-Time Compute for Robust Auto LLM Evaluation

    Watch
    ·60 minutes·489 Students
    Jason Liu and Leonard Tang
  • How OpenAI Customers Use Evals To Build Better AI Products

    Watch
    ·30 minutes·1,080 Students
    Jim Blomo and Hamel Husain
  • Optimize Your Dev Setup For Evals w/ Cursor Rules & MCP

    Watch
    ·30 minutes·686 Students
    Isaac Flath, Hamel Husain, and Shreya Shankar
  • How You Catch Production Hallucinations in Real Time

    Watch
    ·60 minutes·504 Students
    Jason Liu and Julia Neagu
  • Don't Tweak Prompts. Engineer Agents.

    Watch
    ·30 minutes·274 Students
    Hugo Bowne-Anderson and Skylar Payne
  • Synthetic RAG evaluation

    Watch
    ·60 minutes·210 Students
    Alexey Grigorev and Doug Turnbull
  • Stay Ahead in AI: Evaluate Any New LLM in 15 Minutes

    Watch
    ·30 minutes·93 Students
    Sherveen Mashayekhi
  • AI Systems Under Pressure: Red-Team Before You Ship

    Watch
    ·60 minutes·802 Students
    Krystal Jackson
  • Create MCP Tool Evals Before You Ship

    Watch
    ·45 minutes·282 Students
    Emmanuel Paraskakis
  • How to test AI when you don't have any data yet

    Watch
    ·45 minutes·23 Students
    Madalina Turlea and Catalina Turlea
  • Scale Evals Without the Chaos

    Watch
    ·45 minutes·248 Students
    Aishwarya Naresh Reganti
  • Evals in Action With Arize

    Watch
    ·45 minutes·200 Students
    Laurie Voss

Contact support: support@maven.com

© 2026 Maven Learning, Inc.