Build a Definition of Done for Claude Code and Codex

Part of The AI Evaluation Handbook

Hosted by Greg Ceccarelli

Tue, Jul 14, 2026

8:00 PM UTC (1 hour)

Virtual (Zoom)

Free to join

Invite your network

Go deeper with a course

Hardcore Agentic Engineering for builders who ship
Greg Ceccarelli, Sean Johnson, and Jake Levirne
View syllabus

What you'll learn

Why does your coding agent decide it's "done"?

You'll understand the mechanics of what makes a coding agent stop, and why native confident "done" is a guess it makes

How often do you reply "did you miss anything?

You'll stop interrogating your agent and learn to build a check that runs the real test so it only ever stops on a pass.

What if "done" had to be earned by your agent?

Leave with a tiny harness to wrap Claude Code and Codex that keeps tool calls going until your done criteria passes

Why this topic matters

You already know defining "done" is product's oldest hard problem, the place rework hides. Now your agent calls half-built work "done" with total confidence, and as more agents run longer you can't read every line. Come build a check that makes "done" something a machine verifies, so you stop babysitting and start supervising. That's the shift at the heart of Hardcore Agentic Engineering.

You'll learn from

Greg Ceccarelli

Co-Founder • Ex-CPO Pluralsight • Data at GitHub, Dropbox & Google

I co-founded SpecStory and have built our products (open source extensions and Stoa) entirely by steering agents with almost no hand-written code.

I wrote about it extensively: the Beyond Code-Centric whitepaper, Goal Engineering, and 25 Patterns in Agentic Engineering book.

Before this I was Chief Product Officer at Pluralsight and led data teams at GitHub, Dropbox, and Google. I'm passionate about teaching what it takes to actually ship with agents: the process, mindset and actual techniques that work.

Previously at

Pluralsight
GitHub
Google
Dropbox
AlixPartners
See all products from Greg

Sign up to join this lesson

By continuing, you agree to Maven's Terms and Privacy Policy.