Turn Eval Results Into a Better Model

Part of AI Product Engineering

Hosted by Will Brown, Florian Brand, and Hamel Husain

Fri, Jul 31, 2026

7:00 PM UTC (45 minutes)

Virtual (Zoom)

Free to join

Invite your network

Go deeper with a course

Featured in Lenny’s List
AI Evals For Engineers & PMs
Hamel Husain and Shreya Shankar
View syllabus

What you'll learn

Know when to own a model

Decide when owning and training a model beats renting a closed one.

Close the loop with RL

Use reinforcement learning so the model gets better at your real tasks each round.

Translate evals into environments

Turn your evals into an environment the model can hill climb.

Why this topic matters

Teams use evals to find where a model is weak on their tasks. An eval measures the problem but does not improve the model. To improve it, you turn that eval into an environment, a version of the task with a clear way to check the answers. You train the model against that check until it improves. Will and Florian build these at Prime Intellect and will show how teams use them.

You'll learn from

Will Brown

Research Lead at Prime Intellect

Will Brown is Research Lead at Prime Intellect, where he builds open-source research and infrastructure for agentic reinforcement learning, including the verifiers library. He holds a PhD in algorithmic game theory from Columbia, co-advised by Christos Papadimitriou and Tim Roughgarden, and previously worked in Morgan Stanley's machine learning research group.

Florian Brand

Research Engineer at Prime Intellect

Florian Brand is a Research Engineer at Prime Intellect, where he works on LLM evals. He focuses on the practical side of evaluation: reproducibility, infrastructure, and building signals that models can actually improve on. He also writes and contributes to discussions on open models.

Hamel Husain

ML Engineer with 20+ years of experience

Hamel Husain is a ML Engineer with 20+ years of experience. He has worked with companies such as Airbnb and GitHub, which included early LLM research used by OpenAI, for code understanding. He has also led and contributed to numerous popular open-source machine-learning tools. Hamel is currently an independent developer helping companies with applied evals.

See all products from Hamel Husain & Shreya Shankar

Sign up to join this lesson

By continuing, you agree to Maven's Terms and Privacy Policy.