Scaling Judge-Time Compute for Robust Auto LLM Evaluation
Hosted by Jason Liu and Leonard Tang
Wed, Jul 2, 2025
6:00 PM UTC (1 hour)
Virtual (Zoom)
Free to join
Go deeper with a course
Wed, Jul 2, 2025
6:00 PM UTC (1 hour)
Virtual (Zoom)
Free to join
81 students
Go deeper with a course
What you'll learn
LLM Judge Reliability Issues
Judge-Time Compute Scaling
RL-Powered Evaluation Systems
Why this topic matters
You'll learn from
Jason Liu
Consultant at the intersection of Information Retrieval and AI
Jason has built search and recommendation systems for the past 6 years. He has consulted and advised a dozens startups in the last year to improve their RAG systems. He is the creator of the Instructor Python library.
Leonard Tang
Co-Founder & CEO @ Haize Labs
Leonard Tang is the Co-Founder and CEO of Haize Labs, he works on solving the ultimate extant problem in AI: ensuring its robustness, quality, and alignment for any application. Prior to this, his research covered adversarial robustness, mathematical reasoning pitfalls, computational neuroscience, interpretability, and language models. Leonard dropped out of , before starting, a Stanford PhD in computer science to pursue Haize Labs.
worked with
Learn directly from Jason Liu and Leonard Tang
By continuing, you agree to Maven's Terms and Privacy Policy.
By continuing, you agree to Maven's Terms and Privacy Policy.