Evaluating LLMs for Your Applications
Hosted by Mahesh Yadav
What you'll learn
Framework for choosing the right LLMs
Setting Evaluation Criteria
Case Study: Contract Processing Application
Why this topic matters
You'll learn from
Mahesh Yadav
GenAI Product Lead at Google, previously at Meta, Amazon, and Microsoft
Mahesh Yadav is a Product Leader on Google's GenAI team. He is one of the world's top AI executives and an award-winning AI Product Educator. His work on AI has been featured at the Nvidia GTC conference, at Microsoft Build, and on Meta blogs.
Mahesh has 20 years of experience building products on the AI teams at Meta, Microsoft, and AWS. He has worked at every layer of the AI stack, from AI chips to LLMs, and has a deep understanding of how GenAI companies ship value to customers.
He currently leads AI agents at Google Cloud, where they are used extensively in Gemini and other key Google products.