This cohort meets on Thursday evenings, 7-9pm.

Thursday Cohort

A hands-on course on RLHF for finetuning LLMs, with a complete introduction to reinforcement learning, PPO, and the finetuning process.

Junling Hu

Founder of Coach.ai

Mastering Finetuning LLMs with RLHF in 4 weeks

RLHF (Reinforcement Learning from Human Feedback) is the secret weapon behind ChatGPT and Llama. It is a crucial step in enhancing the performance of an LLM. By the end of this workshop, you will have a complete understanding of RLHF and the components behind it, such as PPO, the reward function, and supervised finetuning. You will gain the confidence of knowing the full RLHF process and how to implement it.

Mastering RLHF for Finetuning LLMs

Technology managers who want to understand RLHF and the process of finetuning LLMs, and gain hands-on experience.

Software engineers and data scientists who want to grow in their careers, learn new skills, and gain hands-on experience.

Academics and students who want to learn about RLHF, a cutting-edge technique for LLMs.

Who is this course for

What you’ll get out of this course

You will gain a deep understanding of LLM finetuning and its end-to-end process.

Deep understanding of LLM finetuning

You will gain a complete understanding of RLHF and the components behind it, such as PPO, the reward function, and supervised finetuning.

Complete knowledge of RLHF

You will gain a complete knowledge of reinforcement learning and how it applies to LLMs.

Deep understanding of reinforcement learning and its use for LLM

You will gain hands-on experience with RLHF by working with real-world data to solve a problem.

Hands-on experience with RLHF 

Course syllabus

Meet your instructor

What people are saying

You can choose a cohort that meets at a different time of the week: Tuesday evening, Thursday evening, or Wednesday daytime. We meet over Zoom. Each meeting includes a live lecture and working through a live Python notebook.

Tue or Wed or Thur

You can email the instructor with any question during the week. In addition, there is optional homework that you can practice on, which will help you go deeper into the class materials.

Q&A and Additional materials

Course Schedule

This course builds on live workshops and hands-on projects

Active hands-on learning

You’ll be interacting with other learners through breakout rooms and project teams

Interactive and project-based

Join a community of like-minded people who want to learn and grow alongside you

Course completed by

About this course

Learn from live sessions