3 Weeks
·Cohort-based Course
Learn how to create your own training framework like Accelerate
This course is popular
5 people enrolled last week.
3 Weeks
·Cohort-based Course
Learn how to create your own training framework like Accelerate
This course is popular
5 people enrolled last week.
Course overview
Every training framework wraps around the same core distributed training strategies:
- torch's Distributed Data Parallelism
- DeepSpeed
- torch's FullyShardedDataParallelism
However, APIs can break. Strong deprecations in the name of "clean code" can destroy your entire stack.
This course is designed to help you create your own scalable training framework by incorporating the most common tools in the industry.
This is not a "how does DDP work" course.
This is a "how do I incorporate DDP into an internal training framework that is stable and works well"
Our aim is at the end of 3 weeks to create a minimal version of Hugging Face's Accelerate with a few different spins.
This includes:
1. Writing the source code
2. Learning how to test distributed code well
3. How do you create CI's in Github for distributed testing
I'll give you all the tricks I've learned over 4 years in developing accelerate into one class to help you not only write frameworks yourself but also be able to understand common patterns in other frameworks to make navigating them easier
To help facilitate the learning, Prime Intellect will be sponsoring $300 in compute per student
01
Students wanting hands-on experience with what real training frameworks are like in the real world and how the code is structured
02
Beginner MLE's who want to understand how to go from one-off scripts to a more robust framework that they own
03
Mid-level MLE's wanting to know how to ensure their training stack is up to par with the latest integrations
Our aim is to make a training framework in PyTorch using common libraries. So as a result, I expect you know some PyTorch
We're building training frameworks. Please make sure you know how back-propagation works (as an idea) and such
How to build a scalable API
We need to create a multi-faceted API for such a framework. You will do so and help understand the balance of functionality with readability
How to use FSDP/DDP/DeepSpeed
We'll directly be using the latest and most common training APIs and understanding how they can work together
How to test distributed code
Testing distributed code isn't as simple as "pytest domything"
Live sessions
Learn directly from Zachary Mueller in a real-time, interactive format.
Lifetime access
Go back to course content and recordings whenever you need to.
Community of peers
Stay accountable and share insights with like-minded professionals.
Certificate of completion
Share your new skills with your employer or on LinkedIn.
Maven Guarantee
This course is backed by the Maven Guarantee. Students are eligible for a full refund up until the halfway point of the course.
Nov
4
Nov
6
Nov
11
Nov
13
Nov
18
Nov
20
This is where you'll add your bio as a way to establish credibility and demonstrate to your audience why you're the right person to teach this course.
Join an upcoming cohort
Cohort 1
$1,000
Dates
Payment Deadline
Join an upcoming cohort
Cohort 1
$1,000
Dates
Payment Deadline