Lightning Lessons

Pushing document parsing frontiers with Reducto

Hosted by Jason Liu, Evan Vogelbaum, Yifei Hu, and Alvin Ryanputra

Tue, Oct 14, 2025

6:00 PM UTC (1 hour)

Virtual (Zoom)

Free to join

Go deeper with a course

Systematically Improving RAG Applications
Jason Liu

What you'll learn

Hybrid OCR-VLM Architecture Design

Discover how to combine traditional OCR with vision language models to achieve high accuracy and low latency.

Large-Scale Document Processing Pipeline

Learn to build production systems that parse complex multi-page documents without hallucination issues.

Natural Language Document Automation

Master techniques for enabling PDF/Word editing through conversational interfaces and structured extraction.

Why this topic matters

Most business data is trapped in PDFs and Word docs that traditional AI can't reliably process. This hybrid OCR-VLM approach solves a $50B+ problem affecting every industry. Students will learn production-ready techniques to automate document workflows, opening career paths in enterprise AI, fintech, healthcare, and legal tech where document processing expertise commands premium salaries.

You'll learn from

Jason Liu

Consultant at the intersection of Information Retrieval and AI

Jason has built search and recommendation systems for the past 6 years. Over the last year, he has consulted for and advised dozens of startups on improving their RAG systems. He is the creator of the Instructor Python library.

Evan Vogelbaum

Machine Learning, Reducto

Ex-HFT building models and infra to power the next generation of AI in business

Yifei Hu

Machine Learning, Reducto

Training specialized models to parse documents

Alvin Ryanputra

Engineering, Reducto

SWE building in AI; previously worked on AI-powered code optimization and vector databases

Worked at

Reducto
Stitch Fix
Meta
University of Waterloo
New York University
