Vectorless RAG with PageIndex

Hosted by Matt Overstreet, Doug Turnbull, and Trey Grainger

Mon, Jun 22, 2026

5:00 PM UTC (1 hour)

Virtual (Zoom)

Free to join

Invite your network

Go deeper with a course

Save 25% til Sunday
AI-Powered Search: Modern Retrieval for Humans & Agents
Trey Grainger and Doug Turnbull
View syllabus

What you'll learn

Chunking with document structure

Learn how PageIndex breaks up content up using the author's own boundaries (table of contents + section breaks)

Integrating PageIndex and OpenSearch

Matt will share how he's integrated PageIndex into IBM's OpenSearch product

Solving RAG without embedddings

RAG implies chunks + embeddings. But navigating document structure to find what's relevant is just as important.

Why this topic matters

RAG too long has ignored document structure. The classic RAG stack assumes chunks + embeddings. Practitioners assume the best way to find what's relevant is through pure question-answering. Matt will share his work integrating PageIndex with OpenSearch, He'll share how PageIndex works and how he integrated integrated it alongside classic RAG.

You'll learn from

Matt Overstreet

Product Manager - IBM OpenSearch

Matt Overstreet has spent fifteen years building the distributed databases and search systems behind mission-critical applications. He cut his teeth on search relevance at OpenSource Connections — during the years that gave the "relevance engineer" its name — working alongside Doug Turnbull and John Berryman as Relevant Search was being written, and was a founding participant at Haystack. Since then he's deployed or supported Cassandra for everyone from FedEx to Venmo, helped AWS and Google prove distributed data at Kubernetes scale, and today guides product for OpenSearch inside IBM's watsonx.data. His current obsession: where agentic memory meets traditional retrieval.

Doug Turnbull

Co-Author AI Powered Search

In 2012, Doug got bit by the search bug and he's still trying to keep up. From full-text search, to Learning to Rank models, to search agents that generate their own code, he knows the endless landscape first hand. Yet Doug wants to deeply understand the what / how / why, and help teams use these technologies practically, distinguishing hype from reality.

He’s led search at Reddit, Shopify, and Wikipedia, authored Relevant Search and AI Powered Search, and advised 100+ organizations over the years - all in pursuit of the same question: how does search actually work?

Trey Grainger

Author, AI-Powered Search

Trey Grainger is lead author of the book AI-Powered Search (Manning 2025) and founder of Searchkernel, a software consultancy building the next generation of AI-powered search. He also serves as a technical advisor at OpenSource Connections.


He previously served as CTO of Presearch, a decentralized web search engine, and as Chief Algorithms Officer and SVP of Engineering at Lucidworks, a search company whose technology powers hundreds of the world’s leading organizations. Trey is also co-author of the book Solr in Action (Manning 2014), as well as over a dozen other publications including books, journals, and research papers. Trey has 18 years of experience in search and data science focused on building self-learning search platforms integrating the most successful AI Search techniques.


Trey teaches AI Search in the course AI-Powered Search: Modern Retrieval for Humans & Agents with Doug Turnbull.

Previously at

Shopify.com
Reddit
CareerBuilder
O'Reilly Media
See all products from AI-Powered Search

Sign up to join this lesson

By continuing, you agree to Maven's Terms and Privacy Policy.