Build your own vector database

Doug Turnbull

Led Search + ML at Reddit and Shopify

Make embedding retrieval an asset, not a headache

In modern search, embeddings answer questions. They go beyond simple keyword matching to find semantically similar answers. Increasingly: RAG, recommendations, and traditional search need zero-in on what's relevant. In this class we'll live-code core data structures that live behind search systems like Pinecone, Weaviate, Turbopuffer, QDrant, Vespa, Elasticsearch etc etc etc.

By building your own vector database, you'll be better equiped to work with production vector search systems. You'll have first-hand experience with the knobs to turn to improve performance and develop a robust hybrid + vector search system

Workshop agenda

  • Benchmarking vector search

    How we think through vector database stats: recall, latency, throughput

  • Building HNSW algorithm from scratch

    Hands on developing the core graph-based algorithm behind search engines: HNSW

  • Filtering vector search

    How to enhance vector algorithms to filter based on metadata

  • Optimizing with quantization

    How to use standard quantization techniques to reduce memory, improve speed, without sacrificing recall

  • Layering in hybrid search

    Lexical retrieval isn't obsolete - we'll talk about approaches to combining lexical and vector search into a single solution

Learn directly from Doug

Doug Turnbull

Doug Turnbull

Search at Reddit, Shopify, Wikipedia

Coached teams at
Reddit
Shopify.com
Apple
Amazon Web Services
Wikipedia
See all products from Doug

Who this workshop is for

  • Infrastructure teams - anyone tasked to squeeze the most performance out of a high-scale retrieval system

  • Search developers - anyone that needs to build relevant, fast vector search for RAG or agentic search applications

  • AI engineers - need to find that relevant context? Anyone who needs to find context to answer questions from AI

What's included

Doug Turnbull

Live sessions

Learn directly from Doug Turnbull in a real-time, interactive format.

Lifetime access

Go back to course content and recordings whenever you need to.

Community of peers

Stay accountable and share insights with like-minded professionals.

Certificate of completion

Share your new skills with your employer or on LinkedIn.

Maven Guarantee

Your purchase is backed by the Maven Guarantee.

Frequently asked questions

Maven for Teams

Reimbursement

Get your company to pay

Everything L&D needs: email template, receipts, and certificate of completion.

Get reimbursed

Team discount

Learn with your teammates

Save 20%+ when 2 or more teammates enroll in the same cohort.

Save 20%+ with a team

Private cohort

Run a cohort for your org

A dedicated cohort with a custom schedule and curriculum, tailored to your team.

Book a private cohort

$600

USD

Aug 11
Enroll