CTO's Playbook for Faster Agentic RAG

Free Lesson

CTO's Playbook for Faster Agentic RAG

Hosted by Nirant Kasiwal and Doug Turnbull

661 students

In this video

What you'll learn

Latency vs Recall Cheat-Sheet

know exactly when to swap OpenAI for FastEmbed and improve both recall and latency.

Hybrid-to-Agentic Upgrade Path

live-coded: BM25 + vector ➜ tool-calling agent

Stealable RAG Audit Framework

7-point checklist that spots 90 % of real-world retrieval failures before users do.

Why this topic matters

Gen-AI POCs die when retrieval lags or RAG hallucinates. This session shows a repeatable, metrics-driven loop to ship RAG that. (1) Scales to prod traffic without latency meltdowns. (2) Heals itself with lightweight agent logic, and (3) Proves ROI through an audit trail and analytics your exec team & customers can trust.. Walk in with curiosity, walk out with a runnable notebook and a checklist.

You'll learn from

Nirant Kasiwal

Creator FastEmbed

Nirant Kasliwal is a notable AI Engineer with over 7 years of expertise in areas like chatbots, language models, and vector databases. He founded FastEmbed, an embedding library praised for its speed and utilized by companies including NVIDIA.

Recognized by AI luminaries such as Dr. Andrew Ng, Nirant is one of India's leading GenAI scientists. He has significantly contributed to AI education through projects like "Awesome NLP," a resource for engineers learning NLP, and continues to enhance AI accessibility and knowledge sharing.

Doug Turnbull

Led Search at Reddit, Shopify, Wikipedia

Doug has done embedding-based retrieval since using Latent Semantic Indexing to generate search synonyms in 2013. Author of Relevant Search + AI Powered Search, he now helps teams build RAG and search applications. Previous work includes leading search at Reddit, Shopify, and several AI Startups..