Agentic RAG Systems & Multi-Agent Architectures: Developers Edition

4.8 (54)

·

7 Weeks

·

Cohort-based Course

Master Advanced Techniques for Building and Optimizing Agentic RAG Systems and Multi-Agent Workflows — Designed for Builders

This course is popular

7 people enrolled last week.

Previously at

Google
Ucla
company logo
University of Minnesota

Course overview

The Agent Technical Course: Build and Deploy Production-Grade Gen AI Products

🧠 Masterclass in Agentic RAG & Multi-Agent Deployment


From Naive Bots to Production-Ready Agentic Systems


Welcome to the AI builder’s dojo — a technically rigorous, unapologetically hands-on course designed for builders ready to go beyond plug-and-play wrappers and start designing agent systems that reason, route, and adapt.


This is not a surface-level “course.” It’s a 10-week, live-led deep dive into autonomous systems:


7 instructor-led sessions · 7 interactive office hours · 1 epic Demo Day.


No fluff. No wrappers. Just agents that work.


🧠 Course Breakdown – What You'll Learn


1. Agentic RAG with Routers — Why Naive RAG Breaks

We begin by deconstructing naive RAG — systems that fail under multi-turn, context-rich queries. You’ll build your own agentic retrieval system with intelligent routers, reflection, memory, and reasoning — capable of tool invocation, multi-agent coordination, and smart chunk selection.

🔍 You’ll learn:

-Stateless vs. stateful RAG

-When cosine similarity fails

-Designing context-aware routing logic

-Reflection, ReAsk, and multi-hop search strategies


2. Hosting & Quantizing LLMs — Local + Runpod

Production-ready agents can’t always rely on OpenAI.

You’ll learn to quantize models (GPTQ, GGUF) for speed and cost-efficiency, and deploy them with Ollama locally and RunPod in the cloud, with tools like FastAPI and auto-scaling on demand.


🚀 You’ll learn:

-LLM quantization strategies (4-bit, GGML, QLoRA)

-On-device hosting via Ollama

-Deployment via RunPod or serverless GCP

-Streamed inference + latency benchmarking


3. Semantic Caching — Build It from Scratch

We’ll implement a semantic caching layer from scratch that recognizes similar queries, avoids unnecessary calls to the model, and improves performance over time using vector proximity and feedback loops.

💡 You’ll build:

Feedback loop to train your cache

Cache hit/miss architecture

Semantic distance functions + reranking

Cost-saving and latency benchmarks


4. Knowledge Graphs from Scratch — Text-to-Cypher

Go beyond flat retrieval with structured reasoning using Knowledge Graphs. You’ll implement a graph-based memory layer with Cypher generation from natural language, and use DSPy to guide model outputs toward your schema.

🌐 You’ll learn:

Graph modeling for agent memory

Extracting entities + relations from unstructured text

Generating Cypher queries from prompts

Integrating Neo4j or Memgraph with RAG


5. ReAct Agents — Python & No-Code with n8n

We’ll go deep into ReAct (Reason + Act) — one of the most powerful agent paradigms — and then rebuild it using both Python and no-code tools like n8n. Perfect for teams and workflows where technical + non-technical builders collaborate.

🔧 You’ll build:

Modular ReAct pipelines (tool use, planning, reflection)

Human-in-the-loop agents

No-code agents in n8n connected to APIs + databases

Multi-step workflows with visual orchestration


6. Bringing It All Together — ADK, MCP, A2A, and Guardrails

The final sprint: we combine everything into a production system using Google’s ADK (Agent Development Kit), implement MCP (Modular Cognitive Planning), and create agent-to-agent (A2A) collaboration. You'll also implement industrial-grade guardrails and deploy securely with GCP integrations.


🛡️ You’ll ship:

Multi-agent collaboration workflows

-Safety guardrails using Llama Guard

- Production deployment + monitoring

- A capstone project solving a real-world enterprise task


🧩 Who This Is For

If you’ve already built basic RAG tools or chatbots, and want to level up into true agent systems with autonomy, orchestration, and scale — this course is for you.

-AI Engineers / LLM Builders

-Technical Product Managers

-Devs working on internal tools or AI copilots

-Cloud / MLOps Engineers


🔧 What You’ll Get

✅ 10 weeks of high-signal live instruction

✅ Weekly office hours + async support

✅ A final capstone project + demo day

✅ Real-world projects, not toy demos

✅ Guest lectures from AI builders at Google, Meta, OpenAI

✅ Early access to tools from Traversaal.ai

✅ Lifetime access to content + all future updates


🎓 Prerequisites

Experience building RAG or LLM-based tools

Knowledge of APIs, encoders/decoders, Python


Basic understanding of cloud or serverless deployment

👉 Need a ramp-up? Start with: Building LLM Applications: https://maven.com/boring-bot/ml-system-design


🎯 This Course Is For Builders

Not marketers, not spectators — builders who want to deploy real AI.


This course is for you if you are a:

01

Machine Learning Engineer exploring different techniques to scale LLM solutions

02

Researcher, who would like to delve in to various aspects of open-source LLMs

03

Software Engineer, looking to learn how to integrate AI into their products

What you’ll get out of this course

Advanced AI Architectures

Understand and implement complex AI architectures, including enterprise-level RAG systems and agentic RAG strategies. You will also dive deep into the Mixture of Experts (MoE) technique and other model merging strategies to enhance the capabilities of your AI systems.

Practical Skills for Deployment

From building semantic caches using GCP and Redis to deploying LLMs on serverless platforms like AWS Bedrock, you'll learn the practical skills to deploy and manage AI applications in real-world scenarios. 

Fine-Tuning Expertise

Acquire advanced techniques for fine-tuning LLMs, enabling you to adapt these models to specific tasks or domains and enhance their performance in targeted applications.

Efficient Inference Processing

Explore strategies for exploring and optimizing inference speeds, ensuring that your language models perform efficiently in real-time scenarios, a crucial skill for deploying responsive and scalable applications.

Knowledge of Responsible AI

Understand the importance of ethical AI development and learn to implement guardrails using tools like NeMo, Colang, and Llama Guard to ensure your AI systems align with responsible AI principles.




What’s included

Hamza Farooq

Live sessions

Learn directly from Hamza Farooq in a real-time, interactive format.

Lifetime access

Go back to course content and recordings whenever you need to.

Community of peers

Stay accountable and share insights with like-minded professionals.

Certificate of completion

Share your new skills with your employer or on LinkedIn.

Maven Guarantee

This course is backed by the Maven Guarantee. Students are eligible for a full refund up until the halfway point of the course.

Course syllabus

14 live sessions • 35 lessons • 6 projects

Week 1

Jul 12—Jul 13

    Recordings from previous Talks/ Sessions

    2 items

    Enterprise RAG Solutions with Semantic Caching

    6 items

    Jul

    12

    Session 1: Enterprise RAG

    Sat 7/124:00 PM—6:00 PM (UTC)

Week 2

Jul 14—Jul 20

    Optimizing and Deploying Large Language Models

    10 items

    Jul

    19

    Session 2: Deploying LLMs with Quantization

    Sat 7/194:00 PM—6:00 PM (UTC)

    Jul

    16

    Office Hours

    Wed 7/167:00 PM—7:30 PM (UTC)
    Optional

Week 3

Jul 21—Jul 27

    Jul

    26

    Session 3: Guardrails and AI Safety

    Sat 7/264:00 PM—6:00 PM (UTC)

    Module: DSPy and Implementing Guardrails for Responsible AI

    6 items

    Jul

    23

    Office Hours

    Wed 7/237:00 PM—7:30 PM (UTC)
    Optional

Week 4

Jul 28—Aug 3

    Holiday Assignment

    2 items

    Jul

    30

    Office Hours

    Wed 7/307:00 PM—7:30 PM (UTC)
    Optional

    Aug

    1

    Tech Friday: Deploying LLM Endpoints in Enterprise

    Fri 8/17:00 PM—8:00 PM (UTC)

Week 5

Aug 4—Aug 10

    Knowledge Graphs

    8 items

    Aug

    9

    Session 4: Knowledge Graphs

    Sat 8/94:00 PM—6:00 PM (UTC)

Week 6

Aug 11—Aug 17

    Agents: A Deeper Overview

    3 items

    Aug

    16

    Session 5: Agents

    Sat 8/164:00 PM—6:00 PM (UTC)

    Aug

    15

    Tech Friday: Agents in Action

    Fri 8/157:00 PM—8:00 PM (UTC)

    Aug

    13

    Office Hours

    Wed 8/136:00 PM—6:30 PM (UTC)

Week 7

Aug 18—Aug 24

    Semantic and Agentic RAG

    2 items

    Model Merging and Fine-tuning Video recordings

    2 items

    Aug

    20

    Office Hours

    Wed 8/206:00 PM—7:00 PM (UTC)

    Aug

    23

    Session 6: Agentic RAG and Semantic Chunking

    Sat 8/234:00 PM—6:00 PM (UTC)

Post-course

    Demo Day

    0 items

    Sep

    13

    Demo Day

    Sat 9/134:00 PM—6:00 PM (UTC)

4.8 (54 ratings)

What students are saying

Meet your instructor

Hamza Farooq

Hamza Farooq

I am the founder of Traversaal.ai, an LLM-based startup dedicated to creating scalable, customizable, and cost-efficient language model solutions for enterprises.


With over 15 years of experience in machine learning, my journey has spanned three continents and seven countries, covering a diverse range of industries such as tech, telecommunications, finance, and retail.


As a former Senior Research Manager at Google and Walmart Labs, I have led data science and machine learning teams, focusing on optimization, natural language processing, recommender systems, and time series forecasting.

I am also an adjunct professor at UCLA, and Instructor for Stanford Continuing Studies where I bridge the gap between academic theory and real-world AI applications.


Additionally, I frequently speak at conferences and conduct training sessions, sharing insights on large language models, deep learning, and cloud computing.

A pattern of wavy dots

Join an upcoming cohort

Agentic RAG Systems & Multi-Agent Architectures: Developers Edition

2025-cohort-3

$1,500

Dates

July 12—Aug 24, 2025

Payment Deadline

July 11, 2025
Get reimbursed

Course schedule

4-6 hours per week

  • Saturdays

    9:00 - 11:00am PT

    Virtual Class

  • Weekly projects

    2-3 hours per week

    Work in teams to build solutions, this requires engagement with other team members

Free resource

🚀 Join Me for a 7-Day Journey into AI Agents & LLM-Powered Applications

If you're curious about building the next generation of AI Agents and want to master how RAG systems, multi-agent frameworks, and LLMs are transforming the way we build, then this course is for you.


Over the next 7 days, I'll walk you through how developers, researchers, and teams are going beyond basic tools and building production-grade, low-latency, and secure AI solutions—and how you can do it too.

Get this free resource

Here's what others have to say about this course

Here's what others have to say about this course

Active hands-on learning

This course builds on live workshops and hands-on projects

Interactive and project-based

You’ll be interacting with other learners through breakout rooms and project teams

Learn with a cohort of peers

Join a community of like-minded people who want to learn and grow alongside you

Join the free course

Join here today

Frequently Asked Questions

A pattern of wavy dots

Join an upcoming cohort

Agentic RAG Systems & Multi-Agent Architectures: Developers Edition

2025-cohort-3

$1,500

Dates

July 12—Aug 24, 2025

Payment Deadline

July 11, 2025
Get reimbursed

$1,500

4.8 (54)

·

7 Weeks