Agentic RAG Systems & Multi-Agent Architectures: Developers Edition

4.8 (54)

7 Weeks

Cohort-based Course

Master Advanced Techniques for Building and Optimizing Agentic RAG Systems and Multi-Agent Workflows — Designed for Builders

This course is popular

7 people enrolled last week.

Agentic RAG Systems & Multi-Agent Architectures: Developers Edition

4.8 (54)

7 Weeks

Cohort-based Course

Master Advanced Techniques for Building and Optimizing Agentic RAG Systems and Multi-Agent Workflows — Designed for Builders

This course is popular

7 people enrolled last week.

Hosted by

Hamza Farooq

AI Specialist & Adjunct Professor | 15+ years | Google | UCLA

Hamza Farooq

AI Specialist & Adjunct Professor | 15+ years | Google | UCLA

Previously at

Course overview

The Agent Technical Course: Build and Deploy Production-Grade Gen AI Products

🧠 Masterclass in Agentic RAG & Multi-Agent Deployment

From Naive Bots to Production-Ready Agentic Systems

Welcome to the AI builder’s dojo — a technically rigorous, unapologetically hands-on course designed for builders ready to go beyond plug-and-play wrappers and start designing agent systems that reason, route, and adapt.

This is not a surface-level “course.” It’s a 10-week, live-led deep dive into autonomous systems:

7 instructor-led sessions · 7 interactive office hours · 1 epic Demo Day.

No fluff. No wrappers. Just agents that work.

🧠 Course Breakdown – What You'll Learn

1. Agentic RAG with Routers — Why Naive RAG Breaks

We begin by deconstructing naive RAG — systems that fail under multi-turn, context-rich queries. You’ll build your own agentic retrieval system with intelligent routers, reflection, memory, and reasoning — capable of tool invocation, multi-agent coordination, and smart chunk selection.

🔍 You’ll learn:

-Stateless vs. stateful RAG

-When cosine similarity fails

-Designing context-aware routing logic

-Reflection, ReAsk, and multi-hop search strategies

2. Hosting & Quantizing LLMs — Local + Runpod

Production-ready agents can’t always rely on OpenAI.

You’ll learn to quantize models (GPTQ, GGUF) for speed and cost-efficiency, and deploy them with Ollama locally and RunPod in the cloud, with tools like FastAPI and auto-scaling on demand.

🚀 You’ll learn:

-LLM quantization strategies (4-bit, GGML, QLoRA)

-On-device hosting via Ollama

-Deployment via RunPod or serverless GCP

-Streamed inference + latency benchmarking

3. Semantic Caching — Build It from Scratch

We’ll implement a semantic caching layer from scratch that recognizes similar queries, avoids unnecessary calls to the model, and improves performance over time using vector proximity and feedback loops.

💡 You’ll build:

Feedback loop to train your cache

Cache hit/miss architecture

Semantic distance functions + reranking

Cost-saving and latency benchmarks

4. Knowledge Graphs from Scratch — Text-to-Cypher

Go beyond flat retrieval with structured reasoning using Knowledge Graphs. You’ll implement a graph-based memory layer with Cypher generation from natural language, and use DSPy to guide model outputs toward your schema.

🌐 You’ll learn:

Graph modeling for agent memory

Extracting entities + relations from unstructured text

Generating Cypher queries from prompts

Integrating Neo4j or Memgraph with RAG

5. ReAct Agents — Python & No-Code with n8n

We’ll go deep into ReAct (Reason + Act) — one of the most powerful agent paradigms — and then rebuild it using both Python and no-code tools like n8n. Perfect for teams and workflows where technical + non-technical builders collaborate.

🔧 You’ll build:

Modular ReAct pipelines (tool use, planning, reflection)

Human-in-the-loop agents

No-code agents in n8n connected to APIs + databases

Multi-step workflows with visual orchestration

6. Bringing It All Together — ADK, MCP, A2A, and Guardrails

The final sprint: we combine everything into a production system using Google’s ADK (Agent Development Kit), implement MCP (Modular Cognitive Planning), and create agent-to-agent (A2A) collaboration. You'll also implement industrial-grade guardrails and deploy securely with GCP integrations.

🛡️ You’ll ship:

Multi-agent collaboration workflows

-Safety guardrails using Llama Guard

- Production deployment + monitoring

- A capstone project solving a real-world enterprise task

🧩 Who This Is For

If you’ve already built basic RAG tools or chatbots, and want to level up into true agent systems with autonomy, orchestration, and scale — this course is for you.

-AI Engineers / LLM Builders

-Technical Product Managers

-Devs working on internal tools or AI copilots

-Cloud / MLOps Engineers

🔧 What You’ll Get

✅ 10 weeks of high-signal live instruction

✅ Weekly office hours + async support

✅ A final capstone project + demo day

✅ Real-world projects, not toy demos

✅ Guest lectures from AI builders at Google, Meta, OpenAI

✅ Early access to tools from Traversaal.ai

✅ Lifetime access to content + all future updates

🎓 Prerequisites

Experience building RAG or LLM-based tools

Knowledge of APIs, encoders/decoders, Python

Basic understanding of cloud or serverless deployment

👉 Need a ramp-up? Start with: Building LLM Applications: https://maven.com/boring-bot/ml-system-design

🎯 This Course Is For Builders

Not marketers, not spectators — builders who want to deploy real AI.

This course is for you if you are a:

Machine Learning Engineer exploring different techniques to scale LLM solutions

Researcher, who would like to delve in to various aspects of open-source LLMs

Software Engineer, looking to learn how to integrate AI into their products

Machine Learning Engineer exploring different techniques to scale LLM solutions

Researcher, who would like to delve in to various aspects of open-source LLMs

Software Engineer, looking to learn how to integrate AI into their products

What you’ll get out of this course

Advanced AI Architectures

Understand and implement complex AI architectures, including enterprise-level RAG systems and agentic RAG strategies. You will also dive deep into the Mixture of Experts (MoE) technique and other model merging strategies to enhance the capabilities of your AI systems.

Practical Skills for Deployment

From building semantic caches using GCP and Redis to deploying LLMs on serverless platforms like AWS Bedrock, you'll learn the practical skills to deploy and manage AI applications in real-world scenarios.

Fine-Tuning Expertise

Acquire advanced techniques for fine-tuning LLMs, enabling you to adapt these models to specific tasks or domains and enhance their performance in targeted applications.

Efficient Inference Processing

Explore strategies for exploring and optimizing inference speeds, ensuring that your language models perform efficiently in real-time scenarios, a crucial skill for deploying responsive and scalable applications.

Knowledge of Responsible AI

Understand the importance of ethical AI development and learn to implement guardrails using tools like NeMo, Colang, and Llama Guard to ensure your AI systems align with responsible AI principles.

What’s included

Live sessions

Learn directly from Hamza Farooq in a real-time, interactive format.

Lifetime access

Go back to course content and recordings whenever you need to.

Community of peers

Stay accountable and share insights with like-minded professionals.

Certificate of completion

Share your new skills with your employer or on LinkedIn.

Maven Guarantee

This course is backed by the Maven Guarantee. Students are eligible for a full refund up until the halfway point of the course.

Course syllabus

14 live sessions • 35 lessons • 6 projects

Week 1

Jul 12—Jul 13

Recordings from previous Talks/ Sessions

2 items

Enterprise RAG Solutions with Semantic Caching

6 items

Jul

Session 1: Enterprise RAG

Sat 7/124:00 PM—6:00 PM (UTC)

Week 2

Jul 14—Jul 20

Optimizing and Deploying Large Language Models

10 items

Jul

Session 2: Deploying LLMs with Quantization

Sat 7/194:00 PM—6:00 PM (UTC)

Jul

Office Hours

Wed 7/167:00 PM—7:30 PM (UTC)

Optional

Week 3

Jul 21—Jul 27

Jul

Session 3: Guardrails and AI Safety

Sat 7/264:00 PM—6:00 PM (UTC)

Module: DSPy and Implementing Guardrails for Responsible AI

6 items

Jul

Office Hours

Wed 7/237:00 PM—7:30 PM (UTC)

Optional

Week 4

Jul 28—Aug 3

Holiday Assignment

2 items

Jul

Office Hours

Wed 7/307:00 PM—7:30 PM (UTC)

Optional

Aug

Tech Friday: Deploying LLM Endpoints in Enterprise

Fri 8/17:00 PM—8:00 PM (UTC)

Week 5

Aug 4—Aug 10

Knowledge Graphs

8 items

Aug

Session 4: Knowledge Graphs

Sat 8/94:00 PM—6:00 PM (UTC)

Week 6

Aug 11—Aug 17

Agents: A Deeper Overview

3 items

Aug

Session 5: Agents

Sat 8/164:00 PM—6:00 PM (UTC)

Aug

Tech Friday: Agents in Action

Fri 8/157:00 PM—8:00 PM (UTC)

Aug

Office Hours

Wed 8/136:00 PM—6:30 PM (UTC)

Week 7

Aug 18—Aug 24

Semantic and Agentic RAG

2 items

Model Merging and Fine-tuning Video recordings

2 items

Aug

Office Hours

Wed 8/206:00 PM—7:00 PM (UTC)

Aug

Session 6: Agentic RAG and Semantic Chunking

Sat 8/234:00 PM—6:00 PM (UTC)

Post-course

Demo Day

0 items

Sep

Demo Day

Sat 9/134:00 PM—6:00 PM (UTC)

4.8 (54 ratings)

What students are saying

Meet your instructor

Hamza Farooq

I am the founder of Traversaal.ai, an LLM-based startup dedicated to creating scalable, customizable, and cost-efficient language model solutions for enterprises.

With over 15 years of experience in machine learning, my journey has spanned three continents and seven countries, covering a diverse range of industries such as tech, telecommunications, finance, and retail.

As a former Senior Research Manager at Google and Walmart Labs, I have led data science and machine learning teams, focusing on optimization, natural language processing, recommender systems, and time series forecasting.

I am also an adjunct professor at UCLA, and Instructor for Stanford Continuing Studies where I bridge the gap between academic theory and real-world AI applications.

Additionally, I frequently speak at conferences and conduct training sessions, sharing insights on large language models, deep learning, and cloud computing.

Join an upcoming cohort

Agentic RAG Systems & Multi-Agent Architectures: Developers Edition

Cost

$1,500

Upcoming

2025-cohort-3

Dates

July 12—Aug 24, 2025

Payment Deadline

July 11, 2025

2025-cohort-3

$1,500

Dates

July 12—Aug 24, 2025

Payment Deadline

July 11, 2025

Get reimbursed

Course schedule

4-6 hours per week

Saturdays
9:00 - 11:00am PT
Virtual Class
Weekly projects
2-3 hours per week
Work in teams to build solutions, this requires engagement with other team members

Free resource

🚀 Join Me for a 7-Day Journey into AI Agents & LLM-Powered Applications

If you're curious about building the next generation of AI Agents and want to master how RAG systems, multi-agent frameworks, and LLMs are transforming the way we build, then this course is for you.

Over the next 7 days, I'll walk you through how developers, researchers, and teams are going beyond basic tools and building production-grade, low-latency, and secure AI solutions—and how you can do it too.

Get this free resource

Here's what others have to say about this course

Active hands-on learning

This course builds on live workshops and hands-on projects

Interactive and project-based

You’ll be interacting with other learners through breakout rooms and project teams

Learn with a cohort of peers

Join a community of like-minded people who want to learn and grow alongside you

Join the free course

Join here today

Frequently Asked Questions

Join an upcoming cohort

Agentic RAG Systems & Multi-Agent Architectures: Developers Edition

Cost

$1,500

Upcoming

2025-cohort-3

Dates

July 12—Aug 24, 2025

Payment Deadline

July 11, 2025

2025-cohort-3

$1,500

Dates

July 12—Aug 24, 2025

Payment Deadline

July 11, 2025

Get reimbursed

$1,500

4.8 (54)

7 Weeks