Production-Ready Systems with LLMs and Agents: An Intensive for Engineers

Ehsan Gazar

Principal Engineer | AI & System Design

Build LLM and agent systems that survive real traffic, cost, and failure.

You can wire up an LLM and demo an agent that wows the room. Then it meets production, and everything you weren't taught shows up at once: costs spiral, latency balloons, the model does something confidently wrong in front of a user, and you have no evals to catch it and no traces to debug it. The demo was the easy part.

If you're the engineer now expected to ship AI features, that gap is yours to own, and the field moves faster than anyone can teach it. So you improvise, lean on framework tutorials that stop at the happy path, and quietly hope it holds.

This intensive teaches the part the tutorials skip: the architecture decisions that make LLM and agent systems survive real traffic, real cost ceilings, and real failure. Where to draw the boundary between code and model. How to bound cost and latency. How to design for non-determinism, evals, observability, and human oversight.

You leave with production-grade artifacts you can use at work on Monday: an agent architecture doc, a cost and latency budget, an eval harness plan, and a failure-mode runbook. Not another demo. A system that holds.

What you’ll learn

Master the decisions that take AI agents from demo to production, and become the engineer your team trusts with anything LLM.

  • Decide what to hand the LLM and what to keep in deterministic code, the highest-leverage choice in any agent system

  • Spot the tasks where a model adds risk without adding value, and replace them with plain logic.

  • Design prompts and tool interfaces as narrow contracts, so the model's job stays small and testable.

  • Set a per-request cost and latency budget, then design the system to live within it

  • Apply caching, batching, and model routing to cut spend without losing quality

  • Right-size the model per task instead of defaulting to the biggest one everywhere

  • Build retries, timeouts, and fallbacks so a slow or failing model never stalls the system

  • dd graceful degradation paths for when the model is wrong, unsure, or unavailable

  • Contain non-determinism with validation and guardrails before output reaches a user

  • Know when to use tool calls, planning loops, memory, or multiple coordinated agents.

  • Recognize when a single well-scoped agent beats a complex multi-agent design

  • Map each pattern to its failure modes so you choose with eyes open, not by hype

  • Build an eval harness that scores changes before they ship, not after users complain

  • Combine offline test sets with online signals to catch drift and silent regressions

  • Turn a vague "it feels worse" into measurable quality gates in your pipeline

  • Trace every LLM call so you can debug what the system did, not what you assumed

  • Watch cost, latency, and quality on dashboards that surface problems early

  • Place human oversight and prompt versioning where they actually reduce risk

Learn directly from Ehsan

Ehsan Gazar

Ehsan Gazar

Principal Engineer, 500+ mentees, 16 years in production

See all products from Gaz

Who this course is for

  • A product or backend engineer suddenly asked to ship LLM features, who can build a demo but hasn't taken one to production before.

  • A senior or staff engineer responsible for AI systems in production, who wants real rigor around cost, failure, evals, and architecture.

  • A tech lead or architect evaluating agent patterns for their team, who needs to choose designs that scale and won't break under load.

What's included

Ehsan Gazar

Live sessions

Learn directly from Ehsan Gazar in a real-time, interactive format.

Lifetime access

Go back to course content and recordings whenever you need to.

Community of peers

Stay accountable and share insights with like-minded professionals.

Certificate of completion

Share your new skills with your employer or on LinkedIn.

Hands-on projects on your own system

Apply every lesson to a real LLM or agent system you bring. You leave with five production-grade artifacts: a cost & latency budget, an agent architecture and threat-model doc, an eval harness plan, a failure-mode runbook, and a full system design document.

Maven Guarantee

Your purchase is backed by the Maven Guarantee.

Course syllabus

12 live sessions • 5 projects

Week 1

Jul 13—Jul 19

    Jul

    14

    Why demos die in production, and when you even need an agent

    Tue 7/146:30 PM—8:00 PM (UTC)

    Jul

    16

    The deterministic / model boundary

    Thu 7/166:30 PM—8:00 PM (UTC)

Week 2

Jul 20—Jul 26

    Jul

    21

    Context engineering: what goes in the window, and why cramming hurts

    Tue 7/216:30 PM—8:00 PM (UTC)

    Jul

    23

    Workshop: design your context pipeline

    Thu 7/236:30 PM—8:00 PM (UTC)

    Project 1 — Context & Retrieval Design

    1 item

Schedule

Live sessions

3 hrs / week

Two 90-minute live sessions each week (Tuesdays + Thursdays, 7:30-9:00pm London). All sessions recorded and available in your portal.

    • Tue, Jul 14

      6:30 PM—8:00 PM (UTC)

    • Thu, Jul 16

      6:30 PM—8:00 PM (UTC)

    • Tue, Jul 21

      6:30 PM—8:00 PM (UTC)

Projects

2 hrs / week

Build that week's production-grade artifact on the real system you bring.

Async content

1 hr / week

Short pre-reads and reference material between live sessions.

Frequently asked questions

Maven for Teams

Reimbursement

Get your company to pay

Everything L&D needs: email template, receipts, and certificate of completion.

Get reimbursed

Team discount

Learn with your teammates

Save 20%+ when 2 or more teammates enroll in the same cohort.

Save 20%+ with a team

Private cohort

Run a cohort for your org

A dedicated cohort with a custom schedule and curriculum, tailored to your team.

Book a private cohort

$1,500

USD

Jul 13Aug 23
Enroll