Production-Ready Systems with LLMs and Agents: An Intensive for Engineers

Ehsan Gazar

Principal Engineer | AI & System Design

Build LLM and agent systems that survive real traffic, cost, and failure.

You can wire up an LLM and demo an agent that wows the room. Then it meets production, and everything you weren't taught shows up at once: costs spiral, latency balloons, the model does something confidently wrong in front of a user, and you have no evals to catch it and no traces to debug it. The demo was the easy part.

If you're the engineer now expected to ship AI features, that gap is yours to own, and the field moves faster than anyone can teach it. So you improvise, lean on framework tutorials that stop at the happy path, and quietly hope it holds.

This intensive teaches the part the tutorials skip: the architecture decisions that make LLM and agent systems survive real traffic, real cost ceilings, and real failure. Where to draw the boundary between code and model. How to bound cost and latency. How to design for non-determinism, evals, observability, and human oversight.

You leave with production-grade artifacts you can use at work on Monday: an agent architecture doc, a cost and latency budget, an eval harness plan, and a failure-mode runbook. Not another demo. A system that holds.

What you’ll learn

Master the decisions that take AI agents from demo to production, and become the engineer your team trusts with anything LLM.

Decide what to hand the LLM and what to keep in deterministic code, the highest-leverage choice in any agent system
Spot the tasks where a model adds risk without adding value, and replace them with plain logic.
Design prompts and tool interfaces as narrow contracts, so the model's job stays small and testable.

Set a per-request cost and latency budget, then design the system to live within it
Apply caching, batching, and model routing to cut spend without losing quality
Right-size the model per task instead of defaulting to the biggest one everywhere

Build retries, timeouts, and fallbacks so a slow or failing model never stalls the system
dd graceful degradation paths for when the model is wrong, unsure, or unavailable
Contain non-determinism with validation and guardrails before output reaches a user

Know when to use tool calls, planning loops, memory, or multiple coordinated agents.
Recognize when a single well-scoped agent beats a complex multi-agent design
Map each pattern to its failure modes so you choose with eyes open, not by hype

Build an eval harness that scores changes before they ship, not after users complain
Combine offline test sets with online signals to catch drift and silent regressions
Turn a vague "it feels worse" into measurable quality gates in your pipeline

Trace every LLM call so you can debug what the system did, not what you assumed
Watch cost, latency, and quality on dashboards that surface problems early
Place human oversight and prompt versioning where they actually reduce risk

Learn directly from Ehsan

Ehsan Gazar

Contact

Principal Engineer, 500+ mentees, 16 years in production

See all products from Gaz

Who this course is for

A product or backend engineer suddenly asked to ship LLM features, who can build a demo but hasn't taken one to production before.
A senior or staff engineer responsible for AI systems in production, who wants real rigor around cost, failure, evals, and architecture.
A tech lead or architect evaluating agent patterns for their team, who needs to choose designs that scale and won't break under load.

What's included

Live sessions

Learn directly from Ehsan Gazar in a real-time, interactive format.

Lifetime access

Go back to course content and recordings whenever you need to.

Community of peers

Stay accountable and share insights with like-minded professionals.

Certificate of completion

Share your new skills with your employer or on LinkedIn.

Hands-on projects on your own system

Apply every lesson to a real LLM or agent system you bring. You leave with five production-grade artifacts: a cost & latency budget, an agent architecture and threat-model doc, an eval harness plan, a failure-mode runbook, and a full system design document.

Maven Guarantee

Your purchase is backed by the Maven Guarantee.

Course syllabus

8 live sessions • 3 projects

Week 1

Oct 5—Oct 11

Oct

Why demos die in production, and when you even need an agent

Tue 10/66:30 PM—7:45 PM (UTC)

Oct

Workshop: decide what the model should never touch

Thu 10/86:30 PM—7:45 PM (UTC)

Week 2

Oct 12—Oct 18

Oct

Why your agent gets worse the longer it runs

Tue 10/136:30 PM—7:45 PM (UTC)

Oct

Workshop: build a context pipeline that survives a long session

Thu 10/156:30 PM—7:45 PM (UTC)

Your context and retrieval design

1 item

Schedule

Live sessions

3 hrs / week

Two 90-minute live sessions each week (Tuesdays + Thursdays, 7:30-9:00pm London). All sessions recorded and available in your portal.

Tue, Oct 6
6:30 PM—7:45 PM (UTC)
Thu, Oct 8
6:30 PM—7:45 PM (UTC)
Tue, Oct 13
6:30 PM—7:45 PM (UTC)

Projects

2 hrs / week

Build that week's production-grade artifact on the real system you bring.

Async content

1 hr / week

Short pre-reads and reference material between live sessions.

Frequently asked questions

Maven for Teams

Reimbursement

Get your company to pay

Everything L&D needs: email template, receipts, and certificate of completion.

Get reimbursed

Team discount

Learn with your teammates

Save 20%+ when 2 or more teammates enroll in the same cohort.

Save 20%+ with a team

Private cohort

Run a cohort for your org

A dedicated cohort with a custom schedule and curriculum, tailored to your team.

Book a private cohort

$1,500

USD

Oct 5—Nov 2

1 more cohort