Hands-On Multimodal AI: Build Your First Interactive Apps

New · 2 Weeks · Cohort-based Course

Create voice-, vision-, and sensor-powered AI apps in 14 days, with no ML expertise required.

Previously at

MIT research group
Google
Microsoft

Course overview

The Problem We're Solving:

Everyone’s cheering Gemini, GPT-4V, and other multimodal AI demos, yet most tutorials stop at “Hello World.”

You watch demos of AI that can see sketches and understand gestures, but the gap between inspiration and implementation feels insurmountable.

Meanwhile, the market for multimodal AI keeps expanding, and the people who can actually build with it are the ones positioned to benefit.



Why This Course Is Different:

* Expert in your corner: Learn from someone building these systems at Google DeepMind, with live workshops and a Slack hotline to me.

* Hands-on focus: Walk away with 3 functional prototypes built from project guides.

* Best practices: Master tools that work fast and reliably (Google AI Studio and the Gemini API ecosystem) and learn to use frameworks that run on-device (llama.cpp, Ollama).

* Tiny cohort, big accountability: Max 35 students, peer reviews, and a demo day. Get personalized feedback on your work.


In just two weeks, you’ll be shipping portfolio-ready apps that prove you can harness multimodal AI, even on a phone, with no prior experience needed.

Who is this course for

01

Mid-Career Pivoters: Professionals (e.g., in marketing) seeking AI roles who lack build experience, find ML theory dense, and need practical AI skills.

02

Designers/Creatives: UX/UI designers who want AI (voice, vision) in their prototypes, see coding as a barrier, and aim to build rapidly via APIs.

03

Product Managers: PMs who need to grasp multimodal APIs beyond text in order to define and prototype new AI-powered product features (voice, image).

What you’ll get out of this course

Demystify Multimodal AI & Start BUILDING

Go from feeling intimidated by AI to confidently creating your own simple, interactive multimodal applications, using provided templates and bespoke no-code tools that call APIs for vision, voice, and other sensors.

Rapidly Prototype Your AI Ideas

Learn to take a cool idea, perhaps inspired by AI's new ability to understand sketches or analyze video, and quickly build a working proof of concept. Think: a voice-activated note-taker, a gesture-controlled music player, or a tool that describes images aloud.

Add Tangible AI Skills to Your Resume/Portfolio

Showcase concrete, project-based experience with modern multimodal AI, crucial for career pivots or leveling up. You’ll leave with tangible projects and the confidence to keep exploring.

"Speak AI" More Fluently & Design Better Interactions

Understand multimodal AI's real-world capabilities and limits to confidently discuss ideas, evaluate tools, and guide AI projects. Discover why effective interaction design—how vision, voice, and sensors work together—is often more crucial for success than just model accuracy.

What’s included

Live sessions

Learn directly from Stefania Druga in a real-time, interactive format.

Lifetime access

Go back to course content and recordings whenever you need to.

Community of peers

Stay accountable and share insights with like-minded professionals.

Certificate of completion

Share your new skills with your employer or on LinkedIn.

Maven Guarantee

This course is backed by the Maven Guarantee. Students are eligible for a full refund up until the halfway point of the course.

Course syllabus

2 live sessions • 5 lessons • 3 projects

Week 1

Jul 1—Jul 6

    Module 0: Multimodal AI Fast Track Prep

    1 item

    Week 1: Foundation Sprint – Vision & Voice

    • Jul 2

      Kick-Off – Voice & Vision Fundamentals

      Wed 7/2, 11:00 PM–12:00 AM (UTC)

    • Jul 3

      Interactive AI & Voice Integration

      Thu 7/3, 11:00 PM–12:00 AM (UTC)

    4 more items

Week 2

Jul 7—Jul 11

    Week 2: Advanced Integration & Capstone Prototype

    3 items

What people are saying

        I'm excited to gain hands-on experience simplifying AI for non-technical adults and learn to incorporate AI into presentations with verbal commands or motion triggers. The idea of multimodal health tracking is also incredibly appealing!
Daria Dubois

VP, AI Solutions @RNL
        This course is exactly what I need to move my app ideas from just 'discussing concepts' to tangible action. I am excited to learn how to integrate multimodal AI for things like understanding 3D printing or evaluating art.
Blake Harper

Trust & Safety Strategy @Meta

Meet your instructor

Stefania Druga

Join an upcoming cohort

Hands-On Multimodal AI: Build Your First Interactive Apps

Regular

$649

Dates

July 1—12, 2025

Payment Deadline

June 30, 2025

Course schedule

4-6 hours per week

  • Tuesdays & Thursdays

    1:00pm - 2:00pm EST


  • Weekly projects

    2 hours per week


Learning is better with cohorts

Active hands-on learning

This course builds on live workshops and hands-on projects

Interactive and project-based

You’ll be interacting with other learners through breakout rooms and project teams

Learn with a cohort of peers

Join a community of like-minded people who want to learn and grow alongside you
