2 Weeks
·Cohort-based Course
Build analytics for human-behavior surveys: clean data, test claims, train logistic and trees, present a decision brief.
This course is popular
5 people enrolled last week.
2 Weeks
·Cohort-based Course
Build analytics for human-behavior surveys: clean data, test claims, train logistic and trees, present a decision brief.
This course is popular
5 people enrolled last week.
Course overview
Most orgs collect surveys but stop at pretty charts. Decisions need tested claims and interpretable models: “Does stress at work predict care-seeking?” “Which factors most move satisfaction?” This session shows a repeatable path from messy responses to evidence you can present with confidence—fast enough for busy analysts and PMs, rigorous enough for research reviews.
✅ No prior coding required—start at true zero
🤖 AI-assisted coding—prompt → runnable code
🧩 Guided prompts—what to ask AI and why
🧪 Human-behavior datasets—clean synthetic core + optional real-survey snapshot
🛠️ Reusable templates—notebooks you can keep
Work with a real-world-style survey (Sleep Health & Lifestyle) to turn messy data into decisions—without a coding background. You’ll guide AI to draft clean R/Python notebooks, test claims correctly, and build practical models (logistic + decision tree). We focus on evaluation, thresholds, fairness, and clear reporting—so you leave with a 5-slide Decision Brief and a reproducible notebook.
Objective: build a repeatable, AI-assisted, evidence-based workflow for human-behavior data—prompting and verifying code, diagnosing errors, and communicating results clearly.
(Optional Week-1 add-on: a tiny OSMI snapshot for real-survey “reality check.”)
📚 What we’ll cover (from messy to actionable)
📦 Week 1 — Prep & Theory (Async + Office Hours)
Light, self-paced foundations + two live setup clinics so everyone is bootcamp-ready.
🧩 Module A — Environment & Preflight
Use Google Colab (Python) and RStudio on Posit Cloud (R); organize project folders
Starter notebook tour and ✅ Readiness check (imports, data access via Drive / Posit Cloud)
📊 Module B — Quantifying One Variable
Center (mean/median), spread (SD/IQR), shape (skew/kurtosis), percentiles
Pick the right plot: histogram/box/violin/bar
Missingness & data types; tidy columns
🧭 Module C — Relationships & Correlation
Association ≠ causation; Simpson’s paradox (quick)
Pearson vs Spearman; nonlinearity/outliers
Right plots; stratify by BMI/Stress tier
🧪 Module D — Hypothesis Tests & Effects
Choose χ², t-test, or ANOVA (assumptions)
Effect sizes (risk diff, odds ratio) + CIs
“Claim → test → effect → limitation” memo
🧠 Module E — Logistic Regression (concepts)
Intuition: log-odds → probabilities (S-curve)
Target = HasSleepCondition; leakage pitfalls; one-hot BMI/Gender
Interpret coefficients as odds ratios; ROC/AUC, PR, calibration
🌳 Module F — Decision Trees (concepts)
Splits/impurity; depth caps; overfitting
Interpreting path rules; when trees vs logistic
Delivery format: Live coding in Python on Google Colab. R provided as recorded RStudio tutorials with full project files and a Posit Cloud fallback (RStudio in your browser—no install, just click the link and make a copy). We will use ChatGPT; other assistants like Copilot, Codeium, or Tabnine are also welcome. Please verify outputs in Colab.
📌 Week-1 Deliverables (due Fri):
Preflight pass (screenshot)
EDA one-pager: 2 plots + 3 insights
Claim & Check memo (≤150 words)
(Optional): first logistic + decision tree run using provided cells)
🛠️ Live Office Hours (setup clinics):
📅 12:00–1:00 PM (PST): Setup & environment troubleshooting
📅 12:00–1:00 PM (PST): Preflight, plotting help, claim selection
🧑💻 Week 2 — Weekend Bootcamp (Live, 2 days)
From one-variable to two-variable to baseline models → thresholded decision.
📅 Day 1 (Sat)
AM (9:00–12:00 PST) — 🔍 One-Variable → Two-Variable
Finalize EDA + tested claim; stratified visuals; peer feedback
PM (1:00–3:00 PST) — 🧮 Logistic Baseline
Train/validation split; odds-ratio table; ROC/AUC; quick calibration
Save mini 📇 model card (data, target, metric, assumptions, risks)
📅 Day 2 (Sun)
AM (9:00–12:00 PST) — 🌲 Decision Tree + Comparison
Depth-capped tree (≤4), path interpretation; compare to logistic
Error analysis & subgroup checks (Gender, Age bands, BMI)
PM (1:00–3:00 PST) — 🎛️ Thresholds → 🧾 Decision Brief
Tune threshold (PR/ROC) → action rule (e.g., sleep-hygiene nudge)
Build & present your 5-slide Decision Brief:
Problem & data
Key chart (from Week 1)
Model results (logistic + tree)
Decision rule & trade-offs
Risks & next steps
🏁 Final Deliverables: two-model baseline + Decision Brief + reproducible notebook
🧰 What you will practice
Data: Sleep Duration, Quality of Sleep, Stress Level, Physical Activity, BMI Category, Systolic/Diastolic BP (parsed), Heart Rate, Daily Steps, Sleep Disorder (+ derived flags/tiers).
Languages: R and Python (parallel materials).
Skills: disciplined EDA; correct test selection; odds-ratio interpretation; model evaluation (ROC/AUC, PR, calibration); error analysis; thresholding; fairness & leakage checks; decision reporting.
🧳 What you will leave with
A methodical workflow: question → data → model → evaluation → decision
A concise portfolio on the Sleep dataset: EDA one-pager, Claim & Check, two-model baseline, and a Decision Brief
The confidence to state assumptions, avoid leakage, and defend recommendations
🎁 What’s included
🎥 Live sessions: interactive instruction with Omid
🛟 Office hours: two Week-1 setup clinics + optional follow-up hour
📼 Lifetime access: recordings & materials
🤝 Community: peer feedback & shared project space
📜 Certificate of completion
✅ Maven Guarantee: full refund up to the halfway point
01
Analysts, researchers, and Product Managers who work with human-behavior survey data and want practical ML results fast.
02
Policy, public health, and government staff who need evidence-based briefs without heavy coding.
03
Data-curious professionals (UX, ops, HR, marketing) ready to move from charts to testable claims and simple models.
but basic stats literacy helps: mean/median, percentages, what a correlation is.
for Colab; ability to open shared links and save a copy to Drive.
We’ll use ChatGPT; Copilot/Codeium/Tabnine also fine
Ship two working models by Sunday.
Train a logistic regression and a decision tree on human behavior dataset; report AUC/PR, calibration, and a confusion matrix with a chosen threshold.
Turn messy survey data into insight.
Produce an EDA one-pager (2 plots + 3 insights) and a Claim & Check memo using the right test (χ² or t-test/ANOVA) with effect size and a confidence interval.
Present a decision your team can act on.
Build a 5-slide Decision Brief: problem, key chart, model results, decision rule with trade-offs, and risks/next steps.
Use AI effectively and safely.
Follow prompt templates to generate code, then verify and debug in Colab; learn leakage checks, seeds/splits, and plain-English reporting.
Leave with reusable assets.
Keep the polished Python (Colab) notebook and the recorded RStudio materials (with Posit Cloud fallback) so you can repeat the workflow at work.

Live sessions
Learn directly from Omid Gheysar in a real-time, interactive format.
Lifetime access
Go back to course content and recordings whenever you need to.
Community of peers
Stay accountable and share insights with like-minded professionals.
Certificate of completion
Share your new skills with your employer or on LinkedIn.
Maven Guarantee
This course is backed by the Maven Guarantee. Students are eligible for a full refund up until the halfway point of the course.
6 live sessions • 6 lessons • 2 projects
Dec
13
🛠️ Office Hour #1 (Tue 12–1 PM PST) — Workspace Setup Session
Dec
14
🛠️ Office Hour #2 (Thu 12–1 PM PST) — Readiness Check & First Plots
Dec
19
🔍 Day 1 AM (Sat 9–12): One → Two-Variable + claim finalize
Dec
19
🧮 Day 1 PM (Sat 1–3): Logistic baseline + model card
Dec
20
🌲 Day 2 AM (Sun 9–12): Decision Tree + comparison + error analysis
Dec
20
🎛️ Day 2 PM (Sun 1–3): Thresholds → 5-slide Decision Brief (presentations)
I’m Omid Gheysar, a data scientist and educator focused on turning messy human-behavior data into decision-ready insights. I hold a PhD in Applied & Computational Mathematics from Simon Fraser University and have published on predictive modeling of infectious diseases, including a paper in Nature. My background spans real public-health analytics—building dashboards, statistical pipelines, and machine-learning models that combine survey, clinical, and genomic data. I’ve taught widely through SFU math courses and workshops, and I design my classes to be practical and tool-agnostic: learn the core ideas, then implement them in R or Python. In this course you’ll move from clear questions to defensible models, with step-by-step checklists, interpretable results, and habits you can apply at work the next day.
Join an upcoming cohort
Public Cohort
$799
Dates
Payment Deadline
4-6 hours per week
Week 1 (Async)
Weekend Bootcamp (Week 2)
Weekly projects
Active hands-on learning
This course builds on live workshops and hands-on projects
Interactive and project-based
You’ll be interacting with other learners through breakout rooms and project teams
Learn with a cohort of peers
Join a community of like-minded people who want to learn and grow alongside you
Sign up to be the first to know about course updates.
Join an upcoming cohort
Public Cohort
$799
Dates
Payment Deadline