add walkthrough and result interpretation by arthurmello · Pull Request #806 · pymc-labs/CausalPy

arthurmello · 2026-03-23T21:54:29Z

Updated docs/source/notebooks/staggered_did_pymc.ipynb with a focused Staggered DiD sensitivity walkthrough for PreTreatmentPlaceboCheck.
Adresses #791

review-notebook-app · 2026-03-23T21:54:35Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

github-actions · 2026-03-23T21:54:41Z

👋 Welcome to CausalPy, @arthurmello!

Thank you for opening your first pull request! We're excited to have you contribute to the project. 🎉

Here are a few tips to help your PR get merged smoothly:

✅ Make sure all CI checks pass (tests, linting, type checking)
📝 Run prek run --all-files locally before pushing
📖 Check our Contributing Guide for more details

A maintainer will review your changes soon. Thanks for helping make CausalPy better! 🚀

💼 LinkedIn Shoutout: Once your PR is merged, we'd love to give you a shoutout on LinkedIn to thank you for your contribution! If you're interested, just drop your LinkedIn profile URL in a comment below.

read-the-docs-community · 2026-03-23T22:00:47Z

Documentation build overview

📚 causalpy | 🛠️ Build #32237438 | 📁 Comparing 29718e2 against latest (ccfaf5c)

🔍 Preview build

Show files changed (2 files in total): 📝 2 modified | ➕ 0 added | ➖ 0 deleted

File	Status
404.html	📝 modified
notebooks/staggered_did_pymc.html	📝 modified

codecov · 2026-03-23T22:01:19Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 94.26%. Comparing base (ccfaf5c) to head (29718e2).
⚠️ Report is 86 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #806   +/-   ##
=======================================
  Coverage   94.26%   94.26%           
=======================================
  Files          78       78           
  Lines       12005    12005           
  Branches      699      699           
=======================================
  Hits        11317    11317           
  Misses        496      496           
  Partials      192      192

☔ View full report in Codecov by Harness.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

drbenvincent

Thanks for this! This definitely gets towards closing #791, but I've got a few requests that would take it to the next level. Let me know if you think you can give this a go, fully or partially?

1. Acceptance criterion gap: failure patterns and recommended follow-ups are still too thin

Issue #791 explicitly asks to "Document typical failure patterns and recommended follow-ups." The new section currently stops at:

"If it fails, treat that as a red flag for misspecification or pre-existing divergence ... as a reason to investigate further."

That is directionally correct, but it is still quite generic. It does not yet give readers concrete troubleshooting guidance such as:

what a failure pattern looks like in practice
how to distinguish noisy pre-period estimates from systematic pre-trends
what follow-up actions to take next

Suggested fix:

add a short "If this check fails" subsection with 3-4 concrete bullets
examples could include: inspect the event-study plot for directional pre-trends, check whether pre-treatment periods are sparse, revisit the outcome model / covariates / fixed effects, and optionally compare robustness with PriorSensitivity

My take: until this is added, I would say PR #806 only partially satisfies #791.

2. Minor clarity risk: `PlaceboInTime` is name-dropped even though it is not applicable here

The "Related checks" paragraph says:

"Other pipeline checks (for example placebo-in-time or design-based tests from other experiment classes) are not wired by default for this estimator; use SensitivityAnalysis(checks=[...]) explicitly when applicable."

This is not strictly false because of the "when applicable" qualifier, but it may still mislead readers into thinking PlaceboInTime is a nearby option for Staggered DiD. In the current implementation, PlaceboInTime is only applicable to ITS and Synthetic Control, not StaggeredDifferenceInDifferences.

Suggested fix:

either remove the PlaceboInTime example from this notebook
or rephrase to say that for Staggered DiD the main additional pipeline check to consider is PriorSensitivity, while other checks are method-specific elsewhere in the docs

drbenvincent · 2026-04-13T14:46:18Z

@arthurmello

Rebased onto main to resolve the notebook conflict in staggered_did_pymc.ipynb. The conflict was between the maketables integration (added on main via #773) and the sensitivity walkthrough changes in this PR — the two edits were independent, so the resolution was straightforward: keep both.

Outstanding review feedback from the previous change request still applies. No rush, but are you still able to continue with that?

arthurmello · 2026-04-13T18:17:48Z

@arthurmello

Rebased onto main to resolve the notebook conflict in staggered_did_pymc.ipynb. The conflict was between the maketables integration (added on main via #773) and the sensitivity walkthrough changes in this PR — the two edits were independent, so the resolution was straightforward: keep both.

Outstanding review feedback from the previous change request still applies. No rush, but are you still able to continue with that?

@drbenvincent sorry, haven't really had the time recently, for personal reasons. I think it's best if someone else takes this on, so I won't block its development (not sure when I'll have the time).
Sorry about that!

drbenvincent · 2026-04-13T20:04:18Z

@arthurmello no problem, I understand. Happy if you want to pick something up in the future :)

drbenvincent · 2026-04-15T18:49:00Z

Continued in #846 — rebased onto current main, addressed the two outstanding review items:

Failure patterns: Added a concrete "If this check fails" subsection with 4 troubleshooting steps (directional pre-trends, sparse pre-treatment window, model specification, prior sensitivity comparison).
PlaceboInTime reference: Replaced the misleading bullet with a clear statement that only PreTreatmentPlaceboCheck and PriorSensitivity apply to staggered DiD.

Thanks @arthurmello for the initial implementation!

arthurmello mentioned this pull request Mar 24, 2026

Docs: Add Staggered DiD sensitivity checks walkthrough (follow-up #749) #791

Open

drbenvincent requested changes Apr 1, 2026

View reviewed changes

add walkthrough and result interpretation

29718e2

drbenvincent force-pushed the staggered-did-sensitivity-checks-walkthrough branch from 9643ef3 to 29718e2 Compare April 13, 2026 14:43

drbenvincent mentioned this pull request Apr 13, 2026

docs: ITS sensitivity walkthrough (#788) #821

Open

6 tasks

drbenvincent mentioned this pull request Apr 15, 2026

Docs: Add Staggered DiD sensitivity checks walkthrough #846

Open

4 tasks

drbenvincent closed this Apr 15, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add walkthrough and result interpretation#806

add walkthrough and result interpretation#806
arthurmello wants to merge 1 commit into
pymc-labs:mainfrom
arthurmello:staggered-did-sensitivity-checks-walkthrough

arthurmello commented Mar 23, 2026 •

edited

Loading

Uh oh!

review-notebook-app Bot commented Mar 23, 2026

Uh oh!

github-actions Bot commented Mar 23, 2026

Uh oh!

read-the-docs-community Bot commented Mar 23, 2026 •

edited

Loading

Uh oh!

codecov Bot commented Mar 23, 2026 •

edited

Loading

Uh oh!

drbenvincent left a comment

Uh oh!

drbenvincent commented Apr 13, 2026 •

edited

Loading

Uh oh!

arthurmello commented Apr 13, 2026

Uh oh!

drbenvincent commented Apr 13, 2026

Uh oh!

drbenvincent commented Apr 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

arthurmello commented Mar 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app Bot commented Mar 23, 2026

Uh oh!

github-actions Bot commented Mar 23, 2026

Uh oh!

read-the-docs-community Bot commented Mar 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Documentation build overview

Uh oh!

codecov Bot commented Mar 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

drbenvincent left a comment

Choose a reason for hiding this comment

1. Acceptance criterion gap: failure patterns and recommended follow-ups are still too thin

2. Minor clarity risk: PlaceboInTime is name-dropped even though it is not applicable here

Uh oh!

drbenvincent commented Apr 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

arthurmello commented Apr 13, 2026

Uh oh!

drbenvincent commented Apr 13, 2026

Uh oh!

drbenvincent commented Apr 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

arthurmello commented Mar 23, 2026 •

edited

Loading

read-the-docs-community Bot commented Mar 23, 2026 •

edited

Loading

codecov Bot commented Mar 23, 2026 •

edited

Loading

2. Minor clarity risk: `PlaceboInTime` is name-dropped even though it is not applicable here

drbenvincent commented Apr 13, 2026 •

edited

Loading