Skip to content

OMOP samples#107

Draft
ofermend wants to merge 3 commits into
malloydata:mainfrom
ofermend:pr/2-omop-sample
Draft

OMOP samples#107
ofermend wants to merge 3 commits into
malloydata:mainfrom
ofermend:pr/2-omop-sample

Conversation

@ofermend
Copy link
Copy Markdown

@ofermend ofermend commented Jun 3, 2026

What this adds

  • A new OMOP sample showing how to analyze observational health data in the OMOP Common Data Model using Malloy. OMOP is the dominant open standard (maintained by OHDSI) for standardizing electronic health records, used across hundreds of academic medical centers and research networks. The sample is built around a generic semantic model (omop.malloy) that conforms to OMOP CDM v5.3 and v5.4.
  • We use here the synthea-covid19-10k dataset (~10,700 synthetic patients, OMOP CDM v5.3) which uses Synthetic data (no PHI).

The model captures some OMOP-specific querying challenges Malloy handles well:

  • Symmetric aggregates: counting conditions, drugs, visits, and procedures per patient in one query without fan-out errors.
  • Reusable vocabulary joins: the concept dictionary reused under plain-English aliases (gender, drug_concept, condition_concept, race, ethnicity) so queries ask for readable names instead of numeric codes.
  • Named views: cohort logic and demographic breakdowns defined once and reused across analyses.

The specific dataset has no laboratory results (measurement/observation tables are empty) and unpopulated drug dosing fields, so lab/vital and prescription-length analyses are out of scope. Noted in the README.

ofermend added 3 commits June 1, 2026 13:08
Signed-off-by: Ofer Mendelevitch <ofermend@gmail.com>
Add baseline characteristics, cohort outcomes/follow-up, comorbidity
burden, and healthcare utilization notebooks. Switch the data source to
the Synthea COVID-19 dataset (omop_synthea_covid.malloy), update the
shared omop.malloy model and existing notebooks accordingly, and remove
the standalone measurement notebook and build_parquet.py script.

Signed-off-by: Ofer Mendelevitch <ofermend@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant