Audit date: 2026-05-19
This repository is a small GitHub Pages static site that publishes a single oral-health heatmap and explanatory narrative for the Tsimane population of Bolivia. The site is centered on one visualization asset (oral-health-heatmap.png), one page of written interpretation (index.md), and lightweight Jekyll/Minima configuration for deployment through GitHub Pages.
- Core content files exist:
README.md,index.md,_config.yml,oral-health-heatmap.png, andassets/css/style.scss.css. - The site content includes descriptive alt text for the primary visualization.
- A committed
Gemfileand.ruby-versionpin the GitHub Pages toolchain for local builds. Figure2-Teeth_v4.1.pdfandscripts/render_heatmap.pyprovide a reproducible source-to-raster path fororal-health-heatmap.png.scripts/check_homepage.pyprovides a non-UI smoke test for the rendered homepage.- The Jekyll config excludes
vendor/, so the bundled gem tree is kept out of the public site build. - The validate workflow uses Node 24-compatible
actions/checkout@v6andactions/setup-python@v6. README.CRAWL.mdis explicitly documented as a retained historical artifact indocs/historical-artifacts.md.- The heatmap provenance boundary is explicit: this repo supports PDF-to-PNG regeneration, but the raw source dataset is not published here.
- A repo-local validation command now checks required files, local links, key config values, and tracked OS junk.
- The live GitHub Pages deployment returned HTTP 200 and served the expected page content in this session.
- Provenance refinement branch QA passed on 2026-05-19:
docs/heatmap-provenance.mdclearly states that the raw/analysis-ready source dataset is not published, distinguishes PDF/figure-to-PNG regeneration from full data-to-figure regeneration, and documents the Windows/MiKTeXpdftoppmtemp/log permission blocker. python scripts/validate_repo.pypassed on 2026-05-19 with only the known staleREADME.CRAWL.mdwarning.python -m compileall scriptspassed on 2026-05-19.- A local homepage QA snapshot passed
python scripts/check_homepage.pyand was then captured at desktop and mobile widths in a browser-rendered view. - The browser captures showed the heatmap image, page heading, and download link in the expected positions with no visible layout regressions.
- Historical artifact governance passed non-UI QA:
python scripts/validate_repo.py, Python script compilation, policy-doc discoverability checks, and_config.ymlexclusion checks. - Live GitHub Pages QA returned HTTP 200, found the expected title and heatmap, and did not find a
README.CRAWL.htmllink.
| Command | Purpose | Result |
|---|---|---|
python scripts/validate_repo.py |
Repo validation | Pass with warning about stale README.CRAWL.md |
python -m compileall scripts |
Python syntax/import compilation check | Pass |
python scripts/render_heatmap.py |
Heatmap rebuild | Currently blocked locally by Windows/MiKTeX pdftoppm temp/log permission errors |
python scripts/check_homepage.py |
Homepage smoke test | Requires _site/index.html from a Jekyll build, or an explicit rendered HTML path or URL |
Invoke-WebRequest https://mschwar.github.io/data-visualization/ |
Live site verification | 200 OK |
bundle exec jekyll build |
Local Jekyll build | Not run in this environment because Ruby was unavailable at audit time |
The pinned build path is now committed in-repo, so local build instructions are canonical even though this environment did not have Ruby installed when the audit started.
README.md: public-facing repo overview and live site linkindex.md: main GitHub Pages content page_config.yml: GitHub Pages/Jekyll site metadataassets/css/style.scss.css: hand-authored theme overridesoral-health-heatmap.png: primary visualization assetFigure2-Teeth_v4.1.pdf: source PDF for the published heatmapscripts/render_heatmap.py: reproducible heatmap rebuild scriptscripts/check_homepage.py: homepage smoke test scriptdocs/heatmap-provenance.md: provenance notes and rebuild commandscripts/validate_repo.py: lightweight validation spinedocs/agentic-overhaul/2026-05-audit.md: detailed audit and backlogAGENTS.md: contributor and agent operating guide
README.CRAWL.mdcontains a machine-local path (/Users/mschwar/...) and a historical directory crawl snapshot. It is useful as a historical artifact only and should not be treated as canonical documentation.- The repo now references a pinned local Ruby/Jekyll toolchain, so local build instructions are canonical from the repo itself.
- Local GitHub Pages rendering is not reproducible from committed tooling in the current environment.
- Local build execution still depends on having Ruby installed on the host.
- Heatmap rebuild execution is currently blocked in this Windows environment by MiKTeX
pdftoppmpermission errors. - The historical source dataset that fed the original figure is still not published in this repo.
README.CRAWL.mdremains as a historical artifact but is excluded from the public site.
- Track the follow-on fixes listed in docs/agentic-first-buildout-plan.md.
- Keep historical artifacts documented in docs/historical-artifacts.md.