Skip to content

feat: add demo review artifact generator#60

Merged
abrichr merged 6 commits into
mainfrom
feat/demo-review-artifacts
Mar 2, 2026
Merged

feat: add demo review artifact generator#60
abrichr merged 6 commits into
mainfrom
feat/demo-review-artifacts

Conversation

@abrichr
Copy link
Copy Markdown
Member

@abrichr abrichr commented Mar 2, 2026

Summary

  • Adds scripts/generate_demo_review.py that generates markdown artifacts for reviewing the demo pipeline output
  • Generates docs/demo_review.md with thumbnail screenshots, text-only vs VLM comparison tables, and collapsible step-by-step details
  • Uses relative image paths for GitHub rendering
  • Includes generated output for the 04d9aeaf LibreOffice Calc task (21 steps, 42 thumbnails)

Usage

python scripts/generate_demo_review.py \
    --recording waa_recordings/04d9aeaf-7bed-4024-bedb-e10e6f00eb7f-WOS \
    --text-demo demo_prompts/04d9aeaf-7bed-4024-bedb-e10e6f00eb7f-WOS.txt \
    --vlm-demo demo_prompts_vlm/04d9aeaf-7bed-4024-bedb-e10e6f00eb7f-WOS.txt \
    --output docs/demo_review.md

Test plan

  • Verify docs/demo_review.md renders correctly on GitHub with thumbnails
  • Verify collapsible <details> sections work
  • Verify comparison table shows text-only vs VLM side by side

🤖 Generated with Claude Code

abrichr and others added 4 commits March 2, 2026 01:09
Adds scripts/generate_demo_review.py that generates markdown with
thumbnail screenshots, comparison tables, and collapsible step details
for reviewing the demo pipeline output.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Match run_dc_eval.py's _setup_eval_proxy pattern: try systemctl restart
socat-waa-evaluate.service first (auto-restarts on failure), fall back
to legacy nohup for older VMs. Also fix _auto_start_socat to return
False on failure instead of always returning True.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Remove collapsed <details> sections — all steps visible by default
- Add full-resolution image copies when originals are available
- Thumbnails link to full-res versions (clickable)
- Increase default thumbnail width from 400 to 600px
- Skip resize if source is already smaller than target width

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Restore full-res 1280x720 originals to docs/artifacts/full/ and
regenerate docs/demo_review.md with expanded layout (no collapsed
sections), 600px thumbnails linking to full-res versions.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@abrichr abrichr force-pushed the feat/demo-review-artifacts branch from 8109a74 to 499927e Compare March 2, 2026 06:11
@abrichr abrichr merged commit caf0311 into main Mar 2, 2026
1 check passed
@abrichr abrichr deleted the feat/demo-review-artifacts branch March 2, 2026 06:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant