feat: add demo review artifact generator #60
Merged
Adds scripts/generate_demo_review.py, which generates markdown with thumbnail screenshots, comparison tables, and collapsible step details for reviewing the demo pipeline output.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Match run_dc_eval.py's _setup_eval_proxy pattern: try systemctl restart socat-waa-evaluate.service first (the service auto-restarts on failure), then fall back to the legacy nohup launch for older VMs. Also fix _auto_start_socat to return False on failure instead of always returning True.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
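The restart-then-fallback behaviour described in this commit can be sketched roughly as follows. This is a minimal illustration, not the code from the PR: the function name `start_eval_proxy`, the injectable `run` callable, and the `legacy_cmd` parameter are all hypothetical, and the real legacy nohup command is not shown in this conversation, so it is left for the caller to supply.

```python
import subprocess
from typing import Callable, Sequence

# Assumed service name, taken from the commit message above.
SERVICE = "socat-waa-evaluate.service"


def _run(cmd: Sequence[str]) -> int:
    """Run a command and return its exit code (non-zero on any launch error)."""
    try:
        return subprocess.run(list(cmd), capture_output=True, timeout=30).returncode
    except (OSError, subprocess.TimeoutExpired):
        return 1


def start_eval_proxy(
    legacy_cmd: Sequence[str],
    run: Callable[[Sequence[str]], int] = _run,
) -> bool:
    """Try the systemd unit first, then the legacy command.

    Returns True only if one of the two attempts succeeded, so callers
    can tell a real failure apart from a successful start.
    """
    # Preferred path: restart the systemd-managed service.
    if run(["systemctl", "restart", SERVICE]) == 0:
        return True
    # Fallback for older VMs without the unit: caller-supplied legacy command.
    if run(list(legacy_cmd)) == 0:
        return True
    # Both attempts failed: report it instead of claiming success.
    return False
```

Passing `run` as a parameter keeps the sketch testable without a VM; whether the real helper needs `sudo` or a different timeout is not stated here.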
- Remove collapsed <details> sections; all steps are visible by default
- Add full-resolution image copies when originals are available
- Thumbnails link to full-res versions (clickable)
- Increase default thumbnail width from 400 to 600px
- Skip resize if source is already smaller than target width
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
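The thumbnail rules in this commit (600px default width, no resize for small sources, thumbnails that link to the full-res copy) can be illustrated with a small sketch. The helper names `thumbnail_size` and `thumbnail_link` are hypothetical and the rounding rule is an assumption; the actual script may compute sizes differently or delegate to an imaging library.

```python
from typing import Optional, Tuple


def thumbnail_size(
    src_w: int, src_h: int, target_w: int = 600
) -> Optional[Tuple[int, int]]:
    """Return the (width, height) to resize to, or None to skip resizing.

    The aspect ratio is preserved. A source that is not wider than the
    target is left untouched, so small images are never upscaled.
    """
    if src_w <= 0 or src_h <= 0 or target_w <= 0:
        raise ValueError("dimensions must be positive")
    if src_w <= target_w:
        return None  # already small enough: skip the resize
    return target_w, max(1, round(src_h * target_w / src_w))


def thumbnail_link(alt: str, thumb_path: str, full_path: str) -> str:
    """Build markdown for a thumbnail image that links to its full-res copy."""
    return f"[![{alt}]({thumb_path})]({full_path})"
```

For example, `thumbnail_size(400, 300)` returns `None`, so a 400px-wide source would be used as-is.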
Restore full-res 1280x720 originals to docs/artifacts/full/ and regenerate docs/demo_review.md with the expanded layout (no collapsed sections) and 600px thumbnails linking to full-res versions.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
8109a74 to 499927e
Summary
- scripts/generate_demo_review.py that generates markdown artifacts for reviewing the demo pipeline output
- docs/demo_review.md with thumbnail screenshots, text-only vs VLM comparison tables, and collapsible step-by-step details

Usage

python scripts/generate_demo_review.py \
  --recording waa_recordings/04d9aeaf-7bed-4024-bedb-e10e6f00eb7f-WOS \
  --text-demo demo_prompts/04d9aeaf-7bed-4024-bedb-e10e6f00eb7f-WOS.txt \
  --vlm-demo demo_prompts_vlm/04d9aeaf-7bed-4024-bedb-e10e6f00eb7f-WOS.txt \
  --output docs/demo_review.md

Test plan

- docs/demo_review.md renders correctly on GitHub with thumbnails
- <details> sections work

🤖 Generated with Claude Code