| layout | minimal |
|---|---|
| title | Content Pipeline |
| description | How canonical data becomes generated pages and validated output. |
| breadcrumb | Content Pipeline |
| breadcrumb_parent_name | Docs |
| breadcrumb_parent_url | /backlog/docs/ |
| id | doc-002 |
{% include breadcrumbs.html %}
_data/interviews.yml_data/video_assets.yml_data/interview_conferences.yml_data/interview_communities.yml_data/resources.yml_data/transcripts/*.yml
- link alignment:
bin/sync_interview_asset_links.rb - summary derivation:
bin/generate_context_summaries.rb - page generation:
bin/generate_interview_pages.rbbin/generate_video_asset_pages.rbbin/generate_interview_taxonomy_pages.rb
- Generated taxonomy pages now render through:
_templates/generated/interview-taxonomy-index.erb_templates/generated/interview-taxonomy-detail.erb
./bin/pipeline validate runs:
- data uniqueness/integrity checks
- transcript integrity checks (
bin/audit_transcripts.rb) - resources registry validation
- last-modified parity checks
- SEO/canonical and semantic/schema validators
- metadata reporting + HTMLProofer
Use bin/import_transcripts_from_outbox.rb to onboard transcript files from an external directory.
- Run a dry run to generate mapping reports:
ruby ./bin/import_transcripts_from_outbox.rb --source-dir /Volumes/Dock_1TB/vimeo/outbox
- Review:
tmp/transcript-import-report.jsontmp/transcript-import-report.md
- Apply high-confidence mappings:
ruby ./bin/import_transcripts_from_outbox.rb --source-dir /Volumes/Dock_1TB/vimeo/outbox --apply
- Re-run validation:
./bin/pipeline validate
Shortcut wrapper:
./bin/transcripts dry-run --source-dir /Volumes/Dock_1TB/vimeo/outbox./bin/transcripts ingest --source-dir /Volumes/Dock_1TB/vimeo/outbox --auto-commit