Skip to content

Latest commit

 

History

History
235 lines (173 loc) · 13.2 KB

File metadata and controls

235 lines (173 loc) · 13.2 KB

⚙️ Workflow Audit Template — Agentic Run Self-Assessment

📌 Template Instructions — Copy to analysis/daily/$ARTICLE_DATE/$SUBFOLDER/workflow-audit.md. Produced at the end of Pass 2 as a structured self-audit of how the 7-module prompt pipeline was executed. See per-artifact-methodologies.md §workflow-audit.

🎯 Purpose — Transparent record of workflow execution: which phases ran, which MCP tools were called, which rules of ai-driven-analysis-guide.md were satisfied, and where the run fell short. Combined with reference-analysis-quality.md it is the two-artifact pair that answers "was this a good run?".

🔄 Tradecraft Context

This artifact is produced at the end of Pass 2 to verify that the workflow followed the repository's full analysis contract before article publication. It documents actual execution, not intentions: prompt modules run, MCP/tool usage, validation gates reached, deviations from standard procedure, and whether the required AI FIRST second-pass improvement was genuinely completed.

Use this template as an evidence-backed self-audit of tradecraft quality. Record where the run met the required methodology, where shortcuts or skips occurred, and what risks those deviations introduce for analytical completeness, reproducibility, or policy compliance. If fast paths, partial runs, or fallback behaviours were used, state them explicitly and assess their impact.


📐 Template Contract — every fill of this template MUST satisfy this row.

Slot Value
Owning methodology per-artifact-methodologies.md
Owning gate check Supplementary (Tier-C mandatory) — see 05-analysis-gate.md
Required inputs 7-module prompt pipeline run record
Horizon band per-run (per scripts/horizon-context.ts)
Output family Operational Supplementary
Aggregation order appended (alphabetical, after canonical block) (see scripts/render-lib/aggregator/order.ts)
Reader Intelligence Guide row generated from workflow-audit.md (see scripts/render-lib/aggregator/reader-guide.ts)
Canonical evidence anchor | claim | evidence (dok_id / vote / MP intressent_id / primary-source URL) | retrieved_at | confidence | — every analytical claim row uses this schema.

Cross-reference: README.md §Template ↔ Methodology ↔ Gate-Check Matrix.

📋 Document Metadata

articleType: [REQUIRED]
runId: [REQUIRED]
date: [REQUIRED: YYYY-MM-DD]
analysisPhase: workflow-audit
confidenceLevel: [REQUIRED: HIGH / MEDIUM / LOW]
rulesAudited: [REQUIRED: integer]
complianceRate: [REQUIRED: 0-100]
passTwoCompleted: [REQUIRED: true/false]

Confidence Compliance


1️⃣ Prompt Module Execution Summary

%%{init: {"theme":"dark","themeVariables":{"primaryColor":"#1565C0","primaryTextColor":"#ffffff","lineColor":"#90CAF9","secondaryColor":"#2E7D32","tertiaryColor":"#FF9800","fontFamily":"Inter, Helvetica, Arial, sans-serif"}}}%%
flowchart LR
    M00["00 — base contract"] --> M01["01 — bash & shell"]
    M01 --> M02["02 — MCP access"]
    M02 --> M03["03 — data download"]
    M03 --> M04["04 — analysis pipeline (Pass 1 + Pass 2)"]
    M04 --> M05["05 — analysis gate"]
    M05 --> M06["06 — article generation"]
    M06 --> M07["07 — commit & PR"]
Loading

Annotate each node ✅ / ⚠️ / ❌.

Module Status Wall-clock Notes
00-base-contract.md […] [min]
01-bash-and-shell-safety.md […] [min]
02-mcp-access.md […] [min]
03-data-download.md […] [min] fast-path SKIP_ANALYSIS = [true/false]
04-analysis-pipeline.md — Pass 1 […] [min] floor [standard 15 / deep 20 / comprehensive 25]
04-analysis-pipeline.md — Pass 2 […] [min] floor [standard 7 / deep 10 / comprehensive 12]
05-analysis-gate.md […] [min]
06-article-generation.md […] [min]
07-commit-and-pr.md […] [min]

Total runtime[HH:MM:SS]. Under 45 min → insufficient iteration (AI-FIRST violation). Over 70 min → likely infinite loop, investigate.


2️⃣ Phase-Checkpoint Log

Checkpoint label Time Artifacts snapshotted Recovery used?
phase-04-pass1 [ts] [#] [y/n]
phase-04-pass2 [ts] [#] [y/n]
phase-05-gate [ts] [y/n]
phase-06-article [ts] [y/n]
phase-07-commit [ts] [y/n]

Missing checkpoint → ⚠️ flag, document cause.


3️⃣ Core-Principle Compliance Audit

Audit against the 11 core principles of ai-driven-analysis-guide.md. Mark ✅ / ⚠️ / ❌ with evidence.

# Principle Status Evidence
1 Read all methodologies before writing […] [one tool call per file logged]
2 Read all 23 templates before writing […] […]
3 Pass 1 produces all 23 artifacts […] [manifest of files created]
4 Pass-1 snapshot copied to pass1/ […] [ls pass1/ output]
5 Pass 2 improves every artifact […] [diff summary from cross-run-diff if same-type]
6 Evidence standard (dok_id / vote / named actor / URL + Admiralty) […] [spot-check table below]
7 WEP + horizon on every forecast […] [sample]
8 Party-neutrality arithmetic done […] [reference-analysis-quality §4]
9 ≥ 10 SATs applied […] [methodology-reflection §SATs]
10 methodology-reflection.md ICD 203 audit complete […] [link]
11 DIW tiering coherent (L1/L2/L2+/L3) […] [significance-scoring §rank]

Evidence spot-check (Principle 6)

Pick 5 random P0/P1 claims from synthesis-summary.md. Each must resolve:

# Claim (1 line) dok_id / vote / actor Admiralty ✅/❌
1 [REQUIRED] [REQUIRED] [A1/B2/…] […]
2
3
4
5

Compliance rate[#]/11 = [%] (target: 100%).


4️⃣ Deviations & Causes

For every ⚠️ or ❌ above, document:

Deviation 1

Principle / module[REQUIRED]
What happened[1-sentence fact]
Cause[data / time / MCP / prompt-ambiguity / human-waiver]
Mitigation in this run[REQUIRED]
Next-run follow-up[concrete action]

(Repeat for each deviation.)


5️⃣ Time-Budget Attribution

Module Target Actual Variance Verdict
03 — download 3 min [min] [±] […]
04 — Pass 1 ≥ 20 min [min] [±] […]
04 — Pass 2 ≥ 10 min [min] [±] […]
05 — gate 2 min [min] [±] […]
06 — article 10 min [min] [±] […]
07 — commit 2 min [min] [±] […]

AI-FIRST enforcement — Pass 2 < target ⇒ automatic ❌ on this row even if artifacts pass the line-count floor.


6️⃣ Tool-Call Histogram

Top 10 tools by call count:

Rank Tool Calls Avg latency Notes
1 [REQUIRED] [#] [ms]

Any tool with > 50 calls → flag potential inefficiency.


7️⃣ Next-Run Action List

  1. [REQUIRED — concrete action executable by next run]
  2. [REQUIRED]
  3. [REQUIRED]

These flow into the next run's methodology-reflection.md §Improvements section.


🔗 Cross-References


Template version: v1.2 · Last updated: 2026-04-25


✅ Pass-2 Self-Audit Checklist (v4.4 — required)

Purpose: AI-FIRST principle requires a Pass-2 read-back-and-improve. After producing this artifact in Pass 1, re-read it end-to-end and verify each item below. Document any remediation in methodology-reflection.md §"Pass-2 audit log". Any unchecked ❌ box at the end of Pass 2 forces a Pass-3 rewrite of the affected section.

  • Tradecraft anchors honoured — F3EAD stage matches the artifact's role; PIRs declared in the §Tradecraft Context block are actually addressed in the body; Admiralty grades attached to every external source; WEP band + ODNI confidence on every probabilistic judgement.
  • Source diversity floor met — at least the minimum number of independent MCP sources required by the artifact's tradecraft block are cited; single-source claims are explicitly labelled [SINGLE-SOURCE — corroboration pending].
  • Evidence specificity — every quantified claim cites a dok_id (Riksdag), an SCB / IMF dataflow code, or a named external source with date; no "according to data" / "studies show" hand-waves.
  • Named-actor discipline — every political claim names ≥ 1 person (party + role + dated act/quote) or labels the absence ([diffuse — no named actor]).
  • Counter-narrative present — at least one explicit competing hypothesis, dissent quote, or framed objection appears in the body; "no opposition recorded" is itself a finding to label, not silence.
  • Election 2026 lens applied — the §"Election 2026 Implications" subsection (or equivalent) addresses electoral salience, coalition pressure, and forward indicators; not boilerplate.
  • No illustrative content shipped as fact — every [REQUIRED] placeholder is filled OR removed; every Example: block is clearly fenced or removed; no fabricated dok_id, vote count, or quote leaks into the final artifact.
  • Cross-references resolve — every [link](file.md) in this artifact points to a file that exists in the run folder (analysis/daily/$ARTICLE_DATE/$SUBFOLDER/) or to a methodology / template under analysis/.
  • Mermaid renders — every fenced ```mermaid block parses (no missing class definitions, no orphan nodes, no >40-node graphs that overflow viewport on mobile).
  • Line-floor check — artifact length ≥ the per-artifact floor in reference-quality-thresholds.json; shorter artifacts trigger Pass-2 rewrite, never a [truncated] note.