Improve README showcase visuals

oceanusXXD · oceanusXXD · commit b97acb3fbba4 · 2026-06-07T02:30:03.000+10:00
diff --git a/README.md b/README.md
@@ -41,6 +41,60 @@ Use it when a Python project needs coding assistants to follow the current modul
 | Platform automation | A DevEx team runs the workflow across many Python services | Python API returns structured results and readiness status |
 | Contributor onboarding | New contributors need project-specific implementation rules | Generated Skills and docs describe the repo's working contracts |
 
+## Pipeline
+
+![code2skill pipeline](docs/assets/code2skill-pipeline.svg)
+
+The final product is a repository-owned Skill layer, not a chat transcript. Structural artifacts stay available for review, cost estimation, CI refresh, and readiness checks.
+
+## Example Generated Skills
+
+Generated Skills are source-cited Markdown files under `.code2skill/skills/*.md`. These shortened examples show the kind of output `code2skill` is designed to produce from repository evidence.
+
+<details>
+<summary>Repository analysis pipeline</summary>
+
+```markdown
+# Repository Analysis Pipeline
+
+## Overview
+Use this Skill when changing how code2skill scans a repository, builds evidence, or writes structural artifacts.
+
+## Core Rules
+- Keep `execute_repository(...)` as the orchestration entrypoint. Source: src/code2skill/core.py
+- Resolve dependencies through `ImportGraph` before ranking files or computing affected files. Source: src/code2skill/import_graph.py, src/code2skill/impact.py
+- Treat `project-summary.md`, `skill-blueprint.json`, `report.json`, and `state/analysis-state.json` as review and CI artifacts. Source: src/code2skill/core.py
+
+## Common Flows
+1. Scan candidates and extract source/config summaries.
+2. Build import graph, PageRank, evidence coverage, and blueprint.
+3. Render summary/reference/report artifacts before optional Skill generation.
+```
+
+</details>
+
+<details>
+<summary>Assistant target publishing</summary>
+
+```markdown
+# Assistant Target Publishing
+
+## Overview
+Use this Skill when publishing generated Skills into Codex, Claude Code, Cursor, GitHub Copilot, or Windsurf target files.
+
+## Core Rules
+- Use `adapt` for target publishing; generated target content must stay inside managed blocks or manifest-tracked files. Source: src/code2skill/adapt.py, src/code2skill/capabilities/adapt/targets.py
+- Run `doctor` after adaptation to verify the bundle, Skill plan, generated Skill files, state, and selected target output. Source: src/code2skill/capabilities/adoption_service.py
+- Preserve hand-written target-file content outside the managed block. Source: src/code2skill/capabilities/output_bundle_service.py
+
+## Common Flows
+1. Generate or refresh `.code2skill/skills/*.md`.
+2. Run `code2skill adapt . --target <tool>`.
+3. Run `code2skill doctor . --target <tool>`.
+```
+
+</details>
+
 ## Benchmark
 
 `code2skill` is evaluated on structural evidence extraction before any LLM call. The benchmark compares two simple baselines against the semantic scanner used by the Skill generation pipeline.
diff --git a/README.zh-CN.md b/README.zh-CN.md
@@ -41,6 +41,60 @@
 | 平台自动化 | DevEx 团队跨多个 Python 服务运行同一流程 | Python API 返回结构化结果和 readiness |
 | 开源贡献者 onboarding | 新贡献者改代码前需要项目实现规则 | 生成的 Skills 和 docs 说明仓库的工作契约 |
 
+## 流程图
+
+![code2skill pipeline](docs/assets/code2skill-pipeline.svg)
+
+最终产物是一套可以提交到仓库里的 Skill 层，而不是一段聊天记录。结构化产物会保留下来，用于审阅、成本估算、CI 刷新和 readiness 检查。
+
+## 生成 Skill 示例
+
+生成的 Skill 是 `.code2skill/skills/*.md` 下的 Markdown 文件。下面是基于当前仓库证据整理的缩短示例，展示最终输出应该长什么样。
+
+<details>
+<summary>Repository analysis pipeline</summary>
+
+```markdown
+# Repository Analysis Pipeline
+
+## Overview
+Use this Skill when changing how code2skill scans a repository, builds evidence, or writes structural artifacts.
+
+## Core Rules
+- Keep `execute_repository(...)` as the orchestration entrypoint. Source: src/code2skill/core.py
+- Resolve dependencies through `ImportGraph` before ranking files or computing affected files. Source: src/code2skill/import_graph.py, src/code2skill/impact.py
+- Treat `project-summary.md`, `skill-blueprint.json`, `report.json`, and `state/analysis-state.json` as review and CI artifacts. Source: src/code2skill/core.py
+
+## Common Flows
+1. Scan candidates and extract source/config summaries.
+2. Build import graph, PageRank, evidence coverage, and blueprint.
+3. Render summary/reference/report artifacts before optional Skill generation.
+```
+
+</details>
+
+<details>
+<summary>Assistant target publishing</summary>
+
+```markdown
+# Assistant Target Publishing
+
+## Overview
+Use this Skill when publishing generated Skills into Codex, Claude Code, Cursor, GitHub Copilot, or Windsurf target files.
+
+## Core Rules
+- Use `adapt` for target publishing; generated target content must stay inside managed blocks or manifest-tracked files. Source: src/code2skill/adapt.py, src/code2skill/capabilities/adapt/targets.py
+- Run `doctor` after adaptation to verify the bundle, Skill plan, generated Skill files, state, and selected target output. Source: src/code2skill/capabilities/adoption_service.py
+- Preserve hand-written target-file content outside the managed block. Source: src/code2skill/capabilities/output_bundle_service.py
+
+## Common Flows
+1. Generate or refresh `.code2skill/skills/*.md`.
+2. Run `code2skill adapt . --target <tool>`.
+3. Run `code2skill doctor . --target <tool>`.
+```
+
+</details>
+
 ## 基准测试
 
 `code2skill` 评测的是 LLM 调用前的结构证据抽取能力。这个 benchmark 用两个简单 baseline 对比 Skill 生成流水线使用的语义扫描器。
diff --git a/benchmarks/evaluate_structural_evidence.py b/benchmarks/evaluate_structural_evidence.py
@@ -190,56 +190,86 @@ def code2skill_facts(repo_path: Path) -> set[str]:
 
 def render_svg(report: dict[str, object]) -> str:
     results = report["results"]
-    width = 880
-    height = 330
-    left = 210
-    top = 88
-    chart_width = 610
-    bar_height = 34
-    gap = 52
+    width = 980
+    height = 430
+    left = 250
+    top = 134
+    chart_width = 680
+    bar_height = 32
+    gap = 58
+    gold_total = int(report["gold_total"])
+    result_by_method = {result["method"]: result for result in results}
+    semantic_delta = (
+        result_by_method["code2skill-semantic"]["recall"]
+        - result_by_method["ast-symbols"]["recall"]
+    ) * 100
     colors = {
-        "path-only": "#8a8f98",
-        "ast-symbols": "#3f6d8f",
-        "code2skill-semantic": "#198754",
+        "path-only": "#9ca3af",
+        "ast-symbols": "#4f6f8f",
+        "code2skill-semantic": "#0f766e",
     }
     labels = {
         "path-only": "Path-only baseline",
         "ast-symbols": "AST symbols baseline",
         "code2skill-semantic": "code2skill semantic",
     }
+    tick_values = [0.0, 0.25, 0.5, 0.75, 1.0]
     lines = [
-        '<svg xmlns="http://www.w3.org/2000/svg" width="880" height="330" viewBox="0 0 880 330" role="img" aria-labelledby="title desc">',
+        '<svg xmlns="http://www.w3.org/2000/svg" width="980" height="430" viewBox="0 0 980 430" role="img" aria-labelledby="title desc">',
         '<title id="title">Structural evidence extraction benchmark</title>',
         '<desc id="desc">Gold evidence recall for path-only, AST-symbols, and code2skill-semantic extraction.</desc>',
-        '<rect width="880" height="330" fill="#ffffff"/>',
-        '<text x="36" y="38" font-family="Arial, sans-serif" font-size="24" font-weight="700" fill="#111827">Structural Evidence Extraction</text>',
-        '<text x="36" y="62" font-family="Arial, sans-serif" font-size="13" fill="#4b5563">Gold evidence recall before any LLM call. Higher is better.</text>',
+        '<rect width="980" height="430" fill="#ffffff"/>',
+        '<text x="38" y="44" font-family="Arial, sans-serif" font-size="15" font-weight="700" fill="#111827">A</text>',
+        '<text x="64" y="44" font-family="Arial, sans-serif" font-size="22" font-weight="700" fill="#111827">Structural Evidence Extraction</text>',
+        f'<text x="64" y="70" font-family="Arial, sans-serif" font-size="13" fill="#4b5563">Deterministic benchmark before any LLM call; gold structural facts, n={gold_total}.</text>',
+        f'<text x="64" y="94" font-family="Arial, sans-serif" font-size="12" fill="#0f766e">code2skill recovers all gold facts and improves over the AST-symbol baseline by {semantic_delta:.1f} percentage points.</text>',
     ]
-    for tick in range(0, 6):
-        x = left + tick * chart_width / 5
-        lines.append(f'<line x1="{x:.1f}" y1="78" x2="{x:.1f}" y2="252" stroke="#eef0f3"/>')
+    for value in tick_values:
+        x = left + value * chart_width
+        lines.append(f'<line x1="{x:.1f}" y1="118" x2="{x:.1f}" y2="320" stroke="#e5e7eb" stroke-width="1"/>')
         lines.append(
-            f'<text x="{x - 8:.1f}" y="276" font-family="Arial, sans-serif" font-size="11" fill="#6b7280">{tick / 5:.1f}</text>'
+            f'<text x="{x - 10:.1f}" y="342" font-family="Arial, sans-serif" font-size="12" fill="#374151">{value:.2f}</text>'
         )
+    lines.append(
+        f'<line x1="{left}" y1="118" x2="{left + chart_width}" y2="118" stroke="#111827" stroke-width="1"/>'
+    )
+    lines.append(
+        f'<line x1="{left}" y1="118" x2="{left}" y2="320" stroke="#111827" stroke-width="1"/>'
+    )
+    lines.append(
+        f'<text x="{left + chart_width / 2 - 58:.1f}" y="372" font-family="Arial, sans-serif" font-size="13" fill="#111827">Gold evidence recall</text>'
+    )
     for index, result in enumerate(results):
         method = result["method"]
         y = top + index * gap
         value = result["recall"]
         bar_width = value * chart_width
         lines.append(
-            f'<text x="36" y="{y + 23}" font-family="Arial, sans-serif" font-size="14" font-weight="700" fill="#111827">{labels[method]}</text>'
+            f'<text x="64" y="{y + 22}" font-family="Arial, sans-serif" font-size="14" font-weight="700" fill="#111827">{labels[method]}</text>'
+        )
+        lines.append(
+            f'<rect x="{left}" y="{y}" width="{bar_width:.1f}" height="{bar_height}" fill="{colors[method]}"/>'
         )
         lines.append(
-            f'<rect x="{left}" y="{y}" width="{bar_width:.1f}" height="{bar_height}" rx="4" fill="{colors[method]}"/>'
+            f'<text x="{left + bar_width + 10:.1f}" y="{y + 21}" font-family="Arial, sans-serif" font-size="13" fill="#111827">{value:.3f}</text>'
         )
         lines.append(
-            f'<text x="{left + bar_width + 10:.1f}" y="{y + 23}" font-family="Arial, sans-serif" font-size="13" fill="#111827">{value:.3f} ({result["gold_hits"]}/{result["gold_total"]})</text>'
+            f'<text x="{left - 66}" y="{y + 22}" font-family="Arial, sans-serif" font-size="12" fill="#4b5563">{result["gold_hits"]}/{result["gold_total"]}</text>'
         )
     lines.append(
-        '<text x="36" y="314" font-family="Arial, sans-serif" font-size="12" fill="#6b7280">code2skill captures routes, calls, type references, data-flow, dynamic imports, re-exported symbols, exceptions, and dependency edges.</text>'
+        f'<text x="{left - 80}" y="118" font-family="Arial, sans-serif" font-size="12" font-weight="700" fill="#374151">hits</text>'
+    )
+    lines.append(
+        f'<line x1="{left}" y1="313" x2="{left + chart_width}" y2="313" stroke="#111827" stroke-width="1"/>'
+    )
+    lines.append(
+        '<text x="64" y="404" font-family="Arial, sans-serif" font-size="12" fill="#4b5563">Gold set: roles, imports, routes, calls, type references, models, data-flow, dynamic imports, exceptions, main guards, re-exports, and dependency edges.</text>'
+    )
+    lines.append(
+        '<text x="64" y="388" font-family="Arial, sans-serif" font-size="12" fill="#6b7280">Bars report exact recall on a synthetic fixture repository; higher is better.</text>'
     )
     lines.append("</svg>")
-    return "\n".join(lines)
+    return "\n".join(lines) + "\n"
 
 
 def gold_facts() -> list[str]:
diff --git a/benchmarks/results/structural-evidence-benchmark.json b/benchmarks/results/structural-evidence-benchmark.json
@@ -1,6 +1,6 @@
 {
   "name": "structural-evidence-extraction",
-  "generated_at": "2026-06-06T16:13:09.683771+00:00",
+  "generated_at": "2026-06-06T16:27:56.843435+00:00",
   "gold_total": 45,
   "results": [
     {
diff --git a/docs/assets/code2skill-pipeline.svg b/docs/assets/code2skill-pipeline.svg
@@ -0,0 +1,87 @@
+<svg xmlns="http://www.w3.org/2000/svg" width="980" height="500" viewBox="0 0 980 500" role="img" aria-labelledby="title desc">
+  <title id="title">code2skill repository-to-skill pipeline</title>
+  <desc id="desc">A left-to-right pipeline showing repository input, structural analysis, Skill planning, Skill generation, target adaptation, and readiness validation.</desc>
+  <defs>
+    <marker id="arrow" viewBox="0 0 10 10" refX="9" refY="5" markerWidth="8" markerHeight="8" orient="auto-start-reverse">
+      <path d="M 0 0 L 10 5 L 0 10 z" fill="#374151"/>
+    </marker>
+    <style>
+      .title { font: 700 22px Arial, sans-serif; fill: #111827; }
+      .subtitle { font: 13px Arial, sans-serif; fill: #4b5563; }
+      .stage-title { font: 700 14px Arial, sans-serif; fill: #111827; }
+      .stage-text { font: 12px Arial, sans-serif; fill: #374151; }
+      .small { font: 11px Arial, sans-serif; fill: #6b7280; }
+      .box { fill: #ffffff; stroke: #111827; stroke-width: 1.2; }
+      .soft { fill: #f9fafb; stroke: #d1d5db; stroke-width: 1; }
+      .accent-a { fill: #e0f2fe; stroke: #0f4c81; stroke-width: 1; }
+      .accent-b { fill: #ecfdf5; stroke: #0f766e; stroke-width: 1; }
+      .accent-c { fill: #fff7ed; stroke: #b45309; stroke-width: 1; }
+      .line { stroke: #374151; stroke-width: 1.4; fill: none; marker-end: url(#arrow); }
+      .thin { stroke: #9ca3af; stroke-width: 1; fill: none; }
+    </style>
+  </defs>
+
+  <rect width="980" height="500" fill="#ffffff"/>
+  <text x="42" y="44" class="title">Repository Evidence To Assistant Skills</text>
+  <text x="42" y="70" class="subtitle">The LLM sits behind a structural evidence layer: scan files, measure evidence, generate Skills, validate targets.</text>
+
+  <rect x="42" y="112" width="142" height="92" class="box"/>
+  <text x="62" y="141" class="stage-title">Python repo</text>
+  <text x="62" y="164" class="stage-text">source files</text>
+  <text x="62" y="183" class="stage-text">config files</text>
+  <text x="62" y="202" class="stage-text">git state</text>
+
+  <path d="M 184 158 H 234" class="line"/>
+
+  <rect x="234" y="104" width="162" height="108" class="accent-a"/>
+  <text x="254" y="134" class="stage-title">1. Analyze</text>
+  <text x="254" y="157" class="stage-text">AST extraction</text>
+  <text x="254" y="176" class="stage-text">routes, calls, types</text>
+  <text x="254" y="195" class="stage-text">data-flow, configs</text>
+
+  <path d="M 396 158 H 446" class="line"/>
+
+  <rect x="446" y="104" width="170" height="108" class="accent-b"/>
+  <text x="466" y="134" class="stage-title">2. Build graph</text>
+  <text x="466" y="157" class="stage-text">imports + symbols</text>
+  <text x="466" y="176" class="stage-text">re-export edges</text>
+  <text x="466" y="195" class="stage-text">evidence coverage</text>
+
+  <path d="M 616 158 H 666" class="line"/>
+
+  <rect x="666" y="104" width="170" height="108" class="accent-c"/>
+  <text x="686" y="134" class="stage-title">3. Plan Skills</text>
+  <text x="686" y="157" class="stage-text">blueprint JSON</text>
+  <text x="686" y="176" class="stage-text">project summary</text>
+  <text x="686" y="195" class="stage-text">skill-plan.json</text>
+
+  <path d="M 751 212 V 260" class="line"/>
+
+  <rect x="646" y="260" width="210" height="102" class="box"/>
+  <text x="668" y="290" class="stage-title">4. Generate Skill Markdown</text>
+  <text x="668" y="313" class="stage-text">evidence-cited rules</text>
+  <text x="668" y="332" class="stage-text">workflow checkpoints</text>
+  <text x="668" y="351" class="stage-text">uncertainty kept explicit</text>
+
+  <path d="M 646 311 H 558" class="line"/>
+
+  <rect x="350" y="260" width="208" height="102" class="soft"/>
+  <text x="372" y="290" class="stage-title">5. Adapt targets</text>
+  <text x="372" y="313" class="stage-text">AGENTS.md, CLAUDE.md</text>
+  <text x="372" y="332" class="stage-text">Cursor, Copilot</text>
+  <text x="372" y="351" class="stage-text">Windsurf rules</text>
+
+  <path d="M 350 311 H 260" class="line"/>
+
+  <rect x="72" y="260" width="188" height="102" class="soft"/>
+  <text x="94" y="290" class="stage-title">6. Validate</text>
+  <text x="94" y="313" class="stage-text">doctor readiness</text>
+  <text x="94" y="332" class="stage-text">report.json</text>
+  <text x="94" y="351" class="stage-text">incremental state</text>
+
+  <path d="M 530 212 V 232 H 166 V 260" class="thin"/>
+  <text x="198" y="235" class="small">reverse dependency state powers incremental refresh</text>
+
+  <rect x="42" y="410" width="896" height="42" class="soft"/>
+  <text x="64" y="436" class="stage-text">Final product: repository-owned Skill files and target assistant instructions, with structural artifacts available for review and CI.</text>
+</svg>
diff --git a/docs/assets/structural-evidence-benchmark.svg b/docs/assets/structural-evidence-benchmark.svg
diff --git a/tests/test_benchmark_artifacts.py b/tests/test_benchmark_artifacts.py

Original file line number	Diff line number	Diff line change
`@@ -1,6 +1,6 @@`
`1`	`1`	`{`
`2`	`2`	`"name": "structural-evidence-extraction",`
`3`		`- "generated_at": "2026-06-06T16:13:09.683771+00:00",`
	`3`	`+ "generated_at": "2026-06-06T16:27:56.843435+00:00",`
`4`	`4`	`"gold_total": 45,`
`5`	`5`	`"results": [`
`6`	`6`	`{`