
Commit f292689

jeremymanning and claude committed
spec(012) Phase 2.3: README documents convergence pipeline (T053)
Updates the "How it works → The paper pipeline" section to describe the spec-012 convergence pipeline (structured action items, most-recent verdict gate, three-way severity routing, per-specialist re-review protocol, and arxiv-intake guardrail).

Closes the last remaining task in the spec-012 task list (T053). With this commit, all 55 of 55 tasks are now landed on PR #198.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
1 parent 7fb8149 commit f292689

1 file changed

Lines changed: 30 additions & 5 deletions


README.md

@@ -34,13 +34,38 @@ reports it: `paper init` → `paper spec` → `paper plan` → `paper tasks` →
 `drafting` (paper-writing + figure-generation + statistics agents; LaTeX is
 built and citations verified) → `paper complete` → `paper review` → `posted`.

-Paper review needs both a points threshold and an accept verdict from **twelve**
-specialist reviewers: writing quality, logical consistency, claim accuracy,
-over-reach, safety/ethics, scientific evidence, statistical analysis, code
-quality, data quality, text formatting, figure critic, jargon police.
-
+Paper review uses a **convergence pipeline** (spec 012). Every reviewer
+emits structured `action_items` with severity ∈ {`writing`, `science`,
+`fatal`}, and the advancement evaluator uses the **most-recent verdict per
+specialist** (against the live artifact hash — stale reviews are ignored).
+
+Three terminal outcomes:
+
+- **All specialists accept** → `paper_accepted` → `posted`.
+- **Any `fatal` severity** → `brainstormed` (back to the backlog), with a
+rejection rationale appended to the idea record citing each fatal item.
+- **Otherwise** (writing/science items, no fatal) → `paper_revision_in_progress`,
+which auto-kicks a revision-spec pipeline that produces a complete
+spec/plan/tasks/analyze directory under
+`specs/auto-revisions/<PROJ-ID>/round-<N>/`. The project then sits at
+`ready_for_implementation` until an implementer agent picks it up.
+
+The **per-specialist re-review protocol** prevents endless-nit loops: when
+a specialist has prior reviews for the same project, its prompt reduces
+to two questions — "(a) prior action items addressed? (b) any new
+issues?" — instead of starting fresh and finding new nits each round.
+
+The twelve specialist reviewers (writing quality, logical consistency,
+claim accuracy, over-reach, safety/ethics, scientific evidence,
+statistical analysis, code quality, data quality, text formatting,
+figure critic, jargon police) each emit action items in their lane.
 Human reviews count double; self-review is rejected by the schema.

+arXiv-submitted papers (third-party, source frozen) skip the writing-
+revision pipeline. Instead the consolidated action items land in
+`projects/<PROJ-ID>/upstream_feedback.yaml`; outcomes are restricted to
+accept-with-caveats or reject.
+
 ## The agents

 There are **50 agents** in [agents/registry.yaml](agents/registry.yaml) — each a
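
The routing described in the added README text could be sketched roughly as below. This is a hypothetical illustration, not code from the repository: the `Review` class, its field names, the `"accept"`/`"reject"` verdict values, and the `route` function are all assumptions. Only the severity names and the three state names come from the diff. It also assumes that "most-recent verdict" means the latest review among those matching the live artifact hash, that a `fatal` item takes priority over acceptance, and that every specialist has at least one live review.

```python
from dataclasses import dataclass, field


@dataclass
class Review:
    # All field names here are hypothetical, chosen for illustration only.
    specialist: str
    artifact_hash: str
    verdict: str          # assumed values: "accept" or "reject"
    submitted_at: int     # assumed: larger means more recent
    severities: list = field(default_factory=list)  # one entry per action item


def latest_live_reviews(reviews, live_hash):
    """Keep the most recent review per specialist, ignoring stale hashes."""
    latest = {}
    for review in reviews:
        if review.artifact_hash != live_hash:
            continue  # stale review: written against an older artifact
        previous = latest.get(review.specialist)
        if previous is None or review.submitted_at > previous.submitted_at:
            latest[review.specialist] = review
    return latest


def route(reviews, live_hash, specialists):
    """Return the next project state under the three-way severity routing."""
    latest = latest_live_reviews(reviews, live_hash)
    missing = [name for name in specialists if name not in latest]
    if missing:
        # Assumption: routing is only evaluated once every specialist has
        # reviewed the live artifact.
        raise ValueError(f"no live review from: {missing}")
    if any("fatal" in review.severities for review in latest.values()):
        return "brainstormed"
    if all(latest[name].verdict == "accept" for name in specialists):
        return "paper_accepted"
    return "paper_revision_in_progress"
```

Under these assumptions, a `fatal` item attached to a review of an older artifact hash does not block acceptance, because that review is filtered out before the severity check runs.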
