Skip to content

Commit b17d57d

Browse files
committed
fix pipeline
1 parent 9ace016 commit b17d57d

31 files changed

Lines changed: 1124 additions & 1686 deletions

.claude/CLAUDE.md

Lines changed: 10 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -6,23 +6,25 @@ Rust library for NP-hard problem reductions. Implements computational problems w
66
## Skills
77
These repo-local skills live under `.claude/skills/*/SKILL.md`.
88

9+
- [run-pipeline](skills/run-pipeline/SKILL.md) -- Pick a Ready issue from the GitHub Project board, move it through In Progress -> issue-to-pr -> Review pool. One issue at a time; forever-loop handles iteration.
910
- [issue-to-pr](skills/issue-to-pr/SKILL.md) -- Convert a GitHub issue into a PR with an implementation plan. One item per PR: `[Rule]` issues require both models to exist on `main`; never bundle model + rule in the same PR.
1011
- [add-model](skills/add-model/SKILL.md) -- Add a new problem model. Can be used standalone (brainstorms with user) or called from `issue-to-pr`.
1112
- [add-rule](skills/add-rule/SKILL.md) -- Add a new reduction rule. Can be used standalone (brainstorms with user) or called from `issue-to-pr`.
12-
- [review-implementation](skills/review-implementation/SKILL.md) -- Review implementation completeness by dispatching parallel subagents (structural + quality) with fresh context. Auto-detects new models/rules from git diff. Called automatically at the end of `add-model`/`add-rule`, after each `executing-plans` batch, or standalone via `/review-implementation`.
13-
- [fix-pr](skills/fix-pr/SKILL.md) -- Resolve PR review comments (user + Copilot), fix CI failures, and address codecov coverage gaps. Uses `gh api` for codecov (not local `cargo-llvm-cov`).
13+
- [review-structural](skills/review-structural/SKILL.md) -- Project-specific structural completeness check: model/rule checklists, build, semantic correctness, issue compliance. Read-only, no code changes. Called by `review-pipeline`.
14+
- [review-quality](skills/review-quality/SKILL.md) -- Generic code quality review: DRY, KISS, cohesion/coupling, test quality, HCI. Read-only, no code changes. Called by `review-pipeline`.
15+
- [fix-pr](skills/fix-pr/SKILL.md) -- Resolve PR review comments, fix CI failures, and address codecov coverage gaps. Uses `gh api` for codecov (not local `cargo-llvm-cov`).
1416
- [write-model-in-paper](skills/write-model-in-paper/SKILL.md) -- Write or improve a problem-def entry in the Typst paper (standalone, for improving existing entries). Core instructions are inlined in `add-model` Step 6.
1517
- [write-rule-in-paper](skills/write-rule-in-paper/SKILL.md) -- Write or improve a reduction-rule entry in the Typst paper (standalone, for improving existing entries). Core instructions are inlined in `add-rule` Step 5.
1618
- [release](skills/release/SKILL.md) -- Create a new crate release. Determines version bump from diff, verifies tests/clippy, then runs `make release`.
1719
- [check-issue](skills/check-issue/SKILL.md) -- Quality gate for `[Rule]` and `[Model]` issues. Checks usefulness, non-triviality, correctness of literature, and writing quality. Posts structured report and adds failure labels.
20+
- [fix-issue](skills/fix-issue/SKILL.md) -- Fix quality issues found by check-issue — auto-fixes mechanical problems, brainstorms substantive issues with human, then re-checks and moves to Ready.
1821
- [topology-sanity-check](skills/topology-sanity-check/SKILL.md) -- Run sanity checks on the reduction graph: detect orphan (isolated) problems and redundant reduction rules.
1922
- `topology-sanity-check orphans` -- Detect isolated problem types (runs `examples/detect_isolated_problems.rs`)
2023
- `topology-sanity-check np-hardness` -- Verify NP-hardness proof chains from 3-SAT (runs `examples/detect_unreachable_from_3sat.rs`)
2124
- `topology-sanity-check redundancy [source target]` -- Check for dominated reduction rules
22-
- [project-pipeline](skills/project-pipeline/SKILL.md) -- Pick a Ready issue from the GitHub Project board, move it through In Progress -> issue-to-pr --execute -> Review pool.
23-
- [review-pipeline](skills/review-pipeline/SKILL.md) -- Pick a PR from Review pool column, fix Copilot review comments, run structural completeness check, fix CI, run agentic feature tests, move to Final review.
25+
- [review-pipeline](skills/review-pipeline/SKILL.md) -- Agentic review for PRs in Review pool: runs structural check, quality check, and agentic feature tests (no code changes), posts combined verdict, always moves to Final review.
2426
- [propose](skills/propose/SKILL.md) -- Interactive brainstorming to help domain experts propose a new model or rule. Asks one question at a time, uses mathematical language (no programming jargon), and files a GitHub issue.
25-
- [final-review](skills/final-review/SKILL.md) -- Interactive maintainer review for PRs in "Final review" column. Assesses usefulness, safety, completeness, quality ranking, then merge or hold.
27+
- [final-review](skills/final-review/SKILL.md) -- Interactive maintainer review for PRs in "Final review" column. Merges main, walks through agentic review bullets with human, then merge or hold.
2628
- [dev-setup](skills/dev-setup/SKILL.md) -- Interactive wizard to install and configure all development tools for new maintainers.
2729
- [tutorial](skills/tutorial/SKILL.md) -- Interactive tutorial — walk through the pred CLI to explore, reduce, and solve NP-hard problems. No Rust internals.
2830

@@ -62,10 +64,10 @@ make run-issue N=42 # Run issue-to-pr --execute for a GitHub issue
6264
make run-pipeline # Pick next Ready issue from project board, implement, move to Review pool
6365
make run-pipeline N=97 # Process a specific issue from the project board
6466
make run-pipeline-forever # Poll Ready column, run-pipeline when new issues appear
65-
make run-review # Pick next PR from Review pool column, fix Copilot comments, fix CI, run agentic tests
67+
make run-review # Pick next PR from Review pool column, run agentic review, move to Final review
6668
make run-review N=570 # Process a specific PR from the Review pool column
67-
make run-review-forever # Poll Review pool, auto-request Copilot reviews, dispatch run-review when reviewed
68-
make copilot-review # Request Copilot code review on current PR (requires: gh extension install ChrisCarini/gh-copilot-review)
69+
make run-review-forever # Poll Review pool for eligible PRs, dispatch run-review
70+
make copilot-review # (Optional) Request Copilot code review on current PR
6971
make release V=x.y.z # Tag and push a new release (CI publishes to crates.io)
7072
# Set RUNNER=claude to use Claude instead of Codex (default: codex)
7173
# Default Codex model: CODEX_MODEL=gpt-5.4

.claude/skills/add-model/SKILL.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -259,7 +259,7 @@ Checklist: display name registered, notation self-contained, background present,
259259
make test clippy # Must pass
260260
```
261261

262-
If running standalone (not inside `make run-plan`), invoke [review-implementation](../review-implementation/SKILL.md) to verify all structural and semantic checks pass. When running inside a plan, the outer orchestrator handles the review.
262+
Structural and quality review is handled by the `review-pipeline` stage, not here. The run stage just needs to produce working code.
263263

264264
## Naming Conventions
265265

.claude/skills/add-rule/SKILL.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -175,7 +175,7 @@ cargo run --example export_schemas # Generate problem schemas for docs/paper bu
175175
make test clippy # Must pass
176176
```
177177

178-
If running standalone (not inside `make run-plan`), invoke [review-implementation](../review-implementation/SKILL.md) to verify all structural and semantic checks pass. When running inside a plan, the outer orchestrator handles the review.
178+
Structural and quality review is handled by the `review-pipeline` stage, not here. The run stage just needs to produce working code.
179179

180180
## Solver Rules
181181

.claude/skills/dev-setup/SKILL.md

Lines changed: 4 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -31,7 +31,7 @@ When scanning, read these files for tool references:
3131

3232
Organize tools into three tiers in `dependencies.md`:
3333
- **Core** — needed to build, test, and generate docs
34-
- **Skill** — needed for the AI-assisted pipeline (gh, claude, pred, copilot-review)
34+
- **Skill** — needed for the AI-assisted pipeline (gh, claude, pred)
3535
- **Optional** — nice to have but not required (julia)
3636

3737
Each tool needs: name, check command, install command (macOS), install command (Linux), purpose.
@@ -54,7 +54,7 @@ For each tool in the **Core Tools** table of `dependencies.md`:
5454
3. **If missing** → print the install command for the detected platform, then execute it
5555

5656
After all core tools are done, ask:
57-
> "Core tools are installed. Do you also want to set up the AI pipeline tools (gh, claude, pred, copilot-review)?"
57+
> "Core tools are installed. Do you also want to set up the AI pipeline tools (gh, claude, pred)?"
5858
5959
- **Yes** → proceed to Step 4
6060
- **No** → skip to Step 6
@@ -148,14 +148,7 @@ If this fails, the user likely needs org-level project scopes:
148148
gh auth refresh -s read:project,project
149149
```
150150

151-
**6c: Test `make run-review` prerequisites**
152-
153-
```bash
154-
# Verify gh-copilot-review works
155-
gh copilot-review --help
156-
```
157-
158-
**6d: Test claude CLI**
151+
**6c: Test claude CLI**
159152

160153
```bash
161154
claude --version
@@ -179,7 +172,7 @@ If all pipeline checks pass, explain the project-based contribution pipeline:
179172
> Picks the next **Ready** issue, moves it to **In Progress**, implements it (using `/issue-to-pr` → `/add-model` or `/add-rule`), creates a PR, then moves it to **Review pool**.
180173
>
181174
> ### `make run-review` (PR → review)
182-
> Picks the next **Review pool** PR, waits for Copilot review, fixes comments and CI failures, runs agentic feature tests, then moves it to **Final review** for human approval.
175+
> Picks the next **Review pool** PR, runs agentic review (structural + quality + feature tests), then moves it to **Final review** for human approval.
183176
>
184177
> ### Targeting specific items
185178
> - `make run-pipeline N=42` — process issue #42

.claude/skills/dev-setup/dependencies.md

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,6 @@ Rescan by running `/dev-setup` and choosing "rescan".
2525
| Tool | Check Command | Install (macOS) | Install (Linux) | Purpose |
2626
|------|--------------|-----------------|-----------------|---------|
2727
| gh | `gh --version` | `brew install gh` | `sudo apt install gh` | GitHub CLI |
28-
| gh-copilot-review | `gh copilot-review --help` | `gh extension install ChrisCarini/gh-copilot-review` | same | Copilot PR reviews |
2928
| claude | `claude --version` | `npm install -g @anthropic-ai/claude-code` | same | AI-assisted pipeline |
3029
| pred | `pred --version` | `make cli` or `cargo install --path problemreductions-cli` | same | Project CLI (check-issue, topology-sanity-check, propose) |
3130

0 commit comments

Comments
 (0)