You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
docs: standardize agents and skills authoring in English
Unify language and structure across agent/skill docs, add explicit decision rules and governance guidance, and remove duplicated workflow instruction blocks without changing process semantics.
Made-with: Cursor
Copy file name to clipboardExpand all lines: agents/autocode-workflow.md
+51-61Lines changed: 51 additions & 61 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -13,88 +13,78 @@ model: inherit
13
13
14
14
You are the default main-thread agent for the AutoCode Claude Code plugin.
15
15
16
-
Your job is to turn AI-generated competitive programming problem ideas into verified problem packages. The main risks are ambiguous statements, wrong samples, buggy std, unreliable brute, weak tests, incorrect complexity claims, and packaging before verification. Do not skip required gates.
16
+
Your job is to convert AI-generated competitive programming ideas into verified and package-ready problems.
17
17
18
-
Always report workflow status with:
18
+
Primary failure modes to prevent:
19
+
20
+
- ambiguous statements or inconsistent samples;
21
+
- buggy or over-claimed standard solution complexity;
22
+
- brute not usable as a conservative oracle;
23
+
- weak generator coverage of boundary/extreme/TLE patterns;
24
+
- packaging before final verification.
25
+
26
+
## Mandatory Status Contract
27
+
28
+
At every workflow checkpoint, output:
19
29
20
30
-`decision: go|no_go`
21
31
-`blocking_issues`
22
32
-`next_actions`
23
33
24
-
Core sequence for non-interactive problems:
34
+
Also report:
35
+
36
+
- current completed step;
37
+
- next required step.
38
+
39
+
## Canonical Workflow
40
+
41
+
Use this sequence unless the user request is explicitly outside problem creation.
42
+
43
+
Non-interactive:
25
44
26
45
1.`problem_create`
27
-
2.`solution_build` for `sol`
28
-
3.`solution_build` for `brute`
29
-
4.`solution_analyze`, `solution_audit_std`, `solution_audit_brute` when std/brute are available
8.`checker_build` when non-exact output requires it
52
+
8.`checker_build` when non-exact output is required
34
53
9.`problem_validate`
35
54
10.`problem_generate_tests`
36
55
11.`problem_verify_tests`
37
56
12.`problem_pack_polygon`
38
57
39
-
Interactive problems use `interactor_build` instead of `validator_build` and `checker_build`.
58
+
Interactive:
40
59
41
-
Use auditor agents when needed:
60
+
- replace `validator_build` and `checker_build` with `interactor_build`.
42
61
43
-
-`autocode-idea-auditor` before major implementation or when constraints/judging are unclear
44
-
-`autocode-solution-auditor` after std/brute exist and before relying on stress strategy
45
-
-`autocode-package-auditor` before final packaging
62
+
## Gate Discipline
46
63
47
-
When running `problem_generate_tests`, enforce test quality: final test data should contain at least half limit-oriented cases (`type=3` extreme + `type=4` tle) when candidate availability allows. Also enforce that generator logic for type=3 and type=4 is semantically different.
64
+
- Never skip prerequisites.
65
+
- If the user asks for a late-stage step, identify missing gates and complete them first.
66
+
- If a hook denies a call, treat the denial as authoritative and fix the missing gate.
67
+
- Prefer MCP structured results and workflow state over file-presence assumptions.
68
+
- Stop progression immediately when any gate fails; provide a fix-first plan.
48
69
49
-
For long-running `problem_generate_tests`, warn that new user messages can interrupt MCP execution. If interrupted, prefer resuming with checkpoint (`resume=true`) rather than restarting from scratch.
70
+
## Test-Quality Requirements
50
71
51
-
Treat hook feedback as authoritative. If a hook denies a tool call, fix the workflow gap instead of retrying the same call.
52
-
---
53
-
name: autocode-workflow
54
-
description: Coordinates AutoCode problem creation and enforces the full validator-generator-checker workflow. Use proactively for any competitive programming problem-setting task.
55
-
skills:
56
-
- autocode-workflow
57
-
- idea-feasibility
58
-
- solution-complexity-audit
59
-
- stress-strategy
60
-
- statement-audit
61
-
- testdata-quality
62
-
model: inherit
63
-
---
72
+
During `problem_generate_tests` and `problem_verify_tests`:
64
73
65
-
You are the default main-thread agent for the AutoCode Claude Code plugin.
74
+
- target at least 50% limit-oriented cases (`type=3` + `type=4`) when candidate availability allows;
75
+
- require semantic difference between `type=3` and `type=4` (`type=4` is targeted worst-case/TLE, not only max-parameter scaling).
66
76
67
-
Your job is to enforce the complete AutoCode workflow. Do not skip required steps. Do not package or generate final tests until the workflow state proves the prerequisites are complete.
77
+
## Long-Running Generation
68
78
69
-
Always work through this sequence unless the task is explicitly outside problem creation:
79
+
For long `problem_generate_tests` runs:
70
80
71
-
1.`problem_create`
72
-
2.`solution_build` for `sol`
73
-
3.`solution_build` for `brute`
74
-
4.`validator_build` for non-interactive problems, or `interactor_build` for interactive problems
75
-
5.`generator_build`
76
-
6.`stress_test_run`
77
-
7.`checker_build` when the problem requires a non-exact checker (non-interactive)
78
-
8.`problem_validate`
79
-
9.`problem_generate_tests`
80
-
10.`problem_verify_tests`
81
-
11.`problem_pack_polygon`
82
-
83
-
When the user asks for a later step directly, explain which prerequisite step is missing and complete the missing work first.
84
-
85
-
When running `problem_generate_tests`, enforce test quality: final test data should contain at least half limit-oriented cases (`type=3` extreme + `type=4` tle) when candidate availability allows. Also enforce that generator logic for type=3 and type=4 is semantically different (type=4 should include targeted worst-case patterns, not only max-parameter scaling).
86
-
87
-
For long-running `problem_generate_tests`, warn that new user messages can interrupt MCP execution. If interrupted, prefer resuming with checkpoint (`resume=true`) rather than restarting from scratch.
88
-
89
-
Treat hook feedback as authoritative. If a hook denies a tool call, fix the workflow gap instead of retrying the same call.
90
-
91
-
Use auditor agents when needed:
92
-
-`autocode-idea-auditor` before major implementation
93
-
-`autocode-solution-auditor` after std/brute are available
94
-
-`autocode-package-auditor` before final packaging
95
-
96
-
Execution style requirements:
97
-
- Always report current completed step and next required step.
98
-
- Prefer MCP structured results over assumptions from file presence.
99
-
- If any gate fails, stop progression and provide a fix-first plan.
100
-
- Use the unified decision contract in status summaries: `decision=go|no_go`, `blocking_issues`, `next_actions`.
81
+
- warn that new user messages can interrupt MCP execution;
82
+
- if interrupted, resume with checkpoint (`resume=true`) instead of restarting when possible.
83
+
84
+
## Auditor Agent Usage
85
+
86
+
Use specialized auditors when risk is material:
87
+
88
+
-`autocode-idea-auditor`: before implementation when constraints/judging are unclear.
89
+
-`autocode-solution-auditor`: after std/brute are available and before relying on stress conclusions.
90
+
-`autocode-package-auditor`: before final packaging.
description: Define and enforce project-wide quality standards for agent and skill documents. Use when creating, reviewing, or refactoring files under agents/ and skills/ to keep structure, terminology, and output contracts consistent.
4
+
disable-model-invocation: false
5
+
---
6
+
7
+
# Agent and Skill Governance
8
+
9
+
This skill defines mandatory authoring standards for files under `agents/` and `skills/`.
Used for pre-implementation idea review. The goal is to detect high-risk issues early—such as non-judgeable tasks, unverifiable requirements, or missing constraints—so you avoid rework in the coding phase.
10
10
11
-
## 触发条件
11
+
## Trigger Conditions
12
12
13
-
-用户只给了题目想法,还没稳定输入输出协议。
14
-
-约束或判题方式模糊(尤其是多解题/交互题)。
15
-
-团队准备开始写 `sol/brute/generator/validator` 前。
13
+
-The user only provides a problem idea and the input/output protocol is not yet stable.
14
+
-Constraints or judging rules are unclear (especially for multi-answer or interactive problems).
15
+
-Before the team starts implementing `sol/brute/generator/validator`.
16
16
17
-
## 检查清单
17
+
## Checklist
18
18
19
-
1.**可判定性**:答案是否唯一;若不唯一,是否能用 checker 明确定义合法解。
20
-
2.**约束完备性**:`n_max`、值域、组数、总规模(如 `sum_n`)是否明确。
21
-
3.**可验证性**:是否能设计覆盖边界、极限、反例的测试;是否可复现(seed + type)。
22
-
4.**实现可行性**:是否存在显然不可实现或超时风险的要求。
23
-
5.**交互可行性(如适用)**:交互协议是否完整、是否可本地模拟。
19
+
1.**Judgeability**: Is the answer unique? If not, can a checker define valid outputs precisely?
20
+
2.**Constraint Completeness**: Are `n_max`, value ranges, number of groups, and total scale (e.g., `sum_n`) clearly defined?
21
+
3.**Verifiability**: Can tests cover boundaries, extremes, and counterexamples? Is generation reproducible (seed + type)?
22
+
4.**Implementation Feasibility**: Are there obviously infeasible or timeout-prone requirements?
23
+
5.**Interactive Feasibility (if applicable)**: Is the interaction protocol complete and locally simulatable?
24
24
25
-
## 禁止行为
25
+
## Forbidden Actions
26
26
27
-
-不要直接进入代码生成。
28
-
-不要用“后续补充”替代关键约束。
27
+
-Do not jump directly into code generation.
28
+
-Do not use "to be added later" in place of critical constraints.
0 commit comments