feat: enforce limit-ratio quality gate and bump 0.8.0 (#4)
* feat: enforce extreme/tle ratio in final tests and bump to 0.8.0
Guarantee final generated tests prioritize limit-oriented coverage by requiring at least half type=3/4 cases by default, and verify this via manifest-backed quality checks with an explicit opt-out. Also synchronize workflow docs and plugin/package versions for the 0.8.0 release line.
Made-with: Cursor
* fix: align balance description and preserve duplicates in sampling
Address Copilot review by matching schema wording with actual deterministic ordering and preventing unconditional signature-based de-duplication during final sampling, so enable_dedup=false semantics remain effective.
Made-with: Cursor
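The de-duplication fix described above can be illustrated with a minimal sketch. It assumes candidates arrive in a fixed order and carry a `signature` field. Only `enable_dedup` is named in the commit message. `sample_final`, the dict shape, and `count` are hypothetical and not taken from the repository.

```python
def sample_final(candidates, count, enable_dedup=True):
    """Pick up to `count` candidates in input order (hypothetical helper).

    Signature-based de-duplication is applied only when enable_dedup is True,
    so enable_dedup=False keeps duplicate candidates in the final sample.
    """
    if count <= 0:
        return []
    seen = set()
    picked = []
    for cand in candidates:  # input order is preserved, so the result is deterministic
        if enable_dedup:
            if cand["signature"] in seen:
                continue  # skip a repeated signature only when dedup is on
            seen.add(cand["signature"])
        picked.append(cand)
        if len(picked) == count:
            break
    return picked
```

With de-duplication disabled, two candidates sharing a signature can both reach the final sample. With it enabled, only the first of them is kept.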
agents/autocode-workflow.md: 2 lines changed (2 additions, 0 deletions)
@@ -25,4 +25,6 @@ Always work through this sequence unless the task is explicitly outside problem
 
 When the user asks for a later step directly, explain which prerequisite step is missing and complete the missing work first.
 
+When running `problem_generate_tests`, enforce test quality: final test data should contain at least half limit-oriented cases (`type=3` extreme + `type=4` tle) when candidate availability allows.
+
 Treat hook feedback as authoritative. If a hook denies a tool call, fix the workflow gap instead of retrying the same call.
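As a rough illustration of the ratio rule added in this diff, the sketch below checks whether a final test set meets the "at least half" requirement. The type codes 3 (extreme) and 4 (tle) come from the added line. `Case`, `passes_limit_gate`, `min_ratio`, and the reading of "when candidate availability allows" (the requirement is capped at the number of limit-oriented candidates that exist) are assumptions, not the plugin's actual implementation.

```python
import math
from dataclasses import dataclass

# Type codes taken from the workflow text: 3 = extreme, 4 = tle.
LIMIT_TYPES = {3, 4}


@dataclass(frozen=True)
class Case:
    """Hypothetical test-case record; the real manifest format may differ."""
    name: str
    type: int


def count_limit(cases):
    """Number of limit-oriented cases (type 3 or 4) in a list."""
    return sum(1 for c in cases if c.type in LIMIT_TYPES)


def passes_limit_gate(final_cases, candidates, min_ratio=0.5):
    """Check that limit-oriented cases make up at least min_ratio of the final set.

    Assumption: if the candidate pool holds fewer limit-oriented cases than the
    ratio demands, the requirement is capped at what the pool can supply.
    """
    total = len(final_cases)
    if total == 0:
        return True  # nothing to check
    required = math.ceil(min_ratio * total)
    available = count_limit(candidates)
    return count_limit(final_cases) >= min(required, available)
```

Under these assumptions, a final set of four cases needs two limit-oriented cases to pass, unless the candidate pool contains fewer than two of them.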