Commit 30c785c
fix: add /workspace symlink to sg_only Dockerfiles for all 9 prove tasks
All 9 regression-prove tasks in ccb_debug had `ln -sf /app /workspace` in
their baseline Dockerfiles (needed so PATCH_APPLY_DIR=/workspace resolves
to the actual codebase in /app), but the symlink was missing from
Dockerfile.sg_only.
Root cause of MCP scoring 0.00 on 8 prove tasks (BL=0.50):
- In sg_only mode, /workspace did not exist and claude user couldn't create it
- Agent correctly wrote regression_test.py to /app/ instead
- find_and_prove_verifier.sh looked for AGENT_TEST_PATH=/workspace/regression_test.*
- Verifier always found "file not found" → scored 0.0
Fix: add `RUN ln -sf /app /workspace || true` to all 9 Dockerfile.sg_only
files, matching what baseline Dockerfiles already had.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>1 parent 0fd63fe commit 30c785c
File tree
9 files changed
+36
-0
lines changed- benchmarks/ccb_debug
- ansible-galaxy-tar-regression-prove-001/environment
- flipt-auth-cookie-regression-prove-001/environment
- qutebrowser-adblock-cache-regression-prove-001/environment
- qutebrowser-darkmode-threshold-regression-prove-001/environment
- qutebrowser-hsv-color-regression-prove-001/environment
- qutebrowser-url-regression-prove-001/environment
- teleport-ssh-regression-prove-001/environment
- tutanota-search-regression-prove-001/environment
- vuls-oval-regression-prove-001/environment
9 files changed
+36
-0
lines changedLines changed: 4 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
27 | 27 | | |
28 | 28 | | |
29 | 29 | | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
30 | 34 | | |
31 | 35 | | |
32 | 36 | | |
| |||
Lines changed: 4 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
25 | 25 | | |
26 | 26 | | |
27 | 27 | | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
28 | 32 | | |
29 | 33 | | |
30 | 34 | | |
| |||
benchmarks/ccb_debug/qutebrowser-adblock-cache-regression-prove-001/environment/Dockerfile.sg_only
Lines changed: 4 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
27 | 27 | | |
28 | 28 | | |
29 | 29 | | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
30 | 34 | | |
31 | 35 | | |
32 | 36 | | |
| |||
Lines changed: 4 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
27 | 27 | | |
28 | 28 | | |
29 | 29 | | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
30 | 34 | | |
31 | 35 | | |
32 | 36 | | |
| |||
Lines changed: 4 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
27 | 27 | | |
28 | 28 | | |
29 | 29 | | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
30 | 34 | | |
31 | 35 | | |
32 | 36 | | |
| |||
Lines changed: 4 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
27 | 27 | | |
28 | 28 | | |
29 | 29 | | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
30 | 34 | | |
31 | 35 | | |
32 | 36 | | |
| |||
Lines changed: 4 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
25 | 25 | | |
26 | 26 | | |
27 | 27 | | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
28 | 32 | | |
29 | 33 | | |
30 | 34 | | |
| |||
Lines changed: 4 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
25 | 25 | | |
26 | 26 | | |
27 | 27 | | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
28 | 32 | | |
29 | 33 | | |
30 | 34 | | |
| |||
Lines changed: 4 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
25 | 25 | | |
26 | 26 | | |
27 | 27 | | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
28 | 32 | | |
29 | 33 | | |
30 | 34 | | |
| |||
0 commit comments