Commit 63a4770
fix: close git history bypass in sg_only mode and strip scoring from instructions
Two issues were found in the staging audit:
1. Git bypass: agents used `git show HEAD:` and `git checkout HEAD --` to
recover full source from git history, completely defeating sg_only
truncation. Fix: after truncating source files, recommit the truncated
state so HEAD reflects empty files. The verifier is unaffected (it restores
from the /repo_full/ backup). Regenerated all 66 build-requiring
Dockerfile.sg_only files.
2. Scoring contamination: 7 code-review, 1 doc-gen, 1 security, and 1 perf
instruction.md files contained exact scoring formulas (F1, ground truth,
fix_score weights). Replaced them with the vaguer wording "evaluated on
detection accuracy and fix quality" to prevent gaming.
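The first fix can be illustrated with a minimal sketch. It is not the code from the regenerated Dockerfile.sg_only files, which this page does not show; the helper names `git` and `truncate_and_recommit`, the `.py` suffix filter, and the committer identity are all hypothetical. It empties tracked source files and commits that state so that `git show HEAD:<path>` returns the empty file.

```python
import subprocess
from pathlib import Path


def git(repo: Path, *args: str) -> str:
    """Run a git command inside `repo` and return its stdout."""
    result = subprocess.run(
        ["git", "-C", str(repo), *args],
        check=True, capture_output=True, text=True,
    )
    return result.stdout


def truncate_and_recommit(repo: Path, suffixes: tuple = (".py",)) -> None:
    """Empty every tracked file with a matching suffix, then commit that state.

    Hypothetical helper: the suffix list and commit identity are assumptions.
    """
    tracked = git(repo, "ls-files", "-z").split("\0")
    for rel in tracked:
        if rel and rel.endswith(suffixes):
            (repo / rel).write_text("")  # truncate in the working tree
    git(repo, "add", "-A")
    # Commit so HEAD itself points at the truncated files.
    git(repo,
        "-c", "user.name=sg-only", "-c", "user.email=sg-only@example.invalid",
        "commit", "-q", "--allow-empty", "-m", "sg_only: truncate source files")
```

Note that this sketch only makes HEAD reflect the empty files, as the commit message describes. Whether earlier commits stay reachable depends on how the real images handle history, which the message does not say.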
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
1 parent 2e56ec6 commit 63a4770
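The second issue in the commit message, scoring details leaking into instruction.md files, lends itself to a simple text check. The sketch below is an assumption about how such a check might look, not something this commit is known to contain: `SCORING_PATTERNS` and `find_scoring_leaks` are hypothetical names, and the patterns are taken only from the terms the message lists (F1, ground truth, fix_score).

```python
import re

# Hypothetical patterns, based only on the terms named in the commit message.
SCORING_PATTERNS = [r"\bF1\b", r"ground[ _-]truth", r"\bfix_score\b"]


def find_scoring_leaks(text: str) -> list:
    """Return the lines of `text` that mention any scoring term."""
    regex = re.compile("|".join(SCORING_PATTERNS), re.IGNORECASE)
    return [line for line in text.splitlines() if regex.search(line)]
```

Running it over the text of each instruction.md and failing when the result is non-empty would flag files that still expose scoring formulas, while the replacement wording quoted in the message passes.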
File tree
92 files changed (+648, -723 lines changed)
- benchmarks
  - ccb_build
    - bustub-hyperloglog-impl-001/environment
    - cgen-deps-install-001/environment
    - codecoverage-deps-install-001/environment
    - django-dep-refactor-001/environment
    - dotenv-expand-deps-install-001/environment
    - dotnetkoans-deps-install-001/environment
    - envoy-grpc-server-impl-001/environment
    - eslint-markdown-deps-install-001/environment
    - flipt-dep-refactor-001/environment
    - iamactionhunter-deps-install-001/environment
    - k8s-noschedule-taint-feat-001/environment
    - k8s-runtime-object-impl-001/environment
    - k8s-score-normalizer-refac-001/environment
    - pcap-parser-deps-install-001/environment
    - python-http-class-naming-refac-001/environment
    - servo-scrollend-event-feat-001/environment
    - similar-asserts-deps-install-001/environment
    - tensorrt-mxfp4-quant-feat-001/environment
    - vscode-stale-diagnostics-feat-001/environment
  - ccb_debug
    - ansible-vault-regression-prove-001/environment
    - flipt-cache-regression-prove-001/environment
    - qutebrowser-bookmark-regression-prove-001/environment
    - qutebrowser-download-regression-prove-001/environment
    - qutebrowser-tab-regression-prove-001/environment
    - qutebrowser-url-regression-prove-001/environment
    - teleport-ssh-regression-prove-001/environment
    - tutanota-search-regression-prove-001/environment
    - vuls-oval-regression-prove-001/environment
  - ccb_design
    - django-modeladmin-impact-001/environment
    - django-pre-validate-signal-design-001/environment
    - envoy-routeconfig-dep-chain-001/environment
    - envoy-stream-aggregated-sym-001/environment
    - etcd-grpc-api-upgrade-001/environment
    - flipt-protobuf-metadata-design-001/environment
    - flipt-transitive-deps-001/environment
    - k8s-typemeta-dep-chain-001/environment
    - terraform-provider-iface-sym-001/environment
  - ccb_document/terraform-migration-doc-gen-001
  - ccb_fix
    - ansible-abc-imports-fix-001/environment
    - ansible-module-respawn-fix-001/environment
    - flipt-cockroachdb-backend-fix-001/environment
    - flipt-ecr-auth-oci-fix-001/environment
    - flipt-eval-latency-fix-001/environment
    - flipt-otlp-exporter-fix-001/environment
    - flipt-trace-sampling-fix-001/environment
    - nodebb-notif-dropdown-fix-001/environment
    - nodebb-plugin-validate-fix-001/environment
    - openlibrary-fntocli-adapter-fix-001/environment
    - openlibrary-search-query-fix-001/environment
    - openlibrary-solr-boolean-fix-001/environment
    - protonmail-conv-testhooks-fix-001/environment
    - protonmail-dropdown-sizing-fix-001/environment
    - protonmail-holiday-calendar-fix-001/environment
    - pytorch-cudnn-version-fix-001/environment
    - pytorch-dynamo-keyerror-fix-001/environment
    - pytorch-release-210-fix-001/environment
    - pytorch-relu-gelu-fusion-fix-001/environment
    - pytorch-tracer-graph-cleanup-fix-001/environment
  - ccb_secure
    - curl-vuln-reachability-001
    - django-policy-enforcement-001/environment
    - django-repo-scoped-access-001/environment
    - django-role-based-access-001/environment
    - flipt-degraded-context-fix-001/environment
    - flipt-repo-scoped-access-001/environment
  - ccb_test
    - aspnetcore-code-review-001
      - environment
    - calcom-code-review-001
      - environment
    - curl-security-review-001
      - environment
    - envoy-code-review-001
      - environment
    - ghost-code-review-001
      - environment
    - kafka-security-review-001/environment
    - llamacpp-context-window-search-001/environment
    - llamacpp-file-modify-search-001/environment
    - numpy-array-sum-perf-001/environment
    - openhands-search-file-test-001/environment
    - pandas-groupby-perf-001
      - environment
    - sklearn-kmeans-perf-001/environment
    - terraform-code-review-001
      - environment
    - test-unitgen-py-001/environment
    - vscode-code-review-001
      - environment
  - ccb_understand
    - django-composite-field-recover-001/environment
    - django-template-inherit-recall-001/environment
- scripts
Lines changed: 4 additions & 0 deletions
Lines changed: 6 additions & 3 deletions
Lines changed: 8 additions & 5 deletions
Lines changed: 13 additions & 22 deletions
Lines changed: 7 additions & 4 deletions
Lines changed: 8 additions & 5 deletions
Lines changed: 6 additions & 3 deletions
Lines changed: 7 additions & 4 deletions