Commit b949309
Phase 0 issue #3: 6 conformance-harness-recall fixtures + harvester improvements
Six hand-curated fixtures from arc's native conformance scenarios on
origin/codex/chio-kb-a-grade-dogfood:
- capability-validation (signed cap verifies)
- delegation-attenuation (child scope ⊂ parent scope)
- receipt-integrity (tampered receipt rejected)
- revocation-propagation (revocation observed within window)
- dpop-verification (proof binds to request)
- governed-transaction-enforcement (deny verdict short-circuits)
Each fixture has:
- failing_test = real arc scenario path + scenario id
- failure_message = plausible Rust-driver assertion-failure output
(FAILED <id>... assertion <name> expected: ..., got: ..., Reason: ...)
- canonical_fix = ranked list with REAL anchors (function names,
schema sections), no "TODO: human-curated" leftovers
- notes explaining curation provenance
eval-outcomes report now shows conformance-harness-recall: 6/20
(BLOCKED — fixtures). 14 more needed for ADR-0002 sign-off.
Harvester improvements (ops/scripts/harvest-conformance-fixtures.py):
- Broader CONFORMANCE_PATHS to include crates/chio-conformance/ and
integrations/mcp-adapter/tests/.
- is_test_file() now recognizes Rust integration tests
(crates/.../tests/*.rs with "conformance" in path), JSON scenario
fixtures (tests/conformance/native/scenarios/*.json), and peer
client/server programs.
- extract_test_signatures() handles Rust test fns (`fn test_*`) and
JSON scenario id fields.
- looks_like_canonical_fix() includes docs/sdk/, docs/protocol/,
docs/release/, docs/mcp/.
- New --branch flag so the walk targets a specific ref without
requiring checkout (used here against
origin/codex/chio-kb-a-grade-dogfood).
Harvester run against arc emitted 9 medium-confidence candidates;
6 were SDK-test noise (api dashboard token tests, sweeping refactor
commits, SDK serialization unit tests). The harvester correctly found
the chio-conformance/ commits but its test-file regex matches every
`*.test.ts` file in the diff, including SDK files. Hand-curation from
the 6 real native scenarios was faster than another harvester
iteration. Tracked as a follow-up: tighten is_test_file() to require
"conformance" in the test file's PATH, not just any test file in a
commit that touched conformance.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>1 parent 4ced358 commit b949309
9 files changed
Lines changed: 230 additions & 103 deletions
File tree
- chio-pack/eval/fixtures/conformance-recall
- ops/scripts
- vault/_meta/dashboards
Lines changed: 27 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
Lines changed: 26 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
Lines changed: 27 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
This file was deleted.
Lines changed: 28 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
Lines changed: 28 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
Lines changed: 29 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
36 | 36 | | |
37 | 37 | | |
38 | 38 | | |
39 | | - | |
40 | | - | |
41 | 39 | | |
| 40 | + | |
| 41 | + | |
42 | 42 | | |
43 | 43 | | |
44 | | - | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
45 | 75 | | |
46 | 76 | | |
47 | 77 | | |
| |||
60 | 90 | | |
61 | 91 | | |
62 | 92 | | |
63 | | - | |
64 | | - | |
65 | | - | |
| 93 | + | |
| 94 | + | |
| 95 | + | |
66 | 96 | | |
67 | 97 | | |
68 | 98 | | |
| |||
90 | 120 | | |
91 | 121 | | |
92 | 122 | | |
93 | | - | |
94 | | - | |
95 | | - | |
96 | | - | |
97 | 123 | | |
98 | 124 | | |
99 | 125 | | |
100 | | - | |
101 | | - | |
| 126 | + | |
| 127 | + | |
102 | 128 | | |
103 | 129 | | |
104 | 130 | | |
105 | 131 | | |
106 | 132 | | |
| 133 | + | |
| 134 | + | |
| 135 | + | |
| 136 | + | |
107 | 137 | | |
108 | 138 | | |
109 | 139 | | |
| |||
119 | 149 | | |
120 | 150 | | |
121 | 151 | | |
| 152 | + | |
| 153 | + | |
| 154 | + | |
| 155 | + | |
| 156 | + | |
| 157 | + | |
| 158 | + | |
| 159 | + | |
| 160 | + | |
| 161 | + | |
| 162 | + | |
| 163 | + | |
| 164 | + | |
| 165 | + | |
| 166 | + | |
| 167 | + | |
| 168 | + | |
122 | 169 | | |
123 | 170 | | |
124 | 171 | | |
| |||
180 | 227 | | |
181 | 228 | | |
182 | 229 | | |
183 | | - | |
| 230 | + | |
184 | 231 | | |
185 | 232 | | |
186 | | - | |
| 233 | + | |
187 | 234 | | |
188 | 235 | | |
189 | 236 | | |
| |||
241 | 288 | | |
242 | 289 | | |
243 | 290 | | |
| 291 | + | |
244 | 292 | | |
245 | 293 | | |
246 | 294 | | |
| |||
249 | 297 | | |
250 | 298 | | |
251 | 299 | | |
252 | | - | |
| 300 | + | |
253 | 301 | | |
254 | 302 | | |
255 | 303 | | |
| |||
0 commit comments