chore: mark US-004 as passing, update progress log

LoCoBench Bot · claude · LoCoBench Bot · commit 111b5fe78543 · 2026-02-16T15:21:03.000Z
Co-Authored-By: Claude Opus 4.6 &lt;noreply@anthropic.com&gt;
diff --git a/ralph-gapfill-infra/prd.json b/ralph-gapfill-infra/prd.json
@@ -64,7 +64,7 @@
         "python3 scripts/generate_manifest.py still runs without errors after archival"
       ],
       "priority": 4,
-      "passes": false,
+      "passes": true,
       "notes": "Check if there's a configs/repoqa_2config.sh or repoqa_3config.sh. Move it too. Don't delete — archive for reference."
     },
     {
diff --git a/ralph-gapfill-infra/progress.txt b/ralph-gapfill-infra/progress.txt
@@ -53,3 +53,18 @@
   - compare_configs.py imports DIR_PREFIX_TO_SUITE from aggregate_status.py rather than defining its own copy
   - generate_manifest.py exits 1 when runs/official doesn't exist — expected in worktrees without run data
 ---
+
+## 2026-02-16 - US-004
+- Archived saturated ccb_repoqa benchmark (1.000/1.000 on both configs = zero signal)
+- Moved benchmarks/ccb_repoqa/ → benchmarks/archive/ccb_repoqa/
+- Moved configs/repoqa_2config.sh → configs/archive/repoqa_2config.sh
+- Removed 10 repoqa entries from configs/selected_benchmark_tasks.json
+- Updated benchmarks/README.md: removed from active list, added Archived Benchmarks section, renumbered remaining, updated totals
+- Recalculated metadata in selected_benchmark_tasks.json (total_selected, tasks_per_benchmark, language stats, avg MCP score)
+- Files changed: benchmarks/ccb_repoqa/ (moved), configs/repoqa_2config.sh (moved), configs/selected_benchmark_tasks.json, benchmarks/README.md
+- **Learnings for future iterations:**
+  - `git mv` preserves history for archived directories — preferred over manual copy+delete
+  - selected_benchmark_tasks.json metadata.total_selected was stale (171 vs actual 200) — use Python to recount from tasks array when modifying
+  - DIR_PREFIX_TO_SUITE mappings for repoqa_ were left in place in scripts — archived suites' run data may still exist in runs/official and the scripts should still recognize them
+  - When archiving suites: move benchmark dir, move config script, remove from selected_benchmark_tasks.json, update benchmarks/README.md
+---

Original file line number	Diff line number	Diff line change
`@@ -64,7 +64,7 @@`
`64`	`64`	`"python3 scripts/generate_manifest.py still runs without errors after archival"`
`65`	`65`	`],`
`66`	`66`	`"priority": 4,`
`67`		`- "passes": false,`
	`67`	`+ "passes": true,`
`68`	`68`	`"notes": "Check if there's a configs/repoqa_2config.sh or repoqa_3config.sh. Move it too. Don't delete — archive for reference."`
`69`	`69`	`},`
`70`	`70`	`{`