@@ -20,7 +20,14 @@ Outputs:
2020- ` report.md ` : markdown report
2121- ` bench.sqlite ` : temporary benchmark database
2222
23- Ranker options:
23+ Strategies (` --strategy ` ):
24+
25+ - ` auto ` : pick ` beam ` for ≤50 nodes, ` block ` otherwise.
26+ - ` beam ` : depth-first LLM beam expansion (small trees).
27+ - ` block ` : token-bounded block partitioning with cross-block merge.
28+ - ` vertical ` : baseline — per-beam-branch subtree blocks, no cross-branch view.
29+
30+ Ranker options (apply to ` block ` / ` vertical ` ):
2431
2532- ` --ranker none ` : preserve traversal and block-local LLM order.
2633- ` --ranker bm25 ` : lexical path ordering for cross-block merge candidates.
@@ -29,7 +36,7 @@ Ranker options:
2936
3037### Latest Full Run
3138
32- Claude Sonnet 4.6, ` tier=all ` , ` strategy=block ` , ` ranker=none ` , ` top_k=10 ` .
39+ Claude Sonnet 4.6, ` tier=all ` , ` ranker=none ` , ` top_k=10 ` , 500 queries, 0 failures .
3340
3441Metric notes:
3542
@@ -38,6 +45,22 @@ Metric notes:
3845- ` exact@gold ` : all gold files are recovered within that same cutoff.
3946- ` found@gold ` : average number of gold files recovered within that cutoff.
4047
48+ #### Block (ConDB) vs Vertical (baseline)
49+
50+ Vertical is a per-beam-branch variant: each parent expands its children into
51+ separate subtree blocks (` A→B ` , ` A→C ` ), one LLM call per branch. It removes
52+ the cross-branch view that Block keeps, so it serves as a direct baseline
53+ for the merged-pool design used in ConDB.
54+
55+ | variant | recall@gold | exact@gold | MRR | nDCG@10 | avg returned | avg latency |
56+ | ---| ---:| ---:| ---:| ---:| ---:| ---:|
57+ | Vertical (baseline) | 0.382 | 0.366 | 0.466 | 0.481 | 3.00 | ~ 24 s |
58+ | ** Block (ConDB)** | ** 0.711** | ** 0.672** | ** 0.805** | ** 0.813** | 7.20 | ~ 8 s |
59+
60+ Block is ** +0.33 recall@gold** at ~ 3× lower latency.
61+
62+ #### Block — per-gold-count breakdown
63+
4164| gold files | queries | cutoff | recall@gold | exact@gold | found@gold | avg returned |
4265| ---| ---:| ---:| ---:| ---:| ---:| ---:|
4366| 1 | 430 | 1 | 0.749 | 0.749 | 0.75 | 7.00 |
@@ -47,5 +70,13 @@ Metric notes:
4770| 5 | 1 | 5 | 0.200 | 0.000 | 1.00 | 2.00 |
4871| 6+ | 2 | gold | 0.274 | 0.000 | 2.00 | 10.00 |
4972
50- Full-set aggregate: ` n=500 ` , ` recall@gold=0.711 ` , ` exact@gold=0.672 ` ,
51- ` MRR=0.805 ` , ` nDCG@10=0.813 ` , ` avg gold=1.24 ` , ` avg returned=7.20 ` .
73+ #### Vertical — per-gold-count breakdown
74+
75+ | gold files | queries | cutoff | recall@gold | exact@gold | found@gold | avg returned |
76+ | ---| ---:| ---:| ---:| ---:| ---:| ---:|
77+ | 1 | 430 | 1 | 0.412 | 0.412 | 0.41 | 3.07 |
78+ | 2 | 48 | 2 | 0.250 | 0.125 | 0.50 | 2.67 |
79+ | 3 | 13 | 3 | 0.103 | 0.000 | 0.31 | 2.54 |
80+ | 4 | 6 | 4 | 0.042 | 0.000 | 0.17 | 2.50 |
81+ | 5 | 1 | 5 | 0.400 | 0.000 | 2.00 | 5.00 |
82+ | 6+ | 2 | gold | 0.000 | 0.000 | 0.00 | 0.00 |
0 commit comments