Commit 6ff0572
authored
docs(dogfood): backfill jina-base Hit@k for v3.10.1-dev.80 (#1186)
* docs(dogfood): backfill jina-base Hit@k for v3.10.1-dev.80
Replaces the placeholder jina-base row in §8 with actual recall numbers
(Hit@1: 72.9%, Hit@3: 91.3%, Hit@5: 95.0%, misses: 41/1500) and rewrites
the assessment to reflect the finding that jina-base (768d) underperforms
jina-small (512d) at every rank cutoff on the code-identifier corpus.
Reproduced against the dev.80 source commit (1a6ee7b) using the
v3.10.1-dev.81 native tarball; the dev.80 tarball had been pruned from
GitHub releases but the only commit between dev.80 and dev.81 is a
CI-workflow refactor (4d8df7b) that leaves the Rust source unchanged.
Re-running minilm and jina-small as controls produced +1-2 pp drift vs
published values, attributable to a +2-file / +46-node corpus shift
between session-time (612 files / 17,873 nodes) and re-run-time
(614 files / 17,919 nodes). The footnote in §8 discloses this so future
readers can read the jina-base row with the same tolerance.
Closes #1181
* docs: correct embedding-size and miss-rate multipliers in dev.80 report (#1186)
* docs: correct corpus-drift range in dev.80 jina-base footnote (#1186)
The footnote claimed controls were ~+1-2 pp higher than published values
but the only cited example (jina-small +0.4 pp) sat well below 1 pp.
Replaces the inflated range with the actual observed deltas (minilm
+1.2 pp, jina-small +0.4 pp) and updates the jina-base tolerance to
match (~+0.4-1.2 pp instead of +/-1-2 pp).1 parent 4a621b9 commit 6ff0572
1 file changed
Lines changed: 5 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
312 | 312 | | |
313 | 313 | | |
314 | 314 | | |
315 | | - | |
| 315 | + | |
| 316 | + | |
| 317 | + | |
316 | 318 | | |
317 | 319 | | |
318 | 320 | | |
319 | 321 | | |
320 | 322 | | |
321 | 323 | | |
322 | | - | |
| 324 | + | |
| 325 | + | |
323 | 326 | | |
324 | 327 | | |
325 | 328 | | |
| |||
0 commit comments