Deferred from PR #1180 review.
Original reviewer comment: #1180 (comment)
Context: The Phase 8 embedding-benchmark table in generated/dogfood/DOGFOOD_REPORT_v3.10.1-dev.80.md published the jina-base (768d) row with a placeholder ("benchmark still running at report cut"). Greptile flagged that an incomplete data point in an official report is misleading.
The follow-up work is:
- Finish the jina-base Hit@1/Hit@3/Hit@5 run on the same 1500-sample corpus used for minilm and jina-small.
- Either:
  - Backfill the numbers into the v3.10.1-dev.80 report (preferred, since it keeps the historical record intact), or
  - Open a small addendum file under generated/dogfood/ linking back to the original report.
- Update §8 "Benchmark Assessment" to compare jina-base vs jina-small / minilm once the data is in.
For the in-flight PR #1180, the row has been clarified as "not completed — see this issue" so reviewers don't read it as suppressed data.
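For reference when the jina-base numbers are backfilled, here is a minimal sketch of how Hit@k is typically computed: a query counts as a hit if its expected document appears among the top k ranked results. The function name `hit_at_k` and its input shapes are hypothetical and are not taken from the dogfood benchmark harness, which may compute the metric differently.

```python
from typing import Sequence


def hit_at_k(
    ranked_ids: Sequence[Sequence[str]],
    expected_ids: Sequence[str],
    k: int,
) -> float:
    """Return the fraction of queries whose expected ID is in the top-k results.

    ranked_ids[i] is the ranked result list for query i (best match first);
    expected_ids[i] is the single correct ID for query i.
    Hypothetical helper for illustration only.
    """
    if len(ranked_ids) != len(expected_ids):
        raise ValueError("ranked_ids and expected_ids must have the same length")
    if k < 1:
        raise ValueError("k must be at least 1")
    if not expected_ids:
        return 0.0

    hits = sum(
        1
        for ranked, expected in zip(ranked_ids, expected_ids)
        if expected in ranked[:k]
    )
    return hits / len(expected_ids)


# Toy data: three queries, not real benchmark output.
toy_ranked = [["a", "b", "c"], ["x", "y", "z"], ["m", "n", "o"]]
toy_expected = ["a", "y", "q"]

for k in (1, 3, 5):
    print(f"Hit@{k}: {hit_at_k(toy_ranked, toy_expected, k):.3f}")
```

Running the same function over the same 1500 samples for each model would keep the jina-base row directly comparable with the minilm and jina-small rows.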