Skip to content

Document z_score normalization and rrf combination in Hybrid Optimizer#12483

Open
RexyZhang2001 wants to merge 2 commits into
opensearch-project:mainfrom
RexyZhang2001:doc/onboard-zscore-rrf-12482
Open

Document z_score normalization and rrf combination in Hybrid Optimizer#12483
RexyZhang2001 wants to merge 2 commits into
opensearch-project:mainfrom
RexyZhang2001:doc/onboard-zscore-rrf-12482

Conversation

@RexyZhang2001
Copy link
Copy Markdown
Contributor

@RexyZhang2001 RexyZhang2001 commented May 28, 2026

Description

Updates the Hybrid Optimizer documentation to reflect two newly added techniques:

  • z_score normalization
  • rrf (Reciprocal Rank Fusion) combination, parameterized by rank_constant

Existing language and structure are preserved; the bullet list is split into a Score-based variants group and a Rank-based variants group to accommodate RRF, which has different parameters.

Issues Resolved

Closes #12482

Version

3.7.0

Related PR

Checklist

  • By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license and subject to the Developers Certificate of Origin.
    For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Signed-off-by: Rex Zhang <yichenzg@amazon.com>
@github-actions
Copy link
Copy Markdown

Thank you for submitting your PR. The PR states are In progress (or Draft) -> Tech review -> Doc review -> Merged.

Before you submit your PR for doc review, make sure the content is technically accurate. If you need help finding a tech reviewer, tag a maintainer.

When you're ready for doc review, tag the assignee of this PR. The doc reviewer may push edits to the PR directly or leave comments and editorial suggestions for you to address (let us know in a comment if you have a preference).

* Two normalization techniques: `l2` and `min_max`.
**Score-based variants**:

* Three normalization techniques: `l2`, `min_max`, and `z_score`. (`z_score` is only paired with `arithmetic_mean`.)
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can we add link to z_score documentation (https://docs.opensearch.org/latest/search-plugins/search-pipelines/normalization-processor/#request-body-fields), fact that it's compatible with only am is not coming from SRW implementation but it's z_score level restriction

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added in new commit


**Rank-based variants**:

* The `rrf` (Reciprocal Rank Fusion) combination technique, evaluated with `rank_constant` values of `1`, `5`, `10`, `20`, and `60`.
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

please add link to score based processor page https://docs.opensearch.org/latest/search-plugins/search-pipelines/score-ranker-processor/. Worth mentioning that we are using default equal weights

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added in new commit

Signed-off-by: Rex Zhang <yichenzg@amazon.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

release-notes PR: Include this PR in the automated release notes Tech review PR: Tech review in progress v3.7.0

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[DOC] Document new normalization (z_score) and combination (rrf) techniques in HYBRID_OPTIMIZER experiments

4 participants