Add lasttoken and none pooling modes to text embedding model docs #12390

Open
aneesh-db wants to merge 1 commit into opensearch-project:main from aneesh-db:add-lasttoken-none-pooling-modes

Conversation

@aneesh-db

Description

Documents two new pooling_mode values supported by the ml-commons text embedding model registration API:

  • lasttoken — uses the last non-padding token's embedding; useful for decoder-only models (e.g., Qwen3-Embedding) where the final token captures cumulative context through causal attention.
  • none — uses the model's pre-pooled output directly, with no additional pooling; suitable for models that already return pooled embeddings through outputs such as sentence_embedding or pooler_output.
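
To make the two modes above concrete, here is a minimal pure-Python sketch of what each one computes for a single sequence. It is an illustration only, not the ml-commons implementation: the function names `last_token_pool` and `no_pool`, and the convention that an attention mask value of 1 marks a real token, are assumptions made for this example.

```python
from typing import List, Sequence

Vector = List[float]


def last_token_pool(token_embeddings: Sequence[Vector],
                    attention_mask: Sequence[int]) -> Vector:
    """Return the embedding of the last non-padding token (mask value 1)."""
    if len(token_embeddings) != len(attention_mask):
        raise ValueError("token_embeddings and attention_mask must have the same length")
    # Scan from the end so both left-padded and right-padded inputs work.
    for idx in range(len(attention_mask) - 1, -1, -1):
        if attention_mask[idx] == 1:
            return list(token_embeddings[idx])
    raise ValueError("attention_mask contains no non-padding tokens")


def no_pool(pre_pooled: Vector) -> Vector:
    """Hypothetical 'none' mode: the model output is already a sentence vector."""
    return list(pre_pooled)


# Right-padded sequence: three real tokens followed by one padding token.
tokens = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6], [0.0, 0.0]]
mask = [1, 1, 1, 0]
print(last_token_pool(tokens, mask))  # [0.5, 0.6]
print(no_pool([0.7, 0.8]))            # [0.7, 0.8]
```

Scanning from the end of the mask, rather than indexing at `sum(mask) - 1`, keeps the sketch correct for left-padded batches as well, which are common with decoder-only tokenizers.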

Issues Resolved

Closes #12076
Closes #12075

Related ml-commons PRs

Version

3.4

Checklist

  • By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license and DCO sign-off has been added.

Document the new lasttoken and none pooling_mode values supported by
the ml-commons text embedding model registration API.

Closes opensearch-project#12076
Closes opensearch-project#12075

Signed-off-by: Aneesh Nema <aneesh.nema@databricks.com>
@github-actions

github-actions Bot commented May 6, 2026

Thank you for submitting your PR. The PR states are In progress (or Draft) -> Tech review -> Doc review -> Merged.

Before you submit your PR for doc review, make sure the content is technically accurate. If you need help finding a tech reviewer, tag a maintainer.

When you're ready for doc review, tag the assignee of this PR. The doc reviewer may push edits to the PR directly or leave comments and editorial suggestions for you to address (let us know in a comment if you have a preference).

@github-actions github-actions Bot added the Tech review PR: Tech review in progress label May 6, 2026
@kolchfa-aws
Collaborator

@rithinpullela Could you review this PR?

Labels

Tech review PR: Tech review in progress

Projects

None yet

Development

Successfully merging this pull request may close these issues.

  • [DOC] Add LAST_TOKEN pooling mode to text embedding model documentation
  • [DOC] Add NONE pooling mode to text embedding model documentation

2 participants