Skip to content

Tutorial for LLM as a Judge, and documenting the apis in SRW pages.#12083

Merged
kolchfa-aws merged 30 commits into
opensearch-project:mainfrom
o19s:add_llm_as_a_judge_docs
May 28, 2026
Merged

Tutorial for LLM as a Judge, and documenting the apis in SRW pages.#12083
kolchfa-aws merged 30 commits into
opensearch-project:mainfrom
o19s:add_llm_as_a_judge_docs

Conversation

@epugh
Copy link
Copy Markdown
Collaborator

@epugh epugh commented Mar 10, 2026

Description

Documentation in SRW about this new judgment creation capabilty, and then a dedicated tutorial inspired by tutorials/vector-search/neural-search-tutorial example.

Issues Resolved

n/a

Version

3.5

Frontend features

n/a

Checklist

  • [x ] By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license and subject to the Developers Certificate of Origin.
    For more information on following Developer Certificate of Origin and signing off your commits, please check here.

@github-actions
Copy link
Copy Markdown

Thank you for submitting your PR. The PR states are In progress (or Draft) -> Tech review -> Doc review -> Merged.

Before you submit your PR for doc review, make sure the content is technically accurate. If you need help finding a tech reviewer, tag a maintainer.

When you're ready for doc review, tag the assignee of this PR. The doc reviewer may push edits to the PR directly or leave comments and editorial suggestions for you to address (let us know in a comment if you have a preference).

@epugh
Copy link
Copy Markdown
Collaborator Author

epugh commented Mar 10, 2026

cc @wrigleyDan @heemin32

gweyeratlassian and others added 17 commits April 6, 2026 09:14
* Add RomanianAnalyzer to 3.x breaking changes

Signed-off-by: Gabriel Weyer <159976942+gweyeratlassian@users.noreply.github.com>

* Improve Romanian analyzer breaking change wording

Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>
Signed-off-by: Gabriel Weyer <159976942+gweyeratlassian@users.noreply.github.com>

---------

Signed-off-by: Gabriel Weyer <159976942+gweyeratlassian@users.noreply.github.com>
Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
…-project#11978)

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
…11981)

* Add warning about truncation to ML documentation

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>

* Apply suggestions from code review

Signed-off-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>

* Apply suggestion from @kolchfa-aws

Signed-off-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>

---------

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>
Signed-off-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
Signed-off-by: xiaoyuan0821 <yuan13215@163.com>
Co-authored-by: x00815292 <niklaus.xiao@huawei.com>
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
…#12065)

Signed-off-by: Mykola Shestopal <nick4eva@users.noreply.github.com>
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
…project#12020)

* Add restoring snapshot from a remote-backed cluster docs

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>

* Formatting

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>

* Link fix and reword

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>

---------

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
…roject#12071)

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
* Add 2.19.5 to version history

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>

* Updated release description

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>

---------

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
…ct#11982)

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
…pensearch-project#12089)

* Fix create sparse vector index error

Signed-off-by: xiaoyuan0821 <yuan13215@163.com>

* Create index with explicit mapping for sort by geo distance example

Signed-off-by: xiaoyuan0821 <yuan13215@163.com>

* Add copy buttons and intro sentence to mapping

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>

---------

Signed-off-by: xiaoyuan0821 <yuan13215@163.com>
Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>
Co-authored-by: x00815292 <niklaus.xiao@huawei.com>
Co-authored-by: Fanit Kolchina <kolchfa@amazon.com>
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
)

* Adding rerank documentation for agentic search

Signed-off-by: Joshua Palis <jpalis@amazon.com>

* Doc review

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>

* Reformat requests and clarify steps

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>

* Apply suggestion from @kolchfa-aws

Signed-off-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>

---------

Signed-off-by: Joshua Palis <jpalis@amazon.com>
Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>
Signed-off-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>
Co-authored-by: Fanit Kolchina <kolchfa@amazon.com>
Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
@epugh epugh force-pushed the add_llm_as_a_judge_docs branch from 735a0a0 to 223dfef Compare April 6, 2026 07:14
epugh added 2 commits May 21, 2026 21:08
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
@epugh epugh marked this pull request as ready for review May 22, 2026 01:13
@epugh epugh requested a review from kolchfa-aws as a code owner May 22, 2026 01:13
epugh added 2 commits May 21, 2026 22:17
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
@pajuric
Copy link
Copy Markdown

pajuric commented May 22, 2026

@kolchfa-aws - What's the status here? Is this ready to push or does it require more reviews?

@kolchfa-aws
Copy link
Copy Markdown
Collaborator

@pajuric This is documentation, not a blog. It is in my queue for review since today.

@kolchfa-aws kolchfa-aws added Doc review PR: Doc review in progress and removed Tech review PR: Tech review in progress labels May 26, 2026
@epugh
Copy link
Copy Markdown
Collaborator Author

epugh commented May 27, 2026

@kolchfa-aws I will make sure to prioritize any feedback... One thing I've been struggling with is should I weave in more screenshots of the SRW UI.... Steps 3,4,5 can all be done via the SRW UI, though instead I went with the API-first approach. I was thinking of saying "You can also do these following steps using the SRW UI"? I know we try to minimize the number of images....

Copy link
Copy Markdown
Collaborator Author

@epugh epugh left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Vale flagged a couple of false positive alerts on "LLM" in code chunk. Some correct style errors to be fixed!

:--- | :--- | :---
`name` | String | The name of the judgment list.
`description` | String | Optional. A description of the judgment list.
`type` | String | Set to `LLM_JUDGMENT`.
Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I wonder if we can tweak vale to not do this on code chunks? I think this is technically a false positive? Would you agree @kolchfa-aws ?

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Vale already ignores code blocks and code font by default. Very rarely you'll see a false positive. You can try and tweak it but usually it's not worth the time.

Comment thread _tutorials/llm-as-a-judge-tutorial.md Outdated

You can adapt the connector configuration for other providers:

#### AWS Bedrock example:
Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Bedrock is a name, so maybe it needs adding to our vocabulary? AWS bedrock example seems werid!

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's because it should be "Amazon Bedrock", not "AWS Bedrock".

Comment thread _tutorials/llm-as-a-judge-tutorial.md Outdated
Congratulations, you are now ready to [run an experiment to evaluate search quality]({{site.url}}{{site.baseurl}}/search-plugins/search-relevance/evaluate-search-quality/#creating-a-pointwise-experiment) using the LLM-generated judgments that you have just created. You can reuse the search configuration and query set's that you have already created in this tutorial when you run your first evaluation.


## Advanced features
Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@kolchfa-aws do you think this advacned features sections more properly belongs in the documentation? I'm getting that feeling now that I am re-reading....

Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Okay, I went ahead and moved it to the reference page since we don't talk to it in the tutorial.

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, that's better

epugh added 5 commits May 27, 2026 09:06
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
…ordered with the top level descirpiton, and rewrite a lot of passive language to make ti punchier

Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
…e tutorial

Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
@epugh
Copy link
Copy Markdown
Collaborator Author

epugh commented May 27, 2026

Okay, went and reworked the judgments page to have a better structure now that we added llm-as-a-judge content. Plus revamped the text to be less passive!

@epugh epugh changed the title first pass of dedicated tutorial for LLM as a Judge, and documenting the apis in SRW page. Tutorial for LLM as a Judge, and documenting the apis in SRW pages. May 27, 2026
@kolchfa-aws
Copy link
Copy Markdown
Collaborator

@epugh Normally, we try to minimize the number of screenshots because they go stale and are hard to maintain with each UI change. Instructions should be enough for users to do the tasks they want to do.

@pajuric
Copy link
Copy Markdown

pajuric commented May 27, 2026

@pajuric This is documentation, not a blog. It is in my queue for review since today.

I realize it's not a blog, but it requires internal review before I can publish the blog that relates to the topic. I appreciate your review and feedback.

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>
Comment thread _tutorials/llm-as-a-judge-tutorial.md Outdated
Signed-off-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>
Copy link
Copy Markdown
Collaborator

@kolchfa-aws kolchfa-aws left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you, @epugh! I simplified the tutorial steps because this is a more advanced tutorial that assumes that users are already familiar with creating connectors, registering models, etc. Doc review complete and changes applied.

@kolchfa-aws
Copy link
Copy Markdown
Collaborator

@pajuric Review complete. Merging this PR.

@kolchfa-aws kolchfa-aws merged commit 8e020f9 into opensearch-project:main May 28, 2026
6 checks passed
@pajuric
Copy link
Copy Markdown

pajuric commented May 28, 2026

@kolchfa-aws thank you for the review and merge. It's appreciated.

kolchfa-aws added a commit that referenced this pull request May 28, 2026
…12083) (#12479)

* Add RomanianAnalyzer to 3.x breaking changes (#12008)

* Add RomanianAnalyzer to 3.x breaking changes



* Improve Romanian analyzer breaking change wording




---------





* first pass of tutorial



* add reference docs.



* Fix up vale violations



* Standarize on the name wikipedia uses for this technique



* Clarify the effect of bulk operations on ingest pipelines (#11978)




* Add warning about truncation to ML documentation (#11981)

* Add warning about truncation to ML documentation



* Apply suggestions from code review



* Apply suggestion from @kolchfa-aws



---------





* Fix create sparse vector index error (#12066)





* Fix formatting for Boolean AND examples in dql.md (#12065)




* Add restoring snapshot from a remote-backed cluster docs (#12020)

* Add restoring snapshot from a remote-backed cluster docs



* Formatting



* Link fix and reword



---------




* Change copy curl buttons to copy only in workspace docs (#12071)




* Add 2.19.5 to version history (#12078)

* Add 2.19.5 to version history



* Updated release description



---------




* Add additional details to search backpressure stats (#11982)




* Create index with explicit mapping for sort by geo distance example (#12089)

* Fix create sparse vector index error



* Create index with explicit mapping for sort by geo distance example



* Add copy buttons and intro sentence to mapping



---------







* Adding rerank documentation for agentic search (#12081)

* Adding rerank documentation for agentic search



* Doc review



* Reformat requests and clarify steps



* Apply suggestion from @kolchfa-aws



---------








* fix up links



* prune back extra verbiage and ideas that arent going to land



* Backout local only change



* Refinig text and links



* Refining text



* Try to be clearer in Step 6



* Responding to vale feedback



* Ensure standard naming patterns followed



* use ALL CAPS format for variables



* Rework the order of the descriptions of the types of judgments to be ordered with the top level descirpiton, and rewrite a lot of passive language to make ti punchier



* Move the advanced feature text to the ref page, they arent part of the tutorial



* Doc review



* Apply suggestion from @kolchfa-aws



---------

Signed-off-by: Gabriel Weyer <159976942+gweyeratlassian@users.noreply.github.com>
Signed-off-by: Eric Pugh <epugh@opensourceconnections.com>
Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>
Signed-off-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>
Signed-off-by: xiaoyuan0821 <yuan13215@163.com>
Signed-off-by: Mykola Shestopal <nick4eva@users.noreply.github.com>
Signed-off-by: Joshua Palis <jpalis@amazon.com>
Co-authored-by: Eric Pugh <epugh@opensourceconnections.com>
Co-authored-by: Gabriel Weyer <159976942+gweyeratlassian@users.noreply.github.com>
Co-authored-by: Yuan <yuan13215@163.com>
Co-authored-by: x00815292 <niklaus.xiao@huawei.com>
Co-authored-by: Mykola Shestopal <nick4eva@users.noreply.github.com>
Co-authored-by: Joshua Palis <jpalis@amazon.com>
@epugh
Copy link
Copy Markdown
Collaborator Author

epugh commented May 28, 2026

Awesome @kolchfa-aws !

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

backport 3.6 Doc review PR: Doc review in progress

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants