[MLOB-6998] added pydantic documentation by jennm · Pull Request #35758 · DataDog/documentation

jennm · 2026-04-03T21:22:13Z

What does this PR do? What is the motivation?

This PR adds documentation for the pydantic evals integration with LLM Obs experiments.

Merge instructions

We are waiting on the next release of dd-trace to merge this pr as it contains the pydantic integration with LLM Obs experiments.

Merge readiness:

Ready for merge

For Datadog employees:

Your branch name MUST follow the <name>/<description> convention and include the forward slash (/). Without this format, your pull request will not pass CI, the GitLab pipeline will not run, and you won't get a branch preview. Getting a branch preview makes it easier for us to check any issues with your PR, such as broken links.

If your branch doesn't follow this format, rename it or create a new branch and PR.

[6/5/2025] Merge queue has been disabled on the documentation repo. If you have write access to the repo, the PR has been reviewed by a Documentation team member, and all of the required checks have passed, you can use the Squash and Merge button to merge the PR. If you don't have write access, or you need help, reach out in the #documentation channel in Slack.

AI assistance

Additional notes

github-actions · 2026-04-03T21:26:19Z

Preview links (active after the `build_preview` check completes)

New or renamed files

https://docs-staging.datadoghq.com/jenn/MLOB-6998/llm_observability/evaluations/pydantic_evaluations

Modified Files

FouadWahabi · 2026-04-07T08:23:05Z

content/en/llm_observability/evaluations/pydantic_evaluations.md

+
+Pydantic is an open source framework that provides ready-to-use evaluations and allows for customizable LLM evaluations. For more information, see [Pydantic's documentation][3].
+
+You can use LLM Observability to run Pydantic evaluations and scalar Pydantic report evaluations in [Experiments][1]. Pydantic evaluation results appear as evaluator results tied to each instance in an [LLM Observability dataset][5]. Pydantic report evaluations appear as a scalar result tied to an LLM Observability dataset.


Pydantic report evaluations appear as a scalar result tied to an LLM Observability dataset

The evaluator result should be tied to the spans not the dataset, this could be a bit misleading

Maybe this should be rephrased. There is one report evaluation result for an entire dataset. I rephrased to

Pydantic report evaluations run on an entire LLM Observability dataset and report one scalar result for the dataset.

which is hopefully more clear :)

content/en/llm_observability/evaluations/pydantic_evaluations.md

…o jenn/MLOB-6998

added pydantic documentation

3db5216

jennm requested a review from a team as a code owner April 3, 2026 21:22

github-actions bot added Architecture Everything related to the Doc backend Images Images are added/removed with this PR Guide Content impacting a guide labels Apr 3, 2026

gsvigruha approved these changes Apr 3, 2026

View reviewed changes

Update pydantic_evaluations.md

6662943

cswatt approved these changes Apr 6, 2026

View reviewed changes

FouadWahabi reviewed Apr 7, 2026

View reviewed changes

content/en/llm_observability/evaluations/pydantic_evaluations.md Show resolved Hide resolved

jennm added 4 commits April 7, 2026 09:08

added LLMJudge as example

5d40a42

Merge branch 'jenn/MLOB-6998' of github.com:DataDog/documentation int…

1aa2a32

…o jenn/MLOB-6998

rephrased to make more clear

d8e25cd

improved wording

5fca433

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MLOB-6998] added pydantic documentation#35758

[MLOB-6998] added pydantic documentation#35758
jennm wants to merge 6 commits intomasterfrom
jenn/MLOB-6998

jennm commented Apr 3, 2026

Uh oh!

github-actions bot commented Apr 3, 2026

Uh oh!

FouadWahabi Apr 7, 2026

Uh oh!

jennm Apr 7, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants


		Pydantic is an open source framework that provides ready-to-use evaluations and allows for customizable LLM evaluations. For more information, see [Pydantic's documentation][3].

		You can use LLM Observability to run Pydantic evaluations and scalar Pydantic report evaluations in [Experiments][1]. Pydantic evaluation results appear as evaluator results tied to each instance in an [LLM Observability dataset][5]. Pydantic report evaluations appear as a scalar result tied to an LLM Observability dataset.

Conversation

jennm commented Apr 3, 2026

What does this PR do? What is the motivation?

Merge instructions

AI assistance

Additional notes

Uh oh!

github-actions bot commented Apr 3, 2026

Preview links (active after the build_preview check completes)

New or renamed files

Modified Files

Uh oh!

FouadWahabi Apr 7, 2026

Choose a reason for hiding this comment

Uh oh!

jennm Apr 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Preview links (active after the `build_preview` check completes)

jennm Apr 7, 2026 •

edited

Loading