Skip to content

example dataset/pipeline for terminus judge#1374

Open
kbhardwaj-nvidia wants to merge 4 commits into
mainfrom
kbhardwaj/tb-dataset-dco
Open

example dataset/pipeline for terminus judge#1374
kbhardwaj-nvidia wants to merge 4 commits into
mainfrom
kbhardwaj/tb-dataset-dco

Conversation

@kbhardwaj-nvidia
Copy link
Copy Markdown
Contributor

  • Added example data pipeline under resources_servers/terminus_judge/ only (no core Gym code/config changes).
  • Converts terminus trajectory conversations into per-turn samples matching terminus_judge schema shape.
  • Docs in resources_servers/terminus_judge/scripts/README.md for all 3 stages.

Used https://huggingface.co/datasets/open-thoughts/OpenThoughts-Agent-v1-SFT as an example for the trajectories

Signed-off-by: Khushi Bhardwaj <kbhardwaj@nvidia.com>
Signed-off-by: Khushi Bhardwaj <kbhardwaj@nvidia.com>
Signed-off-by: Khushi Bhardwaj <kbhardwaj@nvidia.com>
Signed-off-by: Khushi Bhardwaj <kbhardwaj@nvidia.com>
@copy-pr-bot
Copy link
Copy Markdown

copy-pr-bot Bot commented May 20, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@kbhardwaj-nvidia kbhardwaj-nvidia changed the title Kbhardwaj/tb dataset dco example dataset/pipeline for terminus judge May 20, 2026
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can this go somewhere else? huggingface or something? i dont think we normally put train jsonl on github.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants