Skip to content

cp: docs: cross-link Nemotron-CC recipe as a production NeMo Curator example (1767) into r1.2.0#1860

Merged
sarahyurick merged 1 commit intor1.2.0from
cherry-pick-1767-r1.2.0
Apr 22, 2026
Merged

cp: docs: cross-link Nemotron-CC recipe as a production NeMo Curator example (1767) into r1.2.0#1860
sarahyurick merged 1 commit intor1.2.0from
cherry-pick-1767-r1.2.0

Conversation

@svcnvidia-nemo-ci
Copy link
Copy Markdown
Contributor

beep boop [🤖]: Hi @arhamm1 👋,

we've cherry picked #1767 into  for you! 🚀

Please review and approve this cherry pick by your convenience!

…ple (#1767)

* docs: cross-link Nemotron-CC recipe as a production NeMo Curator example

Adds references to the Nemotron-CC data curation pipeline in README.md
and tutorials/README.md so users can discover a production-scale,
end-to-end example of NeMo Curator in use.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* docs: address review comments — add in-repo SDG link and complete pipeline stages

- Link to tutorials/synthetic/nemotron_cc/ alongside external recipe in both README.md and tutorials/README.md
- Expand Key Components to include language ID & filtering and all 7 pipeline stages per sarahyurick's feedback
- Add language identification step to the Real-World Recipe callout

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

---------

Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
Co-authored-by: Sarah Yurick <53962159+sarahyurick@users.noreply.github.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
@svcnvidia-nemo-ci
Copy link
Copy Markdown
Contributor Author

/ok to test 7290531

@copy-pr-bot
Copy link
Copy Markdown

copy-pr-bot Bot commented Apr 22, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@greptile-apps
Copy link
Copy Markdown
Contributor

greptile-apps Bot commented Apr 22, 2026

Greptile Summary

This is a documentation-only cherry-pick (from main #1767 into r1.2.0) that adds cross-links to the Nemotron-CC end-to-end curation recipe as a production NeMo Curator example. Internal relative links resolve correctly to the existing tutorials/synthetic/nemotron_cc/ directory.

Confidence Score: 5/5

Documentation-only change with valid internal links; safe to merge.

No code changes; only markdown documentation additions. Internal relative paths (tutorials/synthetic/nemotron_cc/) were verified to exist in the repository. No P0/P1 issues found.

No files require special attention.

Important Files Changed

Filename Overview
README.md Adds a "Real-World Recipe" prose block and a "Recipes" row to the resources table, cross-linking the Nemotron-CC external pipeline and the in-repo SDG tutorial.
tutorials/README.md Adds a "Production Recipes" section with a table entry for Nemotron-CC, linking to the external GitHub recipe and the in-repo synthetic tutorial.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[README.md] -->|links to| B[External: Nemotron-CC GitHub Recipe]
    A -->|links to| C[tutorials/synthetic/nemotron_cc/]
    D[tutorials/README.md] -->|Production Recipes section| B
    D -->|links to| C
    B -->|reproduces| E[HuggingFace: nvidia/Nemotron-CC-v2]
Loading

Reviews (1): Last reviewed commit: "docs: cross-link Nemotron-CC recipe as a..." | Re-trigger Greptile

@sarahyurick sarahyurick merged commit 1180b1f into r1.2.0 Apr 22, 2026
49 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants