Commit 06b8388
docs: cross-link Nemotron-CC recipe as a production NeMo Curator example (#1767)
* docs: cross-link Nemotron-CC recipe as a production NeMo Curator example
Adds references to the Nemotron-CC data curation pipeline in README.md
and tutorials/README.md so users can discover a production-scale,
end-to-end example of NeMo Curator in use.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
* docs: address review comments — add in-repo SDG link and complete pipeline stages
- Link to tutorials/synthetic/nemotron_cc/ alongside external recipe in both README.md and tutorials/README.md
- Expand Key Components to include language ID & filtering and all 7 pipeline stages per sarahyurick's feedback
- Add language identification step to the Real-World Recipe callout
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
Co-authored-by: Sarah Yurick <53962159+sarahyurick@users.noreply.github.com>1 parent 4c3af97 commit 06b8388
2 files changed
Lines changed: 11 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
103 | 103 | | |
104 | 104 | | |
105 | 105 | | |
| 106 | + | |
| 107 | + | |
106 | 108 | | |
107 | 109 | | |
108 | 110 | | |
| |||
125 | 127 | | |
126 | 128 | | |
127 | 129 | | |
| 130 | + | |
128 | 131 | | |
129 | 132 | | |
130 | 133 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
15 | 15 | | |
16 | 16 | | |
17 | 17 | | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
18 | 26 | | |
19 | 27 | | |
20 | 28 | | |
| |||
0 commit comments