File tree Expand file tree Collapse file tree
Expand file tree Collapse file tree Original file line number Diff line number Diff line change 22 "cells" : [
33 {
44 "cell_type" : " markdown" ,
5- "id" : " e9bc2aab" ,
6- "metadata" : {},
5+ "id" : " f5bc03e0" ,
6+ "metadata" : {
7+ "nemo_colab_inject" : true
8+ },
79 "source" : [
810 " <a href=\" https://colab.research.google.com/github/NVIDIA-NeMo/DataDesigner/blob/main/docs/colab_notebooks/1-the-basics.ipynb\" target=\" _parent\" ><img src=\" https://colab.research.google.com/assets/colab-badge.svg\" alt=\" Open In Colab\" /></a>"
911 ]
1012 },
1113 {
1214 "cell_type" : " markdown" ,
13- "id" : " 33dcb5be " ,
15+ "id" : " 3454d676 " ,
1416 "metadata" : {},
1517 "source" : [
1618 " # 🎨 Data Designer Tutorial: The Basics\n " ,
2224 },
2325 {
2426 "cell_type" : " markdown" ,
25- "id" : " adb77b8d " ,
27+ "id" : " 4737bc0d " ,
2628 "metadata" : {},
2729 "source" : [
2830 " ### 📦 Import Data Designer\n " ,
3436 },
3537 {
3638 "cell_type" : " markdown" ,
37- "id" : " 170ce1ea" ,
38- "metadata" : {},
39+ "id" : " cf21b784" ,
40+ "metadata" : {
41+ "nemo_colab_inject" : true
42+ },
3943 "source" : [
4044 " ### ⚡ Colab Setup\n " ,
4145 " \n " ,
4549 {
4650 "cell_type" : " code" ,
4751 "execution_count" : null ,
48- "id" : " 67e478f9" ,
49- "metadata" : {},
52+ "id" : " a87ee38a" ,
53+ "metadata" : {
54+ "nemo_colab_inject" : true
55+ },
5056 "outputs" : [],
5157 "source" : [
5258 " %%capture\n " ,
5662 {
5763 "cell_type" : " code" ,
5864 "execution_count" : null ,
59- "id" : " 533fc40d" ,
60- "metadata" : {},
65+ "id" : " d6c4cc8a" ,
66+ "metadata" : {
67+ "nemo_colab_inject" : true
68+ },
6169 "outputs" : [],
6270 "source" : [
6371 " import getpass\n " ,
7482 {
7583 "cell_type" : " code" ,
7684 "execution_count" : null ,
77- "id" : " 9ad92889 " ,
85+ "id" : " 848dea00 " ,
7886 "metadata" : {},
7987 "outputs" : [],
8088 "source" : [
8492 },
8593 {
8694 "cell_type" : " markdown" ,
87- "id" : " 0232c4c6 " ,
95+ "id" : " b97786a8 " ,
8896 "metadata" : {},
8997 "source" : [
9098 " ### ⚙️ Initialize the Data Designer interface\n " ,
97105 {
98106 "cell_type" : " code" ,
99107 "execution_count" : null ,
100- "id" : " fbbd0cab " ,
108+ "id" : " b31c1fc9 " ,
101109 "metadata" : {},
102110 "outputs" : [],
103111 "source" : [
106114 },
107115 {
108116 "cell_type" : " markdown" ,
109- "id" : " 305f635e " ,
117+ "id" : " 2fef5ae8 " ,
110118 "metadata" : {},
111119 "source" : [
112120 " ### 🎛️ Define model configurations\n " ,
123131 {
124132 "cell_type" : " code" ,
125133 "execution_count" : null ,
126- "id" : " d0865b58 " ,
134+ "id" : " 7a9f6398 " ,
127135 "metadata" : {},
128136 "outputs" : [],
129137 "source" : [
153161 },
154162 {
155163 "cell_type" : " markdown" ,
156- "id" : " 6e1624f7 " ,
164+ "id" : " 1d0a178f " ,
157165 "metadata" : {},
158166 "source" : [
159167 " ### 🏗️ Initialize the Data Designer Config Builder\n " ,
168176 {
169177 "cell_type" : " code" ,
170178 "execution_count" : null ,
171- "id" : " 33562cda " ,
179+ "id" : " aacc0ec5 " ,
172180 "metadata" : {},
173181 "outputs" : [],
174182 "source" : [
177185 },
178186 {
179187 "cell_type" : " markdown" ,
180- "id" : " d8ec3063 " ,
188+ "id" : " 4be3497f " ,
181189 "metadata" : {},
182190 "source" : [
183191 " ## 🎲 Getting started with sampler columns\n " ,
194202 {
195203 "cell_type" : " code" ,
196204 "execution_count" : null ,
197- "id" : " 70de1b0a " ,
205+ "id" : " e212d83e " ,
198206 "metadata" : {},
199207 "outputs" : [],
200208 "source" : [
203211 },
204212 {
205213 "cell_type" : " markdown" ,
206- "id" : " 991a8f34 " ,
214+ "id" : " c28350d3 " ,
207215 "metadata" : {},
208216 "source" : [
209217 " Let's start designing our product review dataset by adding product category and subcategory columns.\n "
212220 {
213221 "cell_type" : " code" ,
214222 "execution_count" : null ,
215- "id" : " 222cbbcc " ,
223+ "id" : " 070f14e7 " ,
216224 "metadata" : {},
217225 "outputs" : [],
218226 "source" : [
293301 },
294302 {
295303 "cell_type" : " markdown" ,
296- "id" : " 29ca2aa3 " ,
304+ "id" : " e0d8497d " ,
297305 "metadata" : {},
298306 "source" : [
299307 " Next, let's add samplers to generate data related to the customer and their review.\n "
302310 {
303311 "cell_type" : " code" ,
304312 "execution_count" : null ,
305- "id" : " 4ca9ba1c " ,
313+ "id" : " 62e84282 " ,
306314 "metadata" : {},
307315 "outputs" : [],
308316 "source" : [
339347 },
340348 {
341349 "cell_type" : " markdown" ,
342- "id" : " f4d54299 " ,
350+ "id" : " 8cb147fa " ,
343351 "metadata" : {},
344352 "source" : [
345353 " ## 🦜 LLM-generated columns\n " ,
354362 {
355363 "cell_type" : " code" ,
356364 "execution_count" : null ,
357- "id" : " 361b63b1 " ,
365+ "id" : " 37a4a6d0 " ,
358366 "metadata" : {},
359367 "outputs" : [],
360368 "source" : [
390398 },
391399 {
392400 "cell_type" : " markdown" ,
393- "id" : " 49ca028a " ,
401+ "id" : " 49559576 " ,
394402 "metadata" : {},
395403 "source" : [
396404 " ### 🔁 Iteration is key – preview the dataset!\n " ,
407415 {
408416 "cell_type" : " code" ,
409417 "execution_count" : null ,
410- "id" : " 068ea8c3 " ,
418+ "id" : " 0d52b447 " ,
411419 "metadata" : {},
412420 "outputs" : [],
413421 "source" : [
417425 {
418426 "cell_type" : " code" ,
419427 "execution_count" : null ,
420- "id" : " bf196a77 " ,
428+ "id" : " 088a5004 " ,
421429 "metadata" : {},
422430 "outputs" : [],
423431 "source" : [
428436 {
429437 "cell_type" : " code" ,
430438 "execution_count" : null ,
431- "id" : " 36ebb017 " ,
439+ "id" : " 9780021a " ,
432440 "metadata" : {},
433441 "outputs" : [],
434442 "source" : [
438446 },
439447 {
440448 "cell_type" : " markdown" ,
441- "id" : " 1dcba545 " ,
449+ "id" : " c9122bc6 " ,
442450 "metadata" : {},
443451 "source" : [
444452 " ### 📊 Analyze the generated data\n " ,
451459 {
452460 "cell_type" : " code" ,
453461 "execution_count" : null ,
454- "id" : " e5164902 " ,
462+ "id" : " 4d6bb3c5 " ,
455463 "metadata" : {},
456464 "outputs" : [],
457465 "source" : [
461469 },
462470 {
463471 "cell_type" : " markdown" ,
464- "id" : " cc433fae " ,
472+ "id" : " 6003ae71 " ,
465473 "metadata" : {},
466474 "source" : [
467475 " ### 🆙 Scale up!\n " ,
474482 {
475483 "cell_type" : " code" ,
476484 "execution_count" : null ,
477- "id" : " 17132fe2 " ,
485+ "id" : " e343639d " ,
478486 "metadata" : {},
479487 "outputs" : [],
480488 "source" : [
484492 {
485493 "cell_type" : " code" ,
486494 "execution_count" : null ,
487- "id" : " 6669442a " ,
495+ "id" : " cd328abd " ,
488496 "metadata" : {},
489497 "outputs" : [],
490498 "source" : [
497505 {
498506 "cell_type" : " code" ,
499507 "execution_count" : null ,
500- "id" : " ee689b41 " ,
508+ "id" : " 6a09793a " ,
501509 "metadata" : {},
502510 "outputs" : [],
503511 "source" : [
509517 },
510518 {
511519 "cell_type" : " markdown" ,
512- "id" : " 6965e6ac " ,
520+ "id" : " 769dd181 " ,
513521 "metadata" : {},
514522 "source" : [
515523 " ## ⏭️ Next Steps\n " ,
You can’t perform that action at this time.
0 commit comments