Skip to content

Commit 4d981fb

Browse files
committed
sync generated colab notebooks
1 parent 72b86e9 commit 4d981fb

6 files changed

Lines changed: 249 additions & 201 deletions

docs/colab_notebooks/1-the-basics.ipynb

Lines changed: 44 additions & 36 deletions
Original file line numberDiff line numberDiff line change
@@ -2,15 +2,17 @@
22
"cells": [
33
{
44
"cell_type": "markdown",
5-
"id": "e9bc2aab",
6-
"metadata": {},
5+
"id": "f5bc03e0",
6+
"metadata": {
7+
"nemo_colab_inject": true
8+
},
79
"source": [
810
"<a href=\"https://colab.research.google.com/github/NVIDIA-NeMo/DataDesigner/blob/main/docs/colab_notebooks/1-the-basics.ipynb\" target=\"_parent\"><img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/></a>"
911
]
1012
},
1113
{
1214
"cell_type": "markdown",
13-
"id": "33dcb5be",
15+
"id": "3454d676",
1416
"metadata": {},
1517
"source": [
1618
"# 🎨 Data Designer Tutorial: The Basics\n",
@@ -22,7 +24,7 @@
2224
},
2325
{
2426
"cell_type": "markdown",
25-
"id": "adb77b8d",
27+
"id": "4737bc0d",
2628
"metadata": {},
2729
"source": [
2830
"### 📦 Import Data Designer\n",
@@ -34,8 +36,10 @@
3436
},
3537
{
3638
"cell_type": "markdown",
37-
"id": "170ce1ea",
38-
"metadata": {},
39+
"id": "cf21b784",
40+
"metadata": {
41+
"nemo_colab_inject": true
42+
},
3943
"source": [
4044
"### ⚡ Colab Setup\n",
4145
"\n",
@@ -45,8 +49,10 @@
4549
{
4650
"cell_type": "code",
4751
"execution_count": null,
48-
"id": "67e478f9",
49-
"metadata": {},
52+
"id": "a87ee38a",
53+
"metadata": {
54+
"nemo_colab_inject": true
55+
},
5056
"outputs": [],
5157
"source": [
5258
"%%capture\n",
@@ -56,8 +62,10 @@
5662
{
5763
"cell_type": "code",
5864
"execution_count": null,
59-
"id": "533fc40d",
60-
"metadata": {},
65+
"id": "d6c4cc8a",
66+
"metadata": {
67+
"nemo_colab_inject": true
68+
},
6169
"outputs": [],
6270
"source": [
6371
"import getpass\n",
@@ -74,7 +82,7 @@
7482
{
7583
"cell_type": "code",
7684
"execution_count": null,
77-
"id": "9ad92889",
85+
"id": "848dea00",
7886
"metadata": {},
7987
"outputs": [],
8088
"source": [
@@ -84,7 +92,7 @@
8492
},
8593
{
8694
"cell_type": "markdown",
87-
"id": "0232c4c6",
95+
"id": "b97786a8",
8896
"metadata": {},
8997
"source": [
9098
"### ⚙️ Initialize the Data Designer interface\n",
@@ -97,7 +105,7 @@
97105
{
98106
"cell_type": "code",
99107
"execution_count": null,
100-
"id": "fbbd0cab",
108+
"id": "b31c1fc9",
101109
"metadata": {},
102110
"outputs": [],
103111
"source": [
@@ -106,7 +114,7 @@
106114
},
107115
{
108116
"cell_type": "markdown",
109-
"id": "305f635e",
117+
"id": "2fef5ae8",
110118
"metadata": {},
111119
"source": [
112120
"### 🎛️ Define model configurations\n",
@@ -123,7 +131,7 @@
123131
{
124132
"cell_type": "code",
125133
"execution_count": null,
126-
"id": "d0865b58",
134+
"id": "7a9f6398",
127135
"metadata": {},
128136
"outputs": [],
129137
"source": [
@@ -153,7 +161,7 @@
153161
},
154162
{
155163
"cell_type": "markdown",
156-
"id": "6e1624f7",
164+
"id": "1d0a178f",
157165
"metadata": {},
158166
"source": [
159167
"### 🏗️ Initialize the Data Designer Config Builder\n",
@@ -168,7 +176,7 @@
168176
{
169177
"cell_type": "code",
170178
"execution_count": null,
171-
"id": "33562cda",
179+
"id": "aacc0ec5",
172180
"metadata": {},
173181
"outputs": [],
174182
"source": [
@@ -177,7 +185,7 @@
177185
},
178186
{
179187
"cell_type": "markdown",
180-
"id": "d8ec3063",
188+
"id": "4be3497f",
181189
"metadata": {},
182190
"source": [
183191
"## 🎲 Getting started with sampler columns\n",
@@ -194,7 +202,7 @@
194202
{
195203
"cell_type": "code",
196204
"execution_count": null,
197-
"id": "70de1b0a",
205+
"id": "e212d83e",
198206
"metadata": {},
199207
"outputs": [],
200208
"source": [
@@ -203,7 +211,7 @@
203211
},
204212
{
205213
"cell_type": "markdown",
206-
"id": "991a8f34",
214+
"id": "c28350d3",
207215
"metadata": {},
208216
"source": [
209217
"Let's start designing our product review dataset by adding product category and subcategory columns.\n"
@@ -212,7 +220,7 @@
212220
{
213221
"cell_type": "code",
214222
"execution_count": null,
215-
"id": "222cbbcc",
223+
"id": "070f14e7",
216224
"metadata": {},
217225
"outputs": [],
218226
"source": [
@@ -293,7 +301,7 @@
293301
},
294302
{
295303
"cell_type": "markdown",
296-
"id": "29ca2aa3",
304+
"id": "e0d8497d",
297305
"metadata": {},
298306
"source": [
299307
"Next, let's add samplers to generate data related to the customer and their review.\n"
@@ -302,7 +310,7 @@
302310
{
303311
"cell_type": "code",
304312
"execution_count": null,
305-
"id": "4ca9ba1c",
313+
"id": "62e84282",
306314
"metadata": {},
307315
"outputs": [],
308316
"source": [
@@ -339,7 +347,7 @@
339347
},
340348
{
341349
"cell_type": "markdown",
342-
"id": "f4d54299",
350+
"id": "8cb147fa",
343351
"metadata": {},
344352
"source": [
345353
"## 🦜 LLM-generated columns\n",
@@ -354,7 +362,7 @@
354362
{
355363
"cell_type": "code",
356364
"execution_count": null,
357-
"id": "361b63b1",
365+
"id": "37a4a6d0",
358366
"metadata": {},
359367
"outputs": [],
360368
"source": [
@@ -390,7 +398,7 @@
390398
},
391399
{
392400
"cell_type": "markdown",
393-
"id": "49ca028a",
401+
"id": "49559576",
394402
"metadata": {},
395403
"source": [
396404
"### 🔁 Iteration is key – preview the dataset!\n",
@@ -407,7 +415,7 @@
407415
{
408416
"cell_type": "code",
409417
"execution_count": null,
410-
"id": "068ea8c3",
418+
"id": "0d52b447",
411419
"metadata": {},
412420
"outputs": [],
413421
"source": [
@@ -417,7 +425,7 @@
417425
{
418426
"cell_type": "code",
419427
"execution_count": null,
420-
"id": "bf196a77",
428+
"id": "088a5004",
421429
"metadata": {},
422430
"outputs": [],
423431
"source": [
@@ -428,7 +436,7 @@
428436
{
429437
"cell_type": "code",
430438
"execution_count": null,
431-
"id": "36ebb017",
439+
"id": "9780021a",
432440
"metadata": {},
433441
"outputs": [],
434442
"source": [
@@ -438,7 +446,7 @@
438446
},
439447
{
440448
"cell_type": "markdown",
441-
"id": "1dcba545",
449+
"id": "c9122bc6",
442450
"metadata": {},
443451
"source": [
444452
"### 📊 Analyze the generated data\n",
@@ -451,7 +459,7 @@
451459
{
452460
"cell_type": "code",
453461
"execution_count": null,
454-
"id": "e5164902",
462+
"id": "4d6bb3c5",
455463
"metadata": {},
456464
"outputs": [],
457465
"source": [
@@ -461,7 +469,7 @@
461469
},
462470
{
463471
"cell_type": "markdown",
464-
"id": "cc433fae",
472+
"id": "6003ae71",
465473
"metadata": {},
466474
"source": [
467475
"### 🆙 Scale up!\n",
@@ -474,7 +482,7 @@
474482
{
475483
"cell_type": "code",
476484
"execution_count": null,
477-
"id": "17132fe2",
485+
"id": "e343639d",
478486
"metadata": {},
479487
"outputs": [],
480488
"source": [
@@ -484,7 +492,7 @@
484492
{
485493
"cell_type": "code",
486494
"execution_count": null,
487-
"id": "6669442a",
495+
"id": "cd328abd",
488496
"metadata": {},
489497
"outputs": [],
490498
"source": [
@@ -497,7 +505,7 @@
497505
{
498506
"cell_type": "code",
499507
"execution_count": null,
500-
"id": "ee689b41",
508+
"id": "6a09793a",
501509
"metadata": {},
502510
"outputs": [],
503511
"source": [
@@ -509,7 +517,7 @@
509517
},
510518
{
511519
"cell_type": "markdown",
512-
"id": "6965e6ac",
520+
"id": "769dd181",
513521
"metadata": {},
514522
"source": [
515523
"## ⏭️ Next Steps\n",

0 commit comments

Comments
 (0)