README.md
+17 -28 (17 additions & 28 deletions)
@@ -143,40 +143,17 @@ This repository supports agent-assisted development — see [CONTRIBUTING.md](CO

 ## Telemetry

-Data Designer collects telemetry to help us improve the library for developers. We collect:
+Data Designer collects telemetry to help us improve the library for developers. This data is not used to track any individual user behavior. It is used to see an aggregation of which models are the most popular for SDG. We will share this usage data with the community.

-* The names of models used
-* The count of input tokens
-* The count of output tokens
+Disable with `NEMO_TELEMETRY_ENABLED=false`. **[More details →](#telemetry-and-privacy)**

-**No user or device information is collected.** This data is not used to track any individual user behavior. It is used to see an aggregation of which models are the most popular for SDG. We will share this usage data with the community.
+### Top models (YTD)

-Specifically, a model name that is defined a `ModelConfig` object, is what will be collected. In the below example config:
-The value `nvidia/nemotron-3-super-120b-a12b` would be collected.
-
-To disable telemetry capture, set `NEMO_TELEMETRY_ENABLED=false`.
-
-### Top Models
-
-This chart represents the breakdown of models used for Data Designer across all synthetic data generation jobs from 2/23/2026 to 3/23/2026.
+Aggregate model usage across synthetic data generation jobs, year-to-date 1/1/2026–5/1/2026:

-_Last updated on 3/23/2026_
+_Last updated on May 1, 2026_

 ---

@@ -199,3 +176,15 @@ If you use NeMo Data Designer in your research, please cite it using the followi
 note = {GitHub Repository},
 }
 ```
+
+---
+
+<a id="telemetry-and-privacy"></a>
+
+## Telemetry & privacy
+
+NeMo Data Designer includes an optional function to share anonymous telemetry data with NVIDIA for product improvement. Data collected is limited to names of models used and token counts (input and output). No user or device information is collected. This data is used to prioritize product improvements and will be shared in aggregate with the community. It is not used to track any individual user behavior.
+
+You may opt out of telemetry collection at any time. Opting out applies only to data collection by the NeMo Data Designer library itself.
+
+**Use of third-party endpoints, including NVIDIA Build:** NeMo Data Designer can be configured to use various inference endpoints, including [build.nvidia.com](https://build.nvidia.com) (NVIDIA Build). If you choose to use NVIDIA Build or any other third-party endpoint, that endpoint's own terms of service and privacy practices apply independently of this library. Any opt-out you exercise within NeMo Data Designer does not extend to data collection by your chosen endpoint. NVIDIA Build is intended for evaluation and testing purposes only and may not be used in production environments. Do not submit any confidential information or personal data when using NVIDIA Build.
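
The opt-out described in the README text can also be applied from Python. What follows is a minimal sketch under one assumption: that the library reads `NEMO_TELEMETRY_ENABLED` from the process environment. The README gives only the variable name and the value `false`, not the point at which it is read, so the sketch sets the variable before any library import. The helper names `disable_telemetry` and `is_opted_out` are hypothetical and are not part of NeMo Data Designer.

```python
import os
from typing import MutableMapping, Optional

# The variable name and the value "false" come from the README.
# Everything else in this sketch is illustrative.
TELEMETRY_ENV_VAR = "NEMO_TELEMETRY_ENABLED"


def disable_telemetry(env: Optional[MutableMapping[str, str]] = None) -> None:
    """Set NEMO_TELEMETRY_ENABLED=false in `env` (defaults to os.environ)."""
    target = os.environ if env is None else env
    target[TELEMETRY_ENV_VAR] = "false"


def is_opted_out(env: Optional[MutableMapping[str, str]] = None) -> bool:
    """Return True if `env` carries the documented opt-out value.

    The case-insensitive comparison is a convenience of this sketch,
    not documented library behavior.
    """
    target = os.environ if env is None else env
    return target.get(TELEMETRY_ENV_VAR, "").strip().lower() == "false"


if __name__ == "__main__":
    # Assumption: setting the variable before importing the library
    # ensures the value is visible whenever the library reads it.
    disable_telemetry()
    print(is_opted_out())
```

Exporting the variable in the shell that launches the job has the same effect on the process environment. Per the text above, neither approach changes what a third-party inference endpoint collects.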