🎨 NeMo Data Designer v0.6.0
This release comes with two major updates:
-
⚡ Async by default: The async execution engine is now the default path in Data Designer. Pipelines schedule work at the cell level, overlapping independent columns and adapting model concurrency per provider/model.
-
🔌 Plugins are stable: Plugins have moved out of experimental mode. Seed readers, column generators, and processors can now live in normal Python packages with typed configs and entry-point discovery.
What's Changed
- fix(async): pack of fixes for async engine under degraded providers by @andreatgretel in #585
- feat(config): add deterministic fingerprint for workflow configs by @nabinchha in #587
- fix(interface): don't leak YAML default provider into user-supplied list by @nabinchha in #591
- docs: Updated telemetry disclosure by @kirit93 in #595
- chore(readme): bump tokens-generated badge to 350+ billion by @johnnygreco in #593
- ci: bump semantic_pull_request from 0.93.0 to 0.94.1 by @dependabot[bot] in #596
- feat(cli): add data-designer --version by @eric-tramel in #599
- feat: make async engine the default execution path by @andreatgretel in #592
- fix: Add --org option to NGC download command by @mikeknep in #604
- ci: add graphify structural impact analysis to PR review and structure audit by @andreatgretel in #567
- fix(config): round-trip processors and profilers by @johnnygreco in #605
- feat(models): deprecate implicit default provider routing by @nabinchha in #594
- fix(engine): validate processor plugin impls by @johnnygreco in #609
- docs: graduate plugins out of experimental mode by @johnnygreco in #603
- feat(results): add export() method and --output-format CLI flag by @przemekboruta in #540
- docs: migrate documentation from MkDocs to Fern by @lbliii in #581
- fix: normalize rollout timestamps before deriving started_at/ended_at by @eric-tramel in #556
- fix: quote review-code skill argument hint by @johnnygreco in #616
- feat(cli): show version update notice by @eric-tramel in #602
- fix(docs): unbreak published Fern site by @lbliii in #615
- fix: validate subcategory parents are sampler columns by @andreatgretel in #614
- ci: bump semantic_pull_request.yml from 0.94.1 to 1.1.0 in the all-actions group by @dependabot[bot] in #621
- feat: resume interrupted dataset generation runs (sync + async engine) by @przemekboruta in #526
- fix(config): validate subcategory parent sampler type by @johnnygreco in #628
- chore(engine): rename correction-step counter for clarity by @nabinchha in #627
- fix(config): update OpenRouter vision model id by @andreatgretel in #630
- feat: let column configs declare all model aliases for the startup health check by @nabinchha in #626
- fix: harden resume checkpoint handling by @nabinchha in #624
- docs: add plan for workflow chaining by @andreatgretel in #552
- chore: update vulnerable dependencies by @johnnygreco in #631
- docs: prepare Fern docs workflow by @andreatgretel in #622
- feat(agentic-ci): decision-ready triage and daily PR fixes by @andreatgretel in #600
- feat(cli): add plugin catalog core by @johnnygreco in #618
- feat: add fair async task scheduling by @eric-tramel in #639
- docs: add "Have It Your Way" plugin dev note by @johnnygreco in #608
- fix(interface): silence registry-default deprecation when library auto-fills it by @nabinchha in #655
New Contributors
Full Changelog: v0.5.9...v0.6.0