Skip to content

feat: Add diffusion finetuning CI pipeline for nightly runs#1728

Draft
pthombre wants to merge 5 commits intomainfrom
pranav/diffusion_nightly_runs
Draft

feat: Add diffusion finetuning CI pipeline for nightly runs#1728
pthombre wants to merge 5 commits intomainfrom
pranav/diffusion_nightly_runs

Conversation

@pthombre
Copy link
Copy Markdown
Contributor

@pthombre pthombre commented Apr 8, 2026

What does this PR do?

Adds a 4-stage CI pipeline (data download, preprocessing, finetuning, inference smoke test) for diffusion model nightly testing, starting with the Wan2.1-T2V-1.3B recipe.

Changelog

  • Add diffusion_finetune_launcher.sh script with data download, video preprocessing, distributed finetuning, and inference validation stages
  • Add nightly_recipes.yml and override_recipes.yml configs for wan2_1_t2v_flow
  • Add ci metadata (recipe_owner, time) to wan2_1_t2v_flow.yaml
  • Extend generate_ci_tests.py to support diffusion_sft stage and custom examples_dir
  • Add consolidated safetensors checkpoint loading support to generate.py
  • Bump diffusers>=0.37.0 to fix NameError in torchao_quantizer (logger used before definition in 0.36.0)

pthombre and others added 2 commits April 7, 2026 19:23
Signed-off-by: Pranav Prashant Thombre <pthombre@nvidia.com>
…antizer

diffusers 0.36.0 has a bug where `logger` is used before definition in
`quantizers/torchao/torchao_quantizer.py`, causing NameError on import
when torchao is available.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@copy-pr-bot
Copy link
Copy Markdown

copy-pr-bot bot commented Apr 8, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

NeMo Bot and others added 3 commits April 8, 2026 02:27
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
Signed-off-by: Dong Hyuk Chang <9426164+thomasdhc@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants