Skip to content

cp: test: add vLLM deployment tests into r0.4.0#1745

Merged
akoumpa merged 1 commit intor0.4.0from
cherry-pick-1656-r0.4.0
Apr 9, 2026
Merged

cp: test: add vLLM deployment tests into r0.4.0#1745
akoumpa merged 1 commit intor0.4.0from
cherry-pick-1656-r0.4.0

Conversation

@svcnvidia-nemo-ci
Copy link
Copy Markdown
Contributor

beep boop [🤖]: Hi @adil-a 👋,

we've cherry picked #1656 into  for you! 🚀

Please review and approve this cherry pick by your convenience!

* test: add vLLM deployment tests for checkpoint robustness

vLLM deployment verification tests that load consolidated checkpoints
and compare greedy output token-for-token against HuggingFace.
Supports both full comparison and smoke test mode.

Depends on checkpoint robustness PR #1606.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Signed-off-by: adil-a <adil.asif2000@hotmail.com>

* Create deploy-test dependency group

Signed-off-by: Dong Hyuk Chang <9426164+thomasdhc@users.noreply.github.com>

* Revert deploy test group

Signed-off-by: Dong Hyuk Chang <9426164+thomasdhc@users.noreply.github.com>

* Move configs to recipes and create vllm_launcher

Signed-off-by: Dong Hyuk Chang <9426164+thomasdhc@users.noreply.github.com>

* Setup deploy environment

Signed-off-by: Dong Hyuk Chang <9426164+thomasdhc@users.noreply.github.com>

* Remove duplicate keys

Signed-off-by: Dong Hyuk Chang <9426164+thomasdhc@users.noreply.github.com>

* Add scope to vllm deploy test

Signed-off-by: Dong Hyuk Chang <9426164+thomasdhc@users.noreply.github.com>

* Drop needs dependency

Signed-off-by: Dong Hyuk Chang <9426164+thomasdhc@users.noreply.github.com>

* Use finetune test name for ckpt dir

Signed-off-by: Dong Hyuk Chang <9426164+thomasdhc@users.noreply.github.com>

* Make ckpt checking more robust

Signed-off-by: Dong Hyuk Chang <9426164+thomasdhc@users.noreply.github.com>

* Pass arguments correctly

Signed-off-by: Dong Hyuk Chang <9426164+thomasdhc@users.noreply.github.com>

* Update arguments

Signed-off-by: Dong Hyuk Chang <9426164+thomasdhc@users.noreply.github.com>

* Remove unused file

Signed-off-by: Dong Hyuk Chang <9426164+thomasdhc@users.noreply.github.com>

---------

Signed-off-by: adil-a <adil.asif2000@hotmail.com>
Signed-off-by: Dong Hyuk Chang <9426164+thomasdhc@users.noreply.github.com>
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-authored-by: Dong Hyuk Chang <9426164+thomasdhc@users.noreply.github.com>
Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
@svcnvidia-nemo-ci
Copy link
Copy Markdown
Contributor Author

/ok to test 04bbd69

@copy-pr-bot
Copy link
Copy Markdown

copy-pr-bot bot commented Apr 9, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@akoumpa akoumpa changed the title cp: test: add vLLM deployment tests for checkpoint robustness (1656) into r0.4.0 cp: test: add vLLM deployment tests into r0.4.0 Apr 9, 2026
@akoumpa akoumpa merged commit 83062e2 into r0.4.0 Apr 9, 2026
54 of 56 checks passed
@akoumpa akoumpa deleted the cherry-pick-1656-r0.4.0 branch April 9, 2026 16:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cherry-pick Run CICD Trigger Testing CICD

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants