Skip to content

[NVIDIA] Add DSR1 MTP#53

Closed
kedarpotdar-nv wants to merge 12 commits into
mainfrom
kepotdar-dsr1-trt-mtp-shicli
Closed

[NVIDIA] Add DSR1 MTP#53
kedarpotdar-nv wants to merge 12 commits into
mainfrom
kepotdar-dsr1-trt-mtp-shicli

Conversation

@kedarpotdar-nv
Copy link
Copy Markdown
Collaborator

As per guidance we've recvd, wanted to add MTP ON configs for single node after GB200 is merged.
Only TRT MTP for now.

Changes:

  1. Added benchmarks/dsr1_*_mtp_slurm.sh files for fp8 and fp4
  2. modified dsr1-tmpl.yml to account for new runs.

@kedarpotdar-nv
Copy link
Copy Markdown
Collaborator Author

I'm not sure the mtp-mode: on actually triggers the dsr1_b200_fp4/fp8_trt_mtp.sh scripts.

@kedarpotdar-nv
Copy link
Copy Markdown
Collaborator Author

@kimbochen can you please guide how to add logic in runner/launch b200 to account for mtp flag?

@csahithi
Copy link
Copy Markdown
Collaborator

Can we add a variable called mtp-mode (default set to off) to benchmark-tmpl.yaml? We can pass this as an env variable which will be used by the runner script. If set as on, then we attach _mtp as a suffix to the filename so that the correct script gets picked up.

@functionstackx
Copy link
Copy Markdown
Collaborator

stale

@github-project-automation github-project-automation Bot moved this from In Progress to Done in InferenceMAX Board Jan 5, 2026
@cquil11 cquil11 changed the title Add DSR1 MTP [NVIDIA] Add DSR1 MTP Apr 8, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

Development

Successfully merging this pull request may close these issues.

3 participants