Update B200 Dsv4 configs by wzhao18 · Pull Request #1655 · SemiAnalysisAI/InferenceX

wzhao18 · 2026-06-03T16:46:25Z

Note

Medium Risk
Changes the vLLM image pin and the MoE/EPLB flags for DP-attention sweep points, which can shift measured throughput and latency and reduce comparability with prior results.

Overview
Updates DeepSeek-V4 FP4 B200 vLLM benchmark serving so DP-attention search points turn on EPLB (expert parallel load balancing) with torch_nccl and synchronous EPLB, alongside the existing deep_gemm_mega_moe MoE backend in dsv4_fp4_b200_vllm.sh.

Pins the dsv4-fp4-b200-vllm config in nvidia-master.yaml to a specific vLLM nightly container image instead of v0.22.0, and records the EPLB change in perf-changelog.yaml.
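
The gating described above (EPLB flags only for DP-attention sweep points) could be sketched roughly as follows. This is a minimal illustration, not the contents of dsv4_fp4_b200_vllm.sh: `build_eplb_args`, `DP_ATTENTION`, and `EPLB_CONFIG_JSON` are hypothetical names, and the JSON keys that select torch_nccl and synchronous EPLB are not shown in this PR, so the config is passed through as an opaque string. Only `--enable-eplb` and `--eplb-config` are taken as existing `vllm serve` options.

```shell
#!/usr/bin/env bash
# Sketch only: build_eplb_args, DP_ATTENTION, and EPLB_CONFIG_JSON are
# hypothetical names, not taken from dsv4_fp4_b200_vllm.sh.

# Print the extra `vllm serve` flags for one sweep point.
#   $1  "true" when the sweep point uses DP attention
#   $2  JSON string for --eplb-config (may be empty; keys are not shown in this PR)
build_eplb_args() {
  local dp_attention="$1"
  local eplb_config_json="$2"

  # Sweep points without DP attention get no EPLB flags at all.
  if [ "$dp_attention" != "true" ]; then
    return 0
  fi

  if [ -n "$eplb_config_json" ]; then
    printf '%s %s %s\n' "--enable-eplb" "--eplb-config" "$eplb_config_json"
  else
    printf '%s\n' "--enable-eplb"
  fi
}

# Example: collect the flags for the current sweep point.
EPLB_ARGS="$(build_eplb_args "${DP_ATTENTION:-false}" "${EPLB_CONFIG_JSON:-}")"
echo "EPLB_ARGS=${EPLB_ARGS}"
```

If the resulting string is later expanded unquoted on the serve command line, the JSON should contain no spaces; a bash array would be the more robust choice.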

Reviewed by Cursor Bugbot for commit 783dfe8. Bugbot is set up for automated code reviews on this repo.

github-actions · 2026-06-03T16:46:37Z

Thanks for the contribution! For vLLM & SGLang, please ensure that your recipe is similar to the official vLLM recipes and/or the SGLang cookbook.

If it is not, please create a PR first before we can merge your single node PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you

PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. If re-running failed jobs is attempted, PR authors are responsible for ensuring it passes. See GitHub's docs on re-running failed jobs: https://docs.github.com/en/actions/how-tos/manage-workflow-runs/re-run-workflows-and-jobs#re-running-failed-jobs-in-a-workflow

As a rule of thumb, PR authors should request a review and get approval from the respective companies' CODEOWNERS before requesting a review from core maintainers.

If additional help is needed, PR authors can reach out to core maintainers over Slack.

github-actions · 2026-06-03T17:11:48Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=26899483031
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=26899483031

github-actions · 2026-06-03T18:02:48Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=26900694153
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=26900694153

github-actions · 2026-06-03T18:20:22Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=26900694153
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=26900694153

github-actions · 2026-06-03T19:16:16Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=26905612623
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=26905612623

github-actions · 2026-06-03T21:27:32Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=26905612623
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=26905612623

github-actions · 2026-06-04T04:42:49Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=26914041013
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=26914041013

github-actions · 2026-06-04T05:21:02Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=26914041013
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=26914041013

wzhao18 · 2026-06-04T14:44:47Z

@kedarpotdar-nv @functionstackx @Oseltamivir Ready for review/merge. Thanks!

Enable EPLB for DEP configs

8b3e665

wzhao18 requested a review from a team June 3, 2026 16:46

github-project-automation Bot added this to InferenceMAX Board Jun 3, 2026

Perf changelog

bb31d48

wzhao18 added the full-sweep-enabled label Jun 3, 2026

Update to nightly image

c361dad

wzhao18 requested review from jgangani and kedarpotdar-nv as code owners June 3, 2026 17:10

wzhao18 added 2 commits June 3, 2026 14:42

Add NCCL_NET_PLUGIN and UCX_TLS exports

94e89cc

Set environment variables for NIXL EPLB configuration.

Merge branch 'main' into wzhao/dsv4-b200-eplb

eba41e5

wzhao18 added 2 commits June 3, 2026 17:26

Enhance EPLB_ARGS with eplb-config

31a12b0

Add eplb-config option to EPLB_ARGS for NCCL.

Clean up environment variable exports in script

783dfe8

Removed unnecessary environment variable exports for NCCL and UCX.

wzhao18 changed the title ~~[WIP] Update B200 Dsv4 configs~~ Update B200 Dsv4 configs Jun 4, 2026

wzhao18 wants to merge 7 commits into main from wzhao/dsv4-b200-eplb
