Commit 0618646
Updating dsv4 b200 vllm version (#1384)
* Try updating b200 dsv4
* add changelog
* Set MAX_CUDAGRAPH_CAPTURE_SIZE to 2048 unconditionally
* Update Docker image for dsv4-fp4-b200-vllm
* Update vLLM image tag in perf-changelog.yaml
Updated the vLLM image tag to specify the nightly version.
* Update Docker image tag for dsv4-fp4-b200-vllm
* Update vLLM image tag to v0.22.0
* Update conc-end values in nvidia-master.yaml
---------
Co-authored-by: functionstackx <47992694+functionstackx@users.noreply.github.com>
1 parent c088658 commit 0618646
3 files changed
Lines changed: 12 additions & 10 deletions
File tree
- .github/configs
- benchmarks/single_node
Changed lines per file, in the order the files appear in the diff:

| File | Original file lines | Diff lines | Change |
|---|---|---|---|
| 1 | 1759 | 1759 | 1 line replaced |
| 1 | 1773 | 1773-1774 | 1 line replaced by 2 |
| 2 | 45-47 | | 3 lines removed |
| 2 | 51 | | 1 line removed |
| 2 | | 57-58 | 2 lines added |
| 2 | 62-64 | | 3 lines removed |
| 2 | 93 | 88 | 1 line replaced |
| 3 | | 3211-3216 | 6 lines added |