Commit bddbf40
B200 Minimax FP8 vllm upgrade (#947)
* Update nvidia-master.yaml
* vllm version bump
* add perf changelog
* update search space and configs
* fix typo in VLLM_USE_DEEP_GEMM
* Remove ISL 1024 / OSL 8192 seq-len config for minimaxm2.5-fp8-b200-vllm
Co-authored-by: functionstackx <functionstackx@users.noreply.github.com>
* update image
* update config and remove DEEPGEMM flag
* test tep
* fix typo in ep bash script
* add max cudagraph size
* upgrade to vllm 0.19
* typo
* revert h200 change
* fix: update perf-changelog version to v0.19.0
Co-authored-by: Cameron Quilici <cquil11@users.noreply.github.com>
* Remove commented-out tp:8 search-space entry
Co-authored-by: Cameron Quilici <cquil11@users.noreply.github.com>
---------
Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com>
Co-authored-by: functionstackx <functionstackx@users.noreply.github.com>
Co-authored-by: Cameron Quilici <cquil11@users.noreply.github.com>1 parent 800f57a commit bddbf40
3 files changed
Lines changed: 25 additions & 11 deletions
File tree
- .github/configs
- benchmarks/single_node
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
3101 | 3101 | | |
3102 | 3102 | | |
3103 | 3103 | | |
3104 | | - | |
| 3104 | + | |
3105 | 3105 | | |
3106 | 3106 | | |
3107 | 3107 | | |
| |||
3112 | 3112 | | |
3113 | 3113 | | |
3114 | 3114 | | |
3115 | | - | |
3116 | | - | |
| 3115 | + | |
| 3116 | + | |
| 3117 | + | |
| 3118 | + | |
3117 | 3119 | | |
3118 | 3120 | | |
3119 | 3121 | | |
3120 | | - | |
3121 | | - | |
| 3122 | + | |
| 3123 | + | |
3122 | 3124 | | |
3123 | 3125 | | |
3124 | 3126 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
24 | 24 | | |
25 | 25 | | |
26 | 26 | | |
27 | | - | |
28 | | - | |
| 27 | + | |
29 | 28 | | |
30 | | - | |
| 29 | + | |
31 | 30 | | |
32 | 31 | | |
33 | 32 | | |
| |||
44 | 43 | | |
45 | 44 | | |
46 | 45 | | |
47 | | - | |
| 46 | + | |
48 | 47 | | |
49 | 48 | | |
50 | | - | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
51 | 53 | | |
52 | 54 | | |
53 | 55 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1143 | 1143 | | |
1144 | 1144 | | |
1145 | 1145 | | |
1146 | | - | |
| 1146 | + | |
1147 | 1147 | | |
1148 | 1148 | | |
1149 | 1149 | | |
| |||
1235 | 1235 | | |
1236 | 1236 | | |
1237 | 1237 | | |
| 1238 | + | |
| 1239 | + | |
| 1240 | + | |
| 1241 | + | |
| 1242 | + | |
| 1243 | + | |
| 1244 | + | |
| 1245 | + | |
| 1246 | + | |
| 1247 | + | |
0 commit comments