Commit 3e4d6dd
Updated DSv4 vllm B300 MTP (#1271)
* add DP to b300 mtp
* Update changelog
* Update Docker image version for dsv4-fp4-b300-vllm-mtp
* Update Docker image version to v0.20.2
* Modify search-space parameters in nvidia-master.yaml
* Merge duplicate DP_ATTENTION conditions in benchmark script
Co-authored-by: Cameron Quilici <cquil11@users.noreply.github.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
---------
Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com>
Co-authored-by: Cameron Quilici <cquil11@users.noreply.github.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
Co-authored-by: Cameron Quilici <cjquilici@gmail.com>1 parent 5fe6d56 commit 3e4d6dd
3 files changed
Lines changed: 21 additions & 4 deletions
File tree
- .github/configs
- benchmarks/single_node
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2825 | 2825 | | |
2826 | 2826 | | |
2827 | 2827 | | |
2828 | | - | |
| 2828 | + | |
2829 | 2829 | | |
2830 | 2830 | | |
2831 | 2831 | | |
| |||
2838 | 2838 | | |
2839 | 2839 | | |
2840 | 2840 | | |
2841 | | - | |
| 2841 | + | |
| 2842 | + | |
2842 | 2843 | | |
2843 | 2844 | | |
2844 | 2845 | | |
2845 | 2846 | | |
2846 | | - | |
| 2847 | + | |
2847 | 2848 | | |
| 2849 | + | |
2848 | 2850 | | |
2849 | 2851 | | |
2850 | 2852 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
36 | 36 | | |
37 | 37 | | |
38 | 38 | | |
39 | | - | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
40 | 47 | | |
41 | 48 | | |
42 | 49 | | |
| |||
61 | 68 | | |
62 | 69 | | |
63 | 70 | | |
| 71 | + | |
64 | 72 | | |
65 | 73 | | |
66 | 74 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2486 | 2486 | | |
2487 | 2487 | | |
2488 | 2488 | | |
| 2489 | + | |
| 2490 | + | |
| 2491 | + | |
| 2492 | + | |
| 2493 | + | |
| 2494 | + | |
| 2495 | + | |
0 commit comments