File tree Expand file tree Collapse file tree
Expand file tree Collapse file tree Original file line number Diff line number Diff line change @@ -2252,7 +2252,7 @@ glm5-fp4-b200-sglang-mtp:
22522252 # does not have a B300-specific recipe, so this config reuses the existing
22532253 # GLM-5 FP4 B200 SGLang recipe as-is until B300-specific tuning is available.
22542254glm5-fp4-b300-sglang :
2255- image : lmsysorg/sglang:v0.5.10.post1 -cu130
2255+ image : lmsysorg/sglang:v0.5.11 -cu130
22562256 model : nvidia/GLM-5-NVFP4
22572257 model-prefix : glm5
22582258 runner : b300
Original file line number Diff line number Diff line change 24302430 description :
24312431 - " Update SGLang image from v0.5.10.post1-cu130 to v0.5.11-cu130"
24322432 pr-link : https://github.com/SemiAnalysisAI/InferenceX/pull/1346
2433+
2434+ - config-keys :
2435+ - glm5-fp4-b300-sglang
2436+ description :
2437+ - " Update SGLang image from v0.5.10.post1-cu130 to v0.5.11-cu130"
2438+ pr-link : https://github.com/SemiAnalysisAI/InferenceX/pull/1329
You can’t perform that action at this time.
0 commit comments