fix(agentx): fix TP sizes and remove hardcoded max-model-len on MI355X agentic benchmarks
- dsv4 and minimaxm2.5 agentic: remove the MAX_MODEL_LEN override and the --max-model-len flag so that vLLM uses its server default
- amd-master.yaml: update dsv4 agentic TP from 4→8 and minimaxm2.5 agentic TP from 4→1
- launch_mi355x-amds.sh: extend the HF_HUB_CACHE_MOUNT override to the vllm framework for DeepSeek-V4-Pro
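
A minimal sketch of the effect of the first change, not the benchmark scripts themselves: the helper name build_vllm_args and the model placeholder are hypothetical, and only the vLLM flags --tensor-parallel-size and --max-model-len are real. It shows a launch command that passes --max-model-len only when MAX_MODEL_LEN is set explicitly, so that an unset variable leaves the context length to vLLM.

```shell
# Hypothetical helper (not from the repo): print the arguments for `vllm`.
# $1 = model name (placeholder), $2 = tensor-parallel size.
build_vllm_args() {
  model="$1"
  tp="$2"
  args="serve $model --tensor-parallel-size $tp"
  # Append --max-model-len only when MAX_MODEL_LEN is set and non-empty;
  # otherwise vLLM derives the context length from the model config.
  if [ -n "${MAX_MODEL_LEN:-}" ]; then
    args="$args --max-model-len $MAX_MODEL_LEN"
  fi
  printf '%s\n' "$args"
}
```

With MAX_MODEL_LEN unset, calling build_vllm_args with a model name and TP 8 prints a command line without --max-model-len, which corresponds to the post-change behavior described above.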
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>