revert: drop MAX_MODEL_LEN=131072 default from Kimi MI355X/B200 launchers
Per the agentic benchmark design, launchers must not cap context length. Removes the
MAX_MODEL_LEN=131072 default from kimik2.5_fp4_mi355x,
kimik2.5_fp4_b200, and kimik2.5_int4_b200 launchers so vLLM uses the
model's native context window (matches the H100/H200/B300 launchers
that already have no cap).
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
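
The behavior described in the commit message, an optional cap with no baked-in default, might look roughly like the sketch below. This is not the launchers' actual code: `build_vllm_args` and its model argument are hypothetical names used for illustration. The only assumptions taken from outside the commit are that the launchers read a `MAX_MODEL_LEN` environment variable and forward it to vLLM's `--max-model-len` option.

```shell
#!/bin/sh
# Hypothetical launcher fragment: build_vllm_args is an illustrative helper,
# not a function from the actual launchers.
set -eu

build_vllm_args() {
  # $1: model name or path (hypothetical argument).
  set -- serve "$1"
  # No default is applied here. The flag is passed only when the caller
  # explicitly sets MAX_MODEL_LEN; otherwise vLLM falls back to the
  # context length derived from the model's own config.
  if [ -n "${MAX_MODEL_LEN:-}" ]; then
    set -- "$@" --max-model-len "$MAX_MODEL_LEN"
  fi
  # Print one argument per line so the result is easy to inspect.
  printf '%s\n' "$@"
}
```

With `MAX_MODEL_LEN` unset, the helper emits only `serve` and the model name, so no cap reaches vLLM. A caller who does want a cap for a one-off run can still export `MAX_MODEL_LEN` before invoking the launcher.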