Commit 88a1153
nvidia-master(kimik2.5-fp4-b200-vllm-agentic): bump vLLM v0.20.2 -> v0.21.0
v0.20.2's bundled huggingface_hub==1.14.0 silently fetches Git-LFS
pointer files instead of LFS content for `hf download --repo-type
dataset`. Every kimik2.5-fp4-b200-vllm-agentic job in run 26536606210
hit "pyarrow.lib.ArrowInvalid: JSON parse error: Missing a name for
object member. in row 0" -- the signature of pyarrow trying to parse
the literal `version https://git-lfs.github.com/spec/v1` line of an
LFS pointer file as JSON.
b200-dgxc has no persistent /mnt/hf_hub_cache mount (per launcher
diff), so every container re-downloads the dataset and re-hits the
bug. v0.21.0 ships a newer huggingface_hub that resolves LFS
correctly. v0.20.x's flashinfer fix for the max_model_len=131072 +
prefix-caching warmup crash is included in v0.21.0.
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Signed-off-by: Cam Quilici <cjquilici@gmail.com>1 parent eab58e9 commit 88a1153
1 file changed
Lines changed: 7 additions & 5 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2699 | 2699 | | |
2700 | 2700 | | |
2701 | 2701 | | |
2702 | | - | |
| 2702 | + | |
2703 | 2703 | | |
2704 | 2704 | | |
2705 | | - | |
2706 | | - | |
2707 | | - | |
2708 | | - | |
| 2705 | + | |
| 2706 | + | |
| 2707 | + | |
| 2708 | + | |
| 2709 | + | |
| 2710 | + | |
2709 | 2711 | | |
2710 | 2712 | | |
2711 | 2713 | | |
| |||
0 commit comments