Commit 1cede80

switch to native offloading
1 parent ad505ff commit 1cede80

1 file changed

Lines changed: 1 addition & 1 deletion

.github/configs/nvidia-master.yaml

@@ -1762,7 +1762,7 @@ dsv4-fp4-b200-vllm:
 # - image: bumped to a custom v0.21.0 build (cquil/vllm-openai:v0.21.0-8813c92)
 # to test SimpleCPUOffloadConnector lazy_offload behavior on a newer vLLM.
 dsv4-fp4-b200-vllm-agentic:
-  image: vllm/vllm-openai:v0.21.0
+  image: cquil/vllm-openai:v0.21.0-dsv4-offloading
   model: deepseek-ai/DeepSeek-V4-Pro
   model-prefix: dsv4
   runner: b200-dgxc
