Skip to content

Commit 2c4c418

Browse files
committed
b200 test
1 parent 4ac92c6 commit 2c4c418

2 files changed

Lines changed: 3 additions & 2 deletions

File tree

.github/workflows/eval-tmpl.yml

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -82,7 +82,6 @@ jobs:
8282
- name: Resource cleanup
8383
run: |
8484
# Helper to avoid indefinite hangs on flaky tools (Docker/Slurm)
85-
sudo rm -rf /home/nvadmin/actions-runner/_work/InferenceMAX/InferenceMAX/gpt-oss-120b/
8685
safe_timeout() {
8786
if command -v timeout >/dev/null 2>&1; then
8887
timeout -k 5 30s "$@"
@@ -138,7 +137,7 @@ jobs:
138137
with:
139138
fetch-depth: 0
140139
# Avoid aggressive workspace deletion if stale, rely on git reset/clean later
141-
clean: true
140+
clean: false
142141

143142
- name: Launch eval via runner script
144143
env:

benchmarks/gptoss_fp4_b200_docker.sh

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -14,6 +14,8 @@
1414

1515
nvidia-smi
1616

17+
rm -rf /home/nvadmin/actions-runner/_work/InferenceMAX/InferenceMAX/gpt-oss-120b/
18+
1719
# To improve CI stability, we patch this helper function to prevent a race condition that
1820
# happens 1% of the time. ref: https://github.com/flashinfer-ai/flashinfer/pull/1779
1921
sed -i '102,108d' /usr/local/lib/python3.12/dist-packages/flashinfer/jit/cubin_loader.py

0 commit comments

Comments
 (0)