Commit 2707be9
committed
fix: prevent Metal GPU Watchdog timeout on low-RAM CI runners
- Move MLX_MAX_OPS_PER_BUFFER=50 to top of run() before Metal init
- Enable --stream-experts automatically on <12GB machines in test-dflash.sh
so weights are paged via mmap/pread instead of macOS VM swap
- Auto-cap draft tokens to 1 under SSD streaming (minimal fan-out)
- Always compute draftFootprintBytes regardless of --stream-experts flag1 parent 91e32af commit 2707be9
2 files changed
Lines changed: 30 additions & 6 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
290 | 290 | | |
291 | 291 | | |
292 | 292 | | |
| 293 | + | |
| 294 | + | |
| 295 | + | |
| 296 | + | |
| 297 | + | |
| 298 | + | |
| 299 | + | |
| 300 | + | |
| 301 | + | |
293 | 302 | | |
294 | 303 | | |
295 | 304 | | |
| |||
467 | 476 | | |
468 | 477 | | |
469 | 478 | | |
470 | | - | |
471 | 479 | | |
472 | 480 | | |
473 | 481 | | |
| |||
479 | 487 | | |
480 | 488 | | |
481 | 489 | | |
482 | | - | |
483 | 490 | | |
484 | 491 | | |
485 | 492 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
66 | 66 | | |
67 | 67 | | |
68 | 68 | | |
69 | | - | |
70 | | - | |
71 | | - | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
| 78 | + | |
| 79 | + | |
| 80 | + | |
| 81 | + | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
72 | 86 | | |
73 | 87 | | |
74 | 88 | | |
| |||
83 | 97 | | |
84 | 98 | | |
85 | 99 | | |
| 100 | + | |
| 101 | + | |
| 102 | + | |
86 | 103 | | |
87 | 104 | | |
88 | 105 | | |
89 | 106 | | |
90 | | - | |
| 107 | + | |
91 | 108 | | |
92 | 109 | | |
93 | 110 | | |
| |||
0 commit comments