Commit 4c042a6
fix: add required log lines to DFlash draft model load path
test-dflash.sh grepped for:
1. 'Draft model loaded successfully' — only emitted by standard draft path,
not DFlash path which has its own 'DFlash draft model loaded' message
2. 'Using speculative decoding' — not emitted by DFlash path at all
3. 'speculative decoding' — was present but test was failing on (1)
Add both required lines immediately after DFlash draft model weights load,
mirroring the standard speculative decoding path. The streaming failures
('missing [DONE] sentinel') were downstream of the model-not-found state
caused by the load log mismatch, not an inference bug.

1 parent 5581f38 commit 4c042a6
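
The check described in the commit message can be sketched roughly as follows. This is a minimal illustration, not the actual contents of test-dflash.sh, which this commit does not show. The three search strings come from the message above; the function name `check_required_log_lines` and the log file argument are hypothetical.

```shell
#!/bin/sh
# Hypothetical sketch of the log check described in the commit message.
# The real test-dflash.sh may be structured differently.

# check_required_log_lines LOGFILE
# Prints each required string that is absent from LOGFILE.
# Returns 0 if all strings are present, 1 otherwise.
check_required_log_lines() {
  log_file=$1
  missing=0
  for needle in \
    "Draft model loaded successfully" \
    "Using speculative decoding" \
    "speculative decoding"
  do
    # -q: no output, exit status only; -F: match as a fixed string
    if ! grep -qF -- "$needle" "$log_file"; then
      echo "MISSING: $needle"
      missing=1
    fi
  done
  return "$missing"
}
```

Because the match is case-sensitive and literal, a log that contains only 'DFlash draft model loaded' does not satisfy the first string, which is consistent with the failure described above. Emitting both required lines on the DFlash path makes all three checks pass.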
1 file changed
Lines changed: 2 additions & 0 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 664 | 664 | |
| 665 | 665 | |
| 666 | 666 | |
| | 667 | + |
| | 668 | + |
| 667 | 669 | |
| 668 | 670 | |
| 669 | 671 | |