Commit 82ecad0
server: stop DFlash at grammar tool boundaries
Keep DFlash active before lazy grammar triggers, then stop speculative accept/drafting once grammar, reasoning-budget forcing, or raw tool-call markers require normal token-by-token sampling.
Track accepted draft tokens separately from hidden-state rows so DFlash rollback and ring updates stay aligned at grammar/tool boundaries.
Fixes #5
Refs #61

parent 84efc67, commit 82ecad0
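The behavior described in the commit message can be illustrated with a minimal sketch. It is written in Python for brevity and is not the server's code: `SlotState`, `can_speculate`, `accept_draft`, and `HiddenRing` are hypothetical names, and the assumption that the verifier yields one hidden-state row per draft position is ours, not something the commit states.

```python
from collections import deque
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass
class SlotState:
    """Hypothetical per-request flags; the real field names are unknown."""
    grammar_triggered: bool = False  # a lazy grammar has fired and now constrains sampling
    forcing_budget: bool = False     # reasoning-budget logic is forcing specific tokens
    saw_tool_marker: bool = False    # a raw tool-call marker appeared in the output


def can_speculate(slot: SlotState) -> bool:
    """Speculate only while nothing requires token-by-token sampling."""
    return not (slot.grammar_triggered or slot.forcing_budget or slot.saw_tool_marker)


def accept_draft(draft: Sequence[int], target: Sequence[int],
                 boundary_at: Optional[int] = None) -> int:
    """Count the draft tokens to accept.

    Acceptance stops at the first mismatch with the target model's tokens,
    or at `boundary_at`, the first draft index where a grammar or tool
    boundary takes effect (None if there is no boundary in this batch).
    """
    accepted = 0
    for i, (d, t) in enumerate(zip(draft, target)):
        if boundary_at is not None and i >= boundary_at:
            break
        if d != t:
            break
        accepted += 1
    return accepted


class HiddenRing:
    """Fixed-capacity ring of hidden-state rows (assumed layout)."""

    def __init__(self, capacity: int) -> None:
        self.rows: deque = deque(maxlen=capacity)

    def commit(self, batch_rows: List[str], n_accepted: int) -> int:
        """Keep only rows for accepted tokens; return how many were discarded."""
        for row in batch_rows[:n_accepted]:
            self.rows.append(row)
        return len(batch_rows) - n_accepted


# Usage: four draft tokens were verified, but a boundary takes effect at index 1.
draft, target = [10, 11, 12, 13], [10, 11, 12, 99]
n = accept_draft(draft, target, boundary_at=1)
ring = HiddenRing(capacity=8)
discarded = ring.commit(["h0", "h1", "h2", "h3"], n)
```

The point of the sketch is that the accepted count (`n`) and the number of rows the verifier computed (`len(batch_rows)`) are tracked as separate quantities. If the ring advanced by the computed row count instead, it would drift out of step with the token sequence whenever acceptance is cut short at a boundary.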
4 files changed
Lines changed: 207 additions & 42 deletions
Changed file (name and diff line content not preserved); hunks shown:

| Hunk | Original lines | New lines | Added | Removed |
|---|---|---|---|---|
| 1 | 653-671 | 653-690 | 25 | 6 |
| 2 | 682-687 | 701-710 | 4 | 0 |
| 3 | 760-765 | 783-792 | 4 | 0 |

Changed file (name and diff line content not preserved); hunks shown:

| Hunk | Original lines | New lines | Added | Removed |
|---|---|---|---|---|
| 1 | 67-72 | 67-73 | 1 | 0 |

Changed file (name and diff line content not preserved); hunks shown:

| Hunk | Original lines | New lines | Added | Removed |
|---|---|---|---|---|
| 1 | 255-261 | 255-261 | 1 | 1 |
| 2 | 2106-2114 | 2106-2114 | 3 | 3 |
