Commit c778385
committed
feat(qwen3_5_mtp): scheduler MTP verify backend + accept-len transport
Drive the draft/verify loop from the scheduler:
- carry a canonical InferReq.mtp_accept_len pointer and persist the
per-request accept_len across steps; build per-req
b_num_accepted_tokens in decode_mtp and commit it in phase 2 so the
next step reads a fresh count.
- extend the chunked_prefill backend / base_backend with the MTP verify
dispatch and the partial-accept read offset.1 parent bb5ffd9 commit c778385
3 files changed
Lines changed: 191 additions & 33 deletions
File tree
- lightllm/server/router/model_infer
- mode_backend
- chunked_prefill
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
357 | 357 | | |
358 | 358 | | |
359 | 359 | | |
| 360 | + | |
| 361 | + | |
| 362 | + | |
| 363 | + | |
| 364 | + | |
360 | 365 | | |
361 | 366 | | |
362 | 367 | | |
| |||
377 | 382 | | |
378 | 383 | | |
379 | 384 | | |
| 385 | + | |
| 386 | + | |
| 387 | + | |
| 388 | + | |
380 | 389 | | |
381 | 390 | | |
382 | 391 | | |
| |||
385 | 394 | | |
386 | 395 | | |
387 | 396 | | |
| 397 | + | |
388 | 398 | | |
389 | 399 | | |
390 | 400 | | |
| |||
400 | 410 | | |
401 | 411 | | |
402 | 412 | | |
403 | | - | |
404 | | - | |
405 | | - | |
| 413 | + | |
| 414 | + | |
| 415 | + | |
| 416 | + | |
| 417 | + | |
| 418 | + | |
| 419 | + | |
| 420 | + | |
406 | 421 | | |
407 | 422 | | |
408 | 423 | | |
| |||
558 | 573 | | |
559 | 574 | | |
560 | 575 | | |
| 576 | + | |
| 577 | + | |
561 | 578 | | |
562 | 579 | | |
563 | 580 | | |
| |||
Lines changed: 34 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
357 | 357 | | |
358 | 358 | | |
359 | 359 | | |
| 360 | + | |
| 361 | + | |
| 362 | + | |
| 363 | + | |
| 364 | + | |
| 365 | + | |
| 366 | + | |
| 367 | + | |
| 368 | + | |
| 369 | + | |
360 | 370 | | |
361 | 371 | | |
362 | 372 | | |
| |||
602 | 612 | | |
603 | 613 | | |
604 | 614 | | |
605 | | - | |
606 | 615 | | |
607 | 616 | | |
608 | 617 | | |
| |||
783 | 792 | | |
784 | 793 | | |
785 | 794 | | |
| 795 | + | |
| 796 | + | |
| 797 | + | |
| 798 | + | |
| 799 | + | |
| 800 | + | |
| 801 | + | |
| 802 | + | |
| 803 | + | |
| 804 | + | |
| 805 | + | |
| 806 | + | |
| 807 | + | |
| 808 | + | |
| 809 | + | |
| 810 | + | |
| 811 | + | |
| 812 | + | |
| 813 | + | |
| 814 | + | |
| 815 | + | |
| 816 | + | |
786 | 817 | | |
787 | 818 | | |
788 | 819 | | |
789 | 820 | | |
790 | 821 | | |
| 822 | + | |
| 823 | + | |
791 | 824 | | |
792 | 825 | | |
793 | 826 | | |
| |||
809 | 842 | | |
810 | 843 | | |
811 | 844 | | |
812 | | - | |
813 | 845 | | |
814 | 846 | | |
815 | 847 | | |
| |||
0 commit comments