Commit 775bae6
committed
[TRTLLM-12669][chore] Address review feedback
- llm_args: keep allow_advanced_sampling as a deprecated no-op field
with a logger warning when explicitly set, so removing it isn't an
abrupt API break
- llm_args: add TODO above the Eagle3-only rejection-sampling whitelist
to track extending support to MTP / DraftTarget / PARD / DFlash /
SaveHiddenStates / SA and unifying the dispatch in SpecMetadata
- cuda_graph_runner: type spec_metadata as Optional[SpecMetadata]
instead of Optional[Any]
- model_engine: always initialize self.spec_metadata = None so the
capture-pass can access it directly without a getattr() fallback
- eagle3.draft_decoder: drop the dead Optional/_draft_sampler_greedy
fallback; spec_metadata and batch_size are always passed by the
sole caller
Signed-off-by: ZhaoyangWang <zhaoyangw@nvidia.com>1 parent df3dc51 commit 775bae6
4 files changed
Lines changed: 46 additions & 19 deletions
File tree
- tensorrt_llm
- _torch
- pyexecutor
- speculative
- llmapi
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
15 | 15 | | |
16 | 16 | | |
17 | 17 | | |
| 18 | + | |
18 | 19 | | |
19 | 20 | | |
20 | 21 | | |
| |||
202 | 203 | | |
203 | 204 | | |
204 | 205 | | |
205 | | - | |
| 206 | + | |
206 | 207 | | |
207 | 208 | | |
208 | 209 | | |
| |||
244 | 245 | | |
245 | 246 | | |
246 | 247 | | |
247 | | - | |
| 248 | + | |
248 | 249 | | |
249 | 250 | | |
250 | 251 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
411 | 411 | | |
412 | 412 | | |
413 | 413 | | |
414 | | - | |
415 | 414 | | |
416 | 415 | | |
417 | 416 | | |
| |||
464 | 463 | | |
465 | 464 | | |
466 | 465 | | |
| 466 | + | |
467 | 467 | | |
468 | 468 | | |
469 | 469 | | |
| |||
1072 | 1072 | | |
1073 | 1073 | | |
1074 | 1074 | | |
1075 | | - | |
| 1075 | + | |
1076 | 1076 | | |
1077 | 1077 | | |
1078 | 1078 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
798 | 798 | | |
799 | 799 | | |
800 | 800 | | |
801 | | - | |
802 | | - | |
| 801 | + | |
| 802 | + | |
803 | 803 | | |
804 | 804 | | |
805 | 805 | | |
806 | | - | |
807 | | - | |
808 | | - | |
| 806 | + | |
| 807 | + | |
809 | 808 | | |
810 | 809 | | |
811 | 810 | | |
| |||
814 | 813 | | |
815 | 814 | | |
816 | 815 | | |
817 | | - | |
818 | | - | |
| 816 | + | |
819 | 817 | | |
820 | 818 | | |
821 | 819 | | |
822 | 820 | | |
823 | 821 | | |
824 | 822 | | |
825 | 823 | | |
826 | | - | |
827 | | - | |
828 | | - | |
829 | | - | |
830 | | - | |
831 | | - | |
832 | | - | |
833 | | - | |
| 824 | + | |
| 825 | + | |
| 826 | + | |
| 827 | + | |
| 828 | + | |
| 829 | + | |
834 | 830 | | |
835 | 831 | | |
836 | 832 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
906 | 906 | | |
907 | 907 | | |
908 | 908 | | |
| 909 | + | |
| 910 | + | |
| 911 | + | |
| 912 | + | |
| 913 | + | |
| 914 | + | |
| 915 | + | |
| 916 | + | |
909 | 917 | | |
910 | 918 | | |
911 | 919 | | |
| |||
970 | 978 | | |
971 | 979 | | |
972 | 980 | | |
| 981 | + | |
| 982 | + | |
| 983 | + | |
| 984 | + | |
| 985 | + | |
| 986 | + | |
| 987 | + | |
| 988 | + | |
| 989 | + | |
| 990 | + | |
| 991 | + | |
| 992 | + | |
| 993 | + | |
| 994 | + | |
| 995 | + | |
| 996 | + | |
| 997 | + | |
973 | 998 | | |
974 | 999 | | |
975 | 1000 | | |
| |||
4148 | 4173 | | |
4149 | 4174 | | |
4150 | 4175 | | |
| 4176 | + | |
| 4177 | + | |
| 4178 | + | |
| 4179 | + | |
| 4180 | + | |
4151 | 4181 | | |
4152 | 4182 | | |
4153 | 4183 | | |
| |||
0 commit comments