Commit d6fd852
[TRTLLM-12669][chore] Address review feedback
- llm_args: keep allow_advanced_sampling as a deprecated no-op field
  with a logger warning when explicitly set, so removing it isn't an
  abrupt API break
- llm_args: add TODO above the Eagle3-only rejection-sampling whitelist
  to track extending support to MTP / DraftTarget / PARD / DFlash /
  SaveHiddenStates / SA and unifying the dispatch in SpecMetadata
- cuda_graph_runner: type spec_metadata as Optional[SpecMetadata]
  instead of Optional[Any]
- model_engine: always initialize self.spec_metadata = None so the
  capture pass can access it directly without a getattr() fallback
- eagle3.draft_decoder: drop the dead Optional/_draft_sampler_greedy
  fallback; spec_metadata and batch_size are always passed by the
  sole caller
Signed-off-by: ZhaoyangWang <zhaoyangw@nvidia.com>
1 parent 53b4415 commit d6fd852
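The first bullet (keeping allow_advanced_sampling as a deprecated no-op that warns only when explicitly set) can be illustrated with a minimal sketch. This is not the code from this commit: the class name SamplingArgsSketch, the logger name, and the use of None as the "not set" marker are assumptions made for illustration, and the standard library dataclass stands in for whatever model class llm_args actually uses.

```python
import logging
from dataclasses import dataclass
from typing import Optional

# Hypothetical logger name, chosen for this sketch only.
logger = logging.getLogger("llm_args_sketch")


@dataclass
class SamplingArgsSketch:
    """Hypothetical stand-in for the real llm_args class."""

    # None means "the caller did not set it"; any bool means it was set
    # explicitly. The value itself is never read anywhere (a no-op field).
    allow_advanced_sampling: Optional[bool] = None

    def __post_init__(self) -> None:
        # Warn only on explicit use, so default construction stays silent.
        if self.allow_advanced_sampling is not None:
            logger.warning(
                "allow_advanced_sampling is deprecated and has no effect.")
```

With this shape, existing callers that still pass the field keep working and see a warning, while callers that never set it see nothing.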
4 files changed
Lines changed: 46 additions & 19 deletions
File tree
- tensorrt_llm
- _torch
- pyexecutor
- speculative
- llmapi
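The cuda_graph_runner and model_engine bullets in the commit message describe one pattern: initialize the attribute to None unconditionally and annotate it as Optional[SpecMetadata], so readers can access it directly. A rough sketch of that pattern follows. The SpecMetadata stand-in, its max_draft_len field, EngineSketch, and capture_draft_len are all hypothetical names for illustration and do not reflect the real classes.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SpecMetadata:
    """Hypothetical minimal stand-in; the field below is an assumption."""

    max_draft_len: int = 0


class EngineSketch:
    """Hypothetical engine that owns the speculative-decoding metadata."""

    def __init__(self) -> None:
        # Always defined, so no caller needs getattr(self, "spec_metadata", None).
        self.spec_metadata: Optional[SpecMetadata] = None

    def enable_speculation(self, max_draft_len: int) -> None:
        self.spec_metadata = SpecMetadata(max_draft_len=max_draft_len)


def capture_draft_len(engine: EngineSketch) -> int:
    # Direct attribute access; None is the only "absent" state to handle,
    # and the precise Optional[SpecMetadata] type lets a checker verify
    # the field access below, which Optional[Any] would not.
    spec_metadata: Optional[SpecMetadata] = engine.spec_metadata
    return 0 if spec_metadata is None else spec_metadata.max_draft_len
```

The design choice is that "attribute missing" and "attribute is None" collapse into a single state, which removes one failure mode from the capture path.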