Commit 2b80f8d
* Why?
We would like a `TorchLlmArgs` config to work with AutoDeploy's own
version of the args class with minimal changes.
* What?
This commit removes the redefinition of:
- `model_kwargs`: existing usages already treat `None` the same way as
an empty dict.
- `max_batch_size`: most unit tests set it explicitly; a few configs were
updated to set the old default explicitly.
- `max_beam_width`: instead adds a validator for it.
- `attn_backend`: although the defaults of the base class ("TRTLLM")
and AutoDeploy ("flashinfer") differ, the
`update_transforms_with_shortcuts` validator in practice reads the
default from `default.yaml`, which is "flashinfer".
- `sampler`: the executor code already supported both values. We just
tweak it so that the "auto" value corresponds to the now-removed default.
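
The `model_kwargs` and `max_beam_width` items above can be sketched as
follows. This is a minimal illustration, not the code from this commit:
`SketchArgs` and `resolve_model_kwargs` are hypothetical names, a plain
dataclass stands in for the real args class, and the specific rule
(beam width must be 1) is an assumption, since the message does not say
what the validator checks.

```python
from dataclasses import dataclass
from typing import Any, Dict, Optional


@dataclass
class SketchArgs:
    """Hypothetical stand-in for an args class; not the real TorchLlmArgs."""

    # Inherited-style default: `None` rather than an empty dict.
    model_kwargs: Optional[Dict[str, Any]] = None
    max_beam_width: int = 1

    def __post_init__(self) -> None:
        # Validate the inherited field instead of redefining it.
        # Assumed rule, for illustration only: reject beam search.
        if self.max_beam_width != 1:
            raise ValueError(
                f"max_beam_width must be 1, got {self.max_beam_width}"
            )


def resolve_model_kwargs(args: SketchArgs) -> Dict[str, Any]:
    # `None` and `{}` are both falsy, so both resolve to an empty dict.
    return dict(args.model_kwargs or {})
```

With a guard of this shape, call sites do not need to care whether the
field defaulted to `None` or to `{}`.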
It also removes `cuda_graph_batch_sizes` in favor of
`cuda_graph_config.batch_sizes`, with the necessary adjustments to unit
tests and existing configs.
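
For configs that still carry the old key, the move can be sketched as a
small dict rewrite. This is an illustration under assumptions, not part
of the commit: `migrate_cuda_graph_batch_sizes` is a hypothetical helper,
and the config is assumed to be a plain dict such as one loaded from YAML.

```python
import copy
from typing import Any, Dict


def migrate_cuda_graph_batch_sizes(config: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy with the legacy key moved under `cuda_graph_config`."""
    migrated = copy.deepcopy(config)
    if "cuda_graph_batch_sizes" not in migrated:
        return migrated  # nothing to migrate

    legacy = migrated.pop("cuda_graph_batch_sizes")
    # Treat a missing or `None` nested section as empty.
    nested = migrated.get("cuda_graph_config") or {}
    if "batch_sizes" in nested and nested["batch_sizes"] != legacy:
        raise ValueError("conflicting batch sizes in old and new keys")
    nested["batch_sizes"] = legacy
    migrated["cuda_graph_config"] = nested
    return migrated


old = {"max_batch_size": 8, "cuda_graph_batch_sizes": [1, 2, 4, 8]}
new = migrate_cuda_graph_batch_sizes(old)
```

Here `new` holds the batch sizes under `cuda_graph_config` and no longer
contains the top-level key, while `old` is left unmodified.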
Signed-off-by: William Zhang <133824995+2ez4bz@users.noreply.github.com>
1 parent 889b81c commit 2b80f8d
34 files changed
Lines changed: 380 additions & 176 deletions
File tree
- examples/auto_deploy
  - model_registry/configs
- tensorrt_llm/_torch/auto_deploy
  - config
  - models
  - shim
- tests
  - integration/defs
    - accuracy
    - examples
    - perf
  - unittest/auto_deploy
    - _utils_test
    - singlegpu
      - shim
      - smoke
      - transformations/library