Commit 9fd6c8d
[FIX]
* nested_move_to() now supports dict
Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai>
* Revert erroneous modification to thread number.
Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai>
* WeightOnlyLooper supports multi-GPU and multi-threading acceleration for quantization
Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai>
* Add a progress bar to `submodule finalize`
Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai>
---------
Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai>
weight_only_looper did not support multi-GPU quantization. (#2915)
1 parent 7b27707 commit 9fd6c8d
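
The diff body is not shown here, so as a rough illustration of what dict support in a nested move helper might look like, here is a minimal sketch. It is not the project's actual `nested_move_to()` implementation: the name `nested_move_to_sketch`, its signature, and the `FakeTensor` stand-in are hypothetical, and the only assumption is that movable objects expose a `.to(device)` method.

```python
from typing import Any


def nested_move_to_sketch(value: Any, device: str) -> Any:
    """Hypothetical helper: recursively call .to(device) inside nested containers."""
    if isinstance(value, dict):
        # Dict case: keep the keys, move each value.
        return {key: nested_move_to_sketch(item, device) for key, item in value.items()}
    if isinstance(value, (list, tuple)):
        moved = [nested_move_to_sketch(item, device) for item in value]
        return moved if isinstance(value, list) else tuple(moved)
    if hasattr(value, "to"):
        # Anything tensor-like that knows how to move itself.
        return value.to(device)
    # Plain values (ints, strings, None) pass through unchanged.
    return value


class FakeTensor:
    """Stand-in for a tensor so the sketch runs without a GPU library."""

    def __init__(self, device: str = "cpu") -> None:
        self.device = device

    def to(self, device: str) -> "FakeTensor":
        return FakeTensor(device)


batch = {"a": FakeTensor(), "b": [FakeTensor(), (FakeTensor(), 3)], "c": "text"}
moved_batch = nested_move_to_sketch(batch, "cuda:1")
```

The sketch returns new containers rather than mutating the input, which keeps the original batch usable on its source device.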
5 files changed
Lines changed: 804 additions & 31 deletions
File tree
- gptqmodel
- looper
- utils
- tests