Skip to content

Commit 5e090d1

Browse files
authored
Update README.md
1 parent 5596261 commit 5e090d1

1 file changed

Lines changed: 1 addition & 29 deletions

File tree

README.md

Lines changed: 1 addition & 29 deletions
Original file line numberDiff line numberDiff line change
@@ -289,35 +289,7 @@ You can add your own model type referring to files under `llmc/models/*.py`.
289289

290290
✅ [Wanda](https://arxiv.org/abs/2306.11695)
291291

292-
## TODO List
293-
294-
### Quantization
295-
296-
- [ ] QuIP
297-
298-
- [ ] QuIP#
299-
300-
- [ ] AQLM
301-
302-
**Note:** Some specific algorithms like QUIK, SpQR, needing special hardware or kernel support can not go through naive quantization by multiple backends, and then utilize these backends to infer. However, users can still use our tool to evaluate the performance of these algorithms in their research.
303-
304-
### Pruning
305-
306-
- [ ] SparseGPT
307-
308-
- [ ] LLM-Pruner
309-
310-
This part is coming soon🚀.
311-
312-
### Docs
313-
314-
- [ ] End-to-end examples of compressing a model and then utilizing multiple backends, e.g., [Lightllm](https://github.com/ModelTC/lightllm), [TensorRT-LLM](https://github.com/NVIDIA/TensorRT-LLM), to infer.
315-
316-
- [ ] Docs about `special` in `quant` part in [Configuration](#configuration) for different algorithms.
317-
318-
- [ ] Docs about adding new algorithms by users themselves.
319-
320-
More detailed Docs are coming soon🚀.
292+
✅ [ShortGPT](https://arxiv.org/abs/2403.03853)
321293

322294
## Acknowledgments
323295

0 commit comments

Comments
 (0)