From 5a66e8b056dc8f06c2428511d78165e0b1266119 Mon Sep 17 00:00:00 2001 From: WOOD_C <51071696+WOODchen7@users.noreply.github.com> Date: Fri, 26 Sep 2025 17:49:05 +0800 Subject: [PATCH 1/2] ADD tequila news to readme --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index 6888db72..669d9ce7 100644 --- a/README.md +++ b/README.md @@ -31,6 +31,7 @@ - [技术交流](#技术交流) ## 📣最新进展 +- [25/09/26] 我们发布了三值量化[TEQUILA](https://github.com/Tencent/AngelSlim/tree/tequila): TRAPPING-FREE TERNARY QUANTIZATION FOR LARGE LANGUAGE MODELS 相关代码。 - [25/09/24] 我们支持了Qwen3系列模型的NVFP4的PTQ量化,我们还开源了[Qwen3-32B-NVFP4](https://huggingface.co/AngelSlim/Qwen3-32B_nvfp4)、[Qwen3-235B-A22B-NVFP4](https://huggingface.co/AngelSlim/Qwen3-235B-A22B_nvfp4)权重。 - [25/09/01] 我们支持了[Hunyuan-MT-7B](https://huggingface.co/tencent/Hunyuan-MT-7B-fp8)翻译开源模型的FP8量化;支持了Eagle3的Torch推理及Benchmark评测流程;支持了[FLUX](https://github.com/Tencent/AngelSlim/tree/main/configs/flux)的量化、Cache;支持了[Seed-OSS](https://github.com/Tencent/AngelSlim/tree/main/configs/seed_oss)模型量化压缩。 - [25/08/06] 我们支持了`Hunyuan 0.5B/1.8B/4B/7B`和`Qwen2.5VL 3B/7B/32B/72B`的FP8、INT4量化,支持了`DeepSeek-R1/V3`和`Kimi-K2`模型的`FP8-Static`、`W4A8-FP8`量化。我们还开源了`Hunyuan 1.8B/4B/7B`系列模型的Eagle3权重。 From f27f072626e988e09631a2c8a14e2fe7508c5496 Mon Sep 17 00:00:00 2001 From: WOOD_C <51071696+WOODchen7@users.noreply.github.com> Date: Fri, 26 Sep 2025 21:04:32 +0800 Subject: [PATCH 2/2] Update TEQUILA link in README for clarity --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 669d9ce7..90ee4239 100644 --- a/README.md +++ b/README.md @@ -31,7 +31,7 @@ - [技术交流](#技术交流) ## 📣最新进展 -- [25/09/26] 我们发布了三值量化[TEQUILA](https://github.com/Tencent/AngelSlim/tree/tequila): TRAPPING-FREE TERNARY QUANTIZATION FOR LARGE LANGUAGE MODELS 相关代码。 +- [25/09/26] 我们发布了三值量化[TEQUILA](https://github.com/Tencent/AngelSlim/tree/tequila/TernaryQuant): TRAPPING-FREE TERNARY QUANTIZATION FOR LARGE LANGUAGE MODELS 相关代码。 - [25/09/24] 我们支持了Qwen3系列模型的NVFP4的PTQ量化,我们还开源了[Qwen3-32B-NVFP4](https://huggingface.co/AngelSlim/Qwen3-32B_nvfp4)、[Qwen3-235B-A22B-NVFP4](https://huggingface.co/AngelSlim/Qwen3-235B-A22B_nvfp4)权重。 - [25/09/01] 我们支持了[Hunyuan-MT-7B](https://huggingface.co/tencent/Hunyuan-MT-7B-fp8)翻译开源模型的FP8量化;支持了Eagle3的Torch推理及Benchmark评测流程;支持了[FLUX](https://github.com/Tencent/AngelSlim/tree/main/configs/flux)的量化、Cache;支持了[Seed-OSS](https://github.com/Tencent/AngelSlim/tree/main/configs/seed_oss)模型量化压缩。 - [25/08/06] 我们支持了`Hunyuan 0.5B/1.8B/4B/7B`和`Qwen2.5VL 3B/7B/32B/72B`的FP8、INT4量化,支持了`DeepSeek-R1/V3`和`Kimi-K2`模型的`FP8-Static`、`W4A8-FP8`量化。我们还开源了`Hunyuan 1.8B/4B/7B`系列模型的Eagle3权重。