Skip to content

add Leptokurtic Quant(LeptoQuant) algorithm#45

Merged
WOODchen7 merged 19 commits into
Tencent:mainfrom
WOODchen7:psad_fp8
Aug 19, 2025
Merged

add Leptokurtic Quant(LeptoQuant) algorithm#45
WOODchen7 merged 19 commits into
Tencent:mainfrom
WOODchen7:psad_fp8

Conversation

@WOODchen7
Copy link
Copy Markdown
Collaborator

@WOODchen7 WOODchen7 commented Aug 18, 2025

To address the aforementioned FP8 quantization issues, we introduced Leptokurtic Quant(LeptoQuant) search, a search strategy that concentrates the FP8 weight mapping range into a high-precision region by isolating outliers. Quantizing activations is generally more difficult than quantizing weights, so we prioritize optimizing FP8 activations. LeptoQuant uses the original FP8 outlier values as the upper limit of FP8 precision expression, calculating a new scale that compresses the value distribution into a high-precision distribution range, resulting in better precision expression of quantized activation values.

@yghstill yghstill changed the title DOIS fp8 add DOIS-fp8 algorithm Aug 19, 2025
@yghstill yghstill self-requested a review August 19, 2025 02:12
Comment thread angelslim/compressor/quant/modules/fp8/dois_scale.py Outdated
Comment thread configs/hunyuan/fp8_static/hunyuan_2b_dense_ados_fp8_static.yaml Outdated
Comment thread docs/source/features/quantization/fp8_dois.md Outdated
Comment thread docs/source/features/quantization/fp8_dois.md Outdated
Comment thread angelslim/compressor/quant/modules/fp8/dois_fp8.py Outdated
@WOODchen7 WOODchen7 changed the title add DOIS-fp8 algorithm add Leptokurtic Quant(LeptoQuant) algorithm Aug 19, 2025
@WOODchen7 WOODchen7 merged commit fa7304e into Tencent:main Aug 19, 2025
5 checks passed
WOODchen7 added a commit to WOODchen7/AngelSlim that referenced this pull request Aug 27, 2025
Co-authored-by: woodchen7 <woodchenwu@tencent.com>
dawnranger pushed a commit to dawnranger/AngelSlim that referenced this pull request Mar 11, 2026
Co-authored-by: woodchen7 <woodchenwu@tencent.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants