fix deepseek all reduce; fix fp8 per-tensor weight scale; add int4-awq in deepseek by ali-88123 · Pull Request #87 · Tencent/AngelSlim

ali-88123 · 2025-10-13T03:36:35Z

修复了在使用专家并行时，量化因子scale因为all_reduce最终存储的值不准确的问题
修复了deepseek per-tensor量化中，weight scale重复除以448.0的问题
增加了deepseek int4_awq方法

…q in deepseek (Tencent#87)

ali-88123 added 4 commits October 9, 2025 13:03

fix deepseek all reduce;fix fp8 per-tensor weight scale

edb6b48

Merge branch 'main' into dev

2cbb7d5

add awq in deepseek

b18741d

add int4-awq instruction in deepseek_quant.md

8381921

yghstill reviewed Oct 13, 2025

View reviewed changes

Comment thread angelslim/compressor/quant/ptq.py Outdated

ali-88123 added 2 commits October 13, 2025 15:06

modify AWQ's observer_layer_classes from self.quant_model

96f69c1

add self.observer_layer_classes in quant_model

2b327f0

yghstill approved these changes Oct 13, 2025

View reviewed changes

yghstill merged commit 06a687d into Tencent:main Oct 13, 2025
5 checks passed

RuBing-Yang pushed a commit to RuBing-Yang/AngelSlim that referenced this pull request Oct 22, 2025

fix deepseek all reduce; fix fp8 per-tensor weight scale; add int4-aw…

5c0251a

…q in deepseek (Tencent#87)

dawnranger pushed a commit to dawnranger/AngelSlim that referenced this pull request Mar 11, 2026

fix deepseek all reduce; fix fp8 per-tensor weight scale; add int4-aw…

e06d9e9

…q in deepseek (Tencent#87)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix deepseek all reduce; fix fp8 per-tensor weight scale; add int4-awq in deepseek#87

fix deepseek all reduce; fix fp8 per-tensor weight scale; add int4-awq in deepseek#87
yghstill merged 6 commits into
Tencent:mainfrom
ali-88123:dev_ruicen

ali-88123 commented Oct 13, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ali-88123 commented Oct 13, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants