Skip to content

fix deepseek all reduce; fix fp8 per-tensor weight scale; add int4-awq in deepseek#87

Merged
yghstill merged 6 commits into
Tencent:mainfrom
ali-88123:dev_ruicen
Oct 13, 2025
Merged

fix deepseek all reduce; fix fp8 per-tensor weight scale; add int4-awq in deepseek#87
yghstill merged 6 commits into
Tencent:mainfrom
ali-88123:dev_ruicen

Conversation

@ali-88123
Copy link
Copy Markdown
Collaborator

  • 修复了在使用专家并行时,量化因子scale因为all_reduce最终存储的值不准确的问题
  • 修复了deepseek per-tensor量化中,weight scale重复除以448.0的问题
  • 增加了deepseek int4_awq方法

Comment thread angelslim/compressor/quant/ptq.py Outdated
@yghstill yghstill merged commit 06a687d into Tencent:main Oct 13, 2025
5 checks passed
RuBing-Yang pushed a commit to RuBing-Yang/AngelSlim that referenced this pull request Oct 22, 2025
dawnranger pushed a commit to dawnranger/AngelSlim that referenced this pull request Mar 11, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants