Add diffusion FLUX fp8_static quantization by yghstill · Pull Request #37 · Tencent/AngelSlim

yghstill · 2025-08-10T15:11:07Z

Features

Support FLUX's Transformer block fp8 static quantization calibration.
Export quantization model as safetensors format.
Support FLUX's quantization inference, load quant config from angelslim_config.json, for example:

from angelslim.engine import InferEngine
slim_engine = InferEngine()
slim_engine.from_pretrained(model_path="youu/quant/angelslim/model/")
output = slim_engine.generate("A beautiful landscape with mountains and a river.")

TODO

For better reviewability, the combined compression strategy (e.g., cache + quant) will be submitted in the next PR.

…n_quant

add diffusion FLUX fp8_static quantization

0a4ec56

yghstill changed the title ~~Add diffusion FLUX fp8_static quantization~~ [WIP]Add diffusion FLUX fp8_static quantization Aug 10, 2025

yghstill and others added 4 commits August 17, 2025 23:15

update quant infer workflow

4785e34

Merge branch 'main' into add_diffusion_quant

87660c7

update flux quant infer

1141f11

clean infer code

2c724be

yghstill changed the title ~~[WIP]Add diffusion FLUX fp8_static quantization~~ Add diffusion FLUX fp8_static quantization Aug 19, 2025

yghstill added 5 commits August 21, 2025 18:44

Merge branch 'main' of github.com:Tencent/AngelSlim into add_diffusio…

9b21cd0

…n_quant

fix quant infer

3fb2b32

Merge branch 'main' of github.com:Tencent/AngelSlim into add_diffusio…

8a4f724

…n_quant

add save scale only

d499692

update ptq hook

efea88d

WOODchen7 approved these changes Aug 26, 2025

View reviewed changes

yghstill merged commit 9177027 into Tencent:main Aug 26, 2025
5 checks passed

yghstill deleted the add_diffusion_quant branch August 26, 2025 08:43

WOODchen7 pushed a commit to WOODchen7/AngelSlim that referenced this pull request Aug 27, 2025

Add diffusion FLUX fp8_static quantization (Tencent#37)

12b3d0b

dawnranger pushed a commit to dawnranger/AngelSlim that referenced this pull request Mar 11, 2026

Add diffusion FLUX fp8_static quantization (Tencent#37)

0977bd4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add diffusion FLUX fp8_static quantization#37

Add diffusion FLUX fp8_static quantization#37
yghstill merged 10 commits into
Tencent:mainfrom
yghstill:add_diffusion_quant

yghstill commented Aug 10, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

yghstill commented Aug 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Features

TODO

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

yghstill commented Aug 10, 2025 •

edited

Loading