Support combination quant and cache in diffusion model by yghstill · Pull Request #63 · Tencent/AngelSlim

yghstill · 2025-09-03T07:27:10Z

Combination strategy can setting in yaml such as:

compression:
  name: [Cache, PTQ]
  cache:
    name: DeepCache
    ...
  quantization:
    name: fp8_static
    bits: 8
    ...

And run combin inference can execute the following command:

python tools/infer.py --model-path ./output/flux-1-schnell_deepcache_fp8_static \
        --input-prompt "A beautiful landscape with mountains and a river."

yghstill added 2 commits September 2, 2025 22:23

support combin quant and flux

6db55d5

update flux cache

ed77a9c

yghstill changed the title ~~[WIP]Support combination quant and cache in diffusion model~~ Support combination quant and cache in diffusion model Sep 6, 2025

yghstill requested a review from ifif-S September 6, 2025 15:29

fix cache infer

1de4d15

ifif-S approved these changes Sep 9, 2025

View reviewed changes

yghstill merged commit 863ac3f into Tencent:main Sep 9, 2025
5 checks passed

yghstill deleted the combin_quant_cache branch September 9, 2025 04:15

dawnranger pushed a commit to dawnranger/AngelSlim that referenced this pull request Mar 11, 2026

Support combination quant and cache in diffusion model (Tencent#63)

8ed43d2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support combination quant and cache in diffusion model#63

Support combination quant and cache in diffusion model#63
yghstill merged 3 commits into
Tencent:mainfrom
yghstill:combin_quant_cache

yghstill commented Sep 3, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

yghstill commented Sep 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

yghstill commented Sep 3, 2025 •

edited

Loading