Skip to content

Support combination quant and cache in diffusion model#63

Merged
yghstill merged 3 commits into
Tencent:mainfrom
yghstill:combin_quant_cache
Sep 9, 2025
Merged

Support combination quant and cache in diffusion model#63
yghstill merged 3 commits into
Tencent:mainfrom
yghstill:combin_quant_cache

Conversation

@yghstill
Copy link
Copy Markdown
Collaborator

@yghstill yghstill commented Sep 3, 2025

  • Combination strategy can setting in yaml such as:
compression:
  name: [Cache, PTQ]
  cache:
    name: DeepCache
    ...
  quantization:
    name: fp8_static
    bits: 8
    ...
  • And run combin inference can execute the following command:
python tools/infer.py --model-path ./output/flux-1-schnell_deepcache_fp8_static \
        --input-prompt "A beautiful landscape with mountains and a river."

@yghstill yghstill changed the title [WIP]Support combination quant and cache in diffusion model Support combination quant and cache in diffusion model Sep 6, 2025
@yghstill yghstill requested a review from ifif-S September 6, 2025 15:29
@yghstill yghstill merged commit 863ac3f into Tencent:main Sep 9, 2025
5 checks passed
@yghstill yghstill deleted the combin_quant_cache branch September 9, 2025 04:15
dawnranger pushed a commit to dawnranger/AngelSlim that referenced this pull request Mar 11, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants