|
4 | 4 | Compression via Sparse Attention and Adaptive Frequency", IEEE/CVF Conf. |
5 | 5 | on Computer Vision and Pattern Recognition (CVPR), 2026. |
6 | 6 |
|
7 | | -Adapted from the upstream reference implementation; the entropy stack |
8 | | -shares the containerized |
9 | | -:class:`~compressai.latent_codecs.HyperpriorLatentCodec` / |
10 | | -:class:`~compressai.latent_codecs.ChannelGroupsLatentCodec` wiring with |
11 | | -DCAE and reuses |
12 | | -:class:`~compressai.models._helpers.dictionary_context.DictionaryMeanScaleContextHead`. |
13 | | -
|
14 | | -Two SAAF-specific additions to the DCAE template: |
15 | | -
|
16 | | -- An auxiliary encoder / decoder chain (``aux_enc`` / ``aux_dec``) of |
17 | | - :class:`_AdaptiveFrequencyBlock` / |
18 | | - :class:`_InverseAdaptiveFrequencyBlock` modules runs alongside ``g_a`` / |
19 | | - ``g_s`` and is summed into the main path at every stage boundary. Each |
20 | | - block carries an :class:`compressai.models._helpers.auxt.OLP` so the |
21 | | - AuxT regulariser is integral (not opt-in like TCM ``use_auxt``); |
22 | | - :meth:`SAAF.aux_loss` aggregates them via the shared helper. |
23 | | -- A :class:`_DenoisingAsRegularizer` "diffusion prior" head produces a |
24 | | - scalar ``diffusion_loss`` during training that the host's training loop |
25 | | - adds to the rate-distortion objective. |
| 7 | +Adapted from the upstream reference implementation at |
| 8 | +https://github.com/huidong-ma/SAAF |
| 9 | +based on CompressAI, DCAE, and AuxT. |
| 10 | +
|
| 11 | +SAAF combines adaptive-frequency auxiliary transform branches |
| 12 | +(``aux_enc`` / ``aux_dec``) with a denoising regularizer that produces |
| 13 | +``diffusion_loss`` during training. |
26 | 14 | """ |
27 | 15 |
|
28 | 16 | from __future__ import annotations |
|
0 commit comments