Skip to content

🌟 Introducing SAR (Self-Autoregressive Refinement): stable self-rollout that unlocks robust post-training for Visual Autoregressive image generation. #178

@GengzeZhou

Description

@GengzeZhou

Thanks to the inspiring progress in Visual Autoregressive (VAR) models!

We introduce SAR (Self-Autoregressive Refinement), a lightweight post-training framework that fixes the train–test mismatch in scale-wise autoregressive image generation.
Instead of RL-based alignment, SAR shows that self-rollout alone can serve as a robust and stable post-training strategy.

Stagger-Scale Rollout (SSR)

With Stagger-Scale Rollout and a Contrastive Student-Forcing Loss, SAR enables effective error correction across scales and consistently improves image quality with minimal overhead (up to 5.2% FID↓).

Arxiv: https://arxiv.org/abs/2512.06421
Project Page: https://gengzezhou.github.io/SAR/
Github: https://gengzezhou.github.io/SAR/

Metadata

Metadata

Assignees

No one assigned

    Labels

    ResearchThird-party repositories or research which use VAR

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions