Skip to content

Inference strategy bug for looplm #2

Description

@ACautomata

# CDF update (scalar — mean over batch and positions)
lam_mean = lam.mean().item()

It seems that all samples in a batch will exit at the same step. The official implementation of loop exit is token-level could be found here. https://huggingface.co/ByteDance/Ouro-1.4B/blob/main/modeling_ouro.py#L781

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions