Skip to content

ROCm 8-bit Qwen models produce garbage output #17

@soloish90

Description

@soloish90

On my Framework Desktop (Strix Halo) running CachyOS with ROCm 7.2.2 on gfx1151, several 8-bit Qwen MLX models produced broken output in lemon-mlx-engine, including:

  • mlx-community/Qwen3.5-27B-8bit
  • mlx-community/Qwen3.6-27B-8bit
  • mlx-community/Qwen3-1.7B-8bit

This turned out to be an upstream MLX ROCm issue in the tiled 8-bit QMV path, not a lemon-mlx-engine bug.

Upstream fix: NripeshN/mlx#6

Once that MLX change is merged, this downstream issue should be resolved.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions