[fix](gpt-oss): fix quark quantized model in moe bias #787
Open
PerryZhang01 wants to merge 1 commit into
Conversation
valarLip approved these changes on May 14, 2026
Motivation
This PR fixes a padding error in quantized gpt_oss. The quantized gpt-oss-120b model comes from the Quark team (https://huggingface.co/amd/gpt-oss-120b-moe-ori-attn-ptpc); it only quantizes the GEMM weights in attention with the PTPC method. The biases in the MoE layers are padded, and allocating them as empty tensors introduces dirty (uninitialized) data, so zero-initialized bias data is used instead.
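A minimal sketch of the idea, assuming a PyTorch allocation path; the tensor names and shapes below are hypothetical and may differ from the actual PR code:

```python
import torch

# Hypothetical illustration: when MoE bias tensors are padded to match the
# expert weight layout, torch.empty leaves the padded region holding whatever
# bytes were already in memory, so those "dirty" values get added to the
# expert outputs. torch.zeros guarantees the padding contributes nothing.

num_experts, padded_dim = 128, 2944  # assumed shapes for illustration only

# Before the fix: uninitialized memory in the padded region is garbage.
dirty_bias = torch.empty(num_experts, padded_dim, dtype=torch.bfloat16)

# After the fix: zero-filled padding is safe to add to the expert outputs.
zero_bias = torch.zeros(num_experts, padded_dim, dtype=torch.bfloat16)
```

Zero-filling costs one extra memset per bias tensor at load time, which is negligible compared to silently corrupting the MoE outputs with uninitialized values.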