DIOS fp8 #44

Closed
WOODchen7 wants to merge 13 commits into Tencent:main from WOODchen7:psad_fp8
WOODchen7 (Collaborator) commented Aug 18, 2025

To address the aforementioned FP8 quantization issues, we introduce Dynamic Outlier Isolation Scale (DOIS) search, a scale-search strategy that isolates outliers so that the FP8 weight mapping range is concentrated in the format's high-precision region. Because activations are generally harder to quantize than weights, we prioritize optimizing FP8 activations. DOIS takes the original FP8 outlier values as the upper limit of what FP8 can represent and computes a new scale that compresses the value distribution into the high-precision range, so that the quantized activation values are represented more precisely.
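To make the idea of a scale search concrete, here is a minimal NumPy sketch. It is not the code from this PR, and the details are assumptions: it simulates FP8 E4M3 rounding in software, tries a handful of clipping ratios below the tensor's absolute maximum (so the largest values saturate), and keeps the scale with the lowest mean squared error. The names `fake_quant_e4m3`, `quant_mse`, and `search_scale`, the candidate ratios, and the MSE criterion are all hypothetical choices for illustration.

```python
import numpy as np

E4M3_MAX = 448.0     # largest finite value in FP8 E4M3 (the "fn" variant)
E4M3_MIN_EXP = -6    # exponent of the smallest normal value
E4M3_MANT_BITS = 3   # mantissa bits


def fake_quant_e4m3(x):
    """Round values to the nearest E4M3-representable value, saturating at +/-448."""
    x = np.clip(np.asarray(x, dtype=np.float64), -E4M3_MAX, E4M3_MAX)
    _, e = np.frexp(np.abs(x))                  # |x| = m * 2**e with m in [0.5, 1)
    exp = np.maximum(e - 1, E4M3_MIN_EXP)       # floor(log2|x|), clamped for subnormals
    step = np.ldexp(1.0, exp - E4M3_MANT_BITS)  # spacing between neighbours in that binade
    return np.round(x / step) * step


def quant_mse(x, scale):
    """Mean squared error of quantize-then-dequantize with a given scale."""
    x_hat = fake_quant_e4m3(x / scale) * scale
    return float(np.mean((x - x_hat) ** 2))


def search_scale(x, ratios=np.linspace(0.1, 1.0, 19)):
    """Assumed search: try clipping thresholds ratio * absmax, keep the lowest-MSE scale."""
    absmax = float(np.max(np.abs(x)))
    if absmax == 0.0:
        return 1.0, 0.0  # all-zero tensor: any positive scale is exact
    best_scale, best_mse = None, np.inf
    for r in ratios:
        scale = r * absmax / E4M3_MAX  # values above r * absmax saturate at 448
        mse = quant_mse(x, scale)
        if mse < best_mse:
            best_scale, best_mse = scale, mse
    return best_scale, best_mse


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    acts = rng.normal(size=4096)
    acts[0] = 500.0  # a single synthetic outlier
    scale, mse = search_scale(acts)
    print("chosen scale:", scale, "mse:", mse)
```

Because the candidate list includes a ratio of 1.0, the search can never do worse than plain absmax scaling under this error metric. Whether a smaller ratio wins depends on the data, since FP8 is a floating-point format and only loses relative precision once values fall into its subnormal range.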

WOODchen7 closed this Aug 18, 2025