Skip to content

fix(inference): correct known-issue precisions#441

Merged
adibarra merged 1 commit into
masterfrom
fix/known-issue-precisions
Jun 11, 2026
Merged

fix(inference): correct known-issue precisions#441
adibarra merged 1 commit into
masterfrom
fix/known-issue-precisions

Conversation

@adibarra

@adibarra adibarra commented Jun 11, 2026

Copy link
Copy Markdown
Contributor

Note

Low Risk
Small data-only change to warning metadata; wrong precisions would mislabel charts but does not affect inference execution or auth.

Overview
Corrects which DeepSeek R1 benchmark precisions trigger known-issue warnings on inference charts and in CSV exports.

GB300 Dynamo TRT MTP (NVIDIA/srt-slurm#51) is now limited to fp4 instead of fp8. MI355X MoRI SGLang MTP (sgl-project/sglang#27194) is now limited to fp8 instead of matching every precision. known-issues.test.ts is updated so each entry has positive and negative precision cases aligned with those rules.

Reviewed by Cursor Bugbot for commit d786dbd. Bugbot is set up for automated code reviews on this repo. Configure here.

@adibarra adibarra marked this pull request as ready for review June 11, 2026 20:55
@vercel

vercel Bot commented Jun 11, 2026

Copy link
Copy Markdown

The latest updates on your projects. Learn more about Vercel for GitHub.

Project Deployment Actions Updated (UTC)
inferencemax-app Building Building Preview, Comment Jun 11, 2026 8:55pm

Request Review

@adibarra adibarra merged commit 85e1e41 into master Jun 11, 2026
15 of 16 checks passed
@adibarra adibarra deleted the fix/known-issue-precisions branch June 11, 2026 20:55
functionstackx added a commit that referenced this pull request Jun 11, 2026
…P4) (#442)

#441 applied the precision swap to the wrong entries. The GB300 Dynamo
TRT MTP accuracy issue (NVIDIA/srt-slurm#51) affects FP8, and the MI355X
MoRI SGLang MTP issue (sgl-project/sglang#27194) affects FP4.

Co-authored-by: Claude Fable 5 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant