fix(inference): swap known-issue precisions back (GB300 FP8, MI355X FP4) #442

Merged
functionstackx merged 1 commit into master from fix/known-issue-precision-swap on Jun 11, 2026
@functionstackx functionstackx commented Jun 11, 2026

Contributor

Summary

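#441 applied the precision swap to the wrong entries: it set GB300 Dynamo TRT MTP to FP4 and MI355X MoRI SGLang MTP to FP8, which is backwards. This PR sets the correct precisions:

  • GB300 Dynamo TRT MTP: FP8 (NVIDIA/srt-slurm#51)
  • MI355X MoRI SGLang MTP: FP4 (sgl-project/sglang#27194)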

Testing

  • All 39 tests pass across known-issues.test.ts, knownIssueAnnotations.test.ts, and csv-export.test.ts; tests updated to assert the corrected precisions.
  • Data-only change to the KNOWN_CONFIG_ISSUES registry — no matching-logic or rendering changes, so the official and ?unofficialrun= overlay annotation paths are unaffected.

🤖 Generated with Claude Code


Note

Low Risk
Registry-only precision strings in KNOWN_CONFIG_ISSUES with no matching or UI logic changes; only which benchmark points show known-issue warnings is affected.

Overview
Corrects known-config issue precision filters that were swapped in a prior change: GB300 Dynamo TRT MTP now matches FP8 (NVIDIA/srt-slurm#51), and MI355X MoRI SGLang MTP matches FP4 (sgl-project/sglang#27194).

known-issues.test.ts is updated so positive/negative match cases and the deduplication scenario use those precisions. Matching, chart warnings, and CSV notes still use the same matchKnownConfigIssues logic—only which (hwKey, precision) pairs trigger warnings changes.
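The matching behaviour described above can be illustrated with a minimal sketch. The real `KNOWN_CONFIG_ISSUES` entries and the `matchKnownConfigIssues` signature are not shown in this PR, so everything here is an assumption for illustration: the field names, the `hwKey` strings, the lowercase precision values, and the `...Sketch` names are all hypothetical. Only the two (configuration, precision) pairings and the upstream issue references come from the PR text.

```typescript
// Hypothetical shapes: not the repository's actual types.
type Precision = "fp4" | "fp8";

interface KnownConfigIssue {
  id: string; // illustrative identifier
  hwKey: string; // illustrative hardware/framework key
  precision: Precision; // the precision filter this PR corrects
  issueRef: string; // upstream issue named in the PR
}

// Registry with the corrected precisions from this PR.
const KNOWN_CONFIG_ISSUES_SKETCH: KnownConfigIssue[] = [
  {
    id: "gb300-dynamo-trt-mtp",
    hwKey: "gb300_dynamo-trt_mtp",
    precision: "fp8",
    issueRef: "NVIDIA/srt-slurm#51",
  },
  {
    id: "mi355x-mori-sglang-mtp",
    hwKey: "mi355x_mori-sglang_mtp",
    precision: "fp4",
    issueRef: "sgl-project/sglang#27194",
  },
];

interface BenchmarkPoint {
  hwKey: string;
  precision: Precision;
}

// Returns each issue at most once, even if several points match it.
function matchKnownConfigIssuesSketch(
  points: BenchmarkPoint[],
): KnownConfigIssue[] {
  const seen = new Set<string>();
  const matched: KnownConfigIssue[] = [];
  for (const point of points) {
    for (const issue of KNOWN_CONFIG_ISSUES_SKETCH) {
      const hit =
        issue.hwKey === point.hwKey && issue.precision === point.precision;
      if (hit && !seen.has(issue.id)) {
        seen.add(issue.id);
        matched.push(issue);
      }
    }
  }
  return matched;
}

// Two FP8 GB300 points deduplicate to one warning; the FP4 GB300 point
// and the FP8 MI355X point no longer match after the fix.
const demoHits = matchKnownConfigIssuesSketch([
  { hwKey: "gb300_dynamo-trt_mtp", precision: "fp8" },
  { hwKey: "gb300_dynamo-trt_mtp", precision: "fp8" },
  { hwKey: "gb300_dynamo-trt_mtp", precision: "fp4" },
  { hwKey: "mi355x_mori-sglang_mtp", precision: "fp8" },
]);
console.log(demoHits.map((issue) => issue.issueRef));
```

Because the matcher only compares registry fields against each point, a data-only change to the precision strings is enough to move a warning from one set of benchmark points to another, which is consistent with the PR leaving the matching logic untouched.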

Reviewed by Cursor Bugbot for commit 7afd522.

#441 applied the precision swap to the wrong entries. The GB300 Dynamo
TRT MTP accuracy issue (NVIDIA/srt-slurm#51) affects FP8, and the MI355X
MoRI SGLang MTP issue (sgl-project/sglang#27194) affects FP4.

Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
@functionstackx functionstackx requested a review from adibarra as a code owner June 11, 2026 21:07
@vercel

vercel Bot commented Jun 11, 2026


The latest updates on your projects. Learn more about Vercel for GitHub.

| Project | Deployment | Actions | Updated (UTC) |
| --- | --- | --- | --- |
| inferencemax-app | Ready | Preview, Comment | Jun 11, 2026 9:08pm |


@functionstackx functionstackx merged commit cb1e46a into master Jun 11, 2026
16 of 17 checks passed
@functionstackx functionstackx deleted the fix/known-issue-precision-swap branch June 11, 2026 21:08