Commit 8bcfe75

Update
Signed-off-by: Zhiyu Cheng <zhiyuc@nvidia.com>
1 parent c5e788d commit 8bcfe75

1 file changed

Lines changed: 1 addition & 1 deletion

.claude/skills/ptq/references/unsupported-models.md

@@ -71,7 +71,7 @@ Custom patches are required when:
 - **Fused/batched expert weights** — experts stored as a single parameter (e.g., 3D `[num_experts, in, out]`) rather than separate `nn.Linear` modules → Pattern 1 + 3
 - **Self-defined weight parameters** (`nn.Parameter` used directly instead of `nn.Linear`) — common in non-HF or research models → Pattern 1 + 3
 - **VLM structure** (vision encoder that should be excluded) → Pattern 4
-- **FP8 checkpoint** that needs dequantization before re-quantizing → Pattern 5
+- **FP8 checkpoint with non-standard parameter names** (standard `FP8Linear` is handled automatically by the `_QuantFP8Linear` plugin) → Pattern 5
 
 ## Step D — Check weight names against ModelOpt's config patterns
 
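
The first bullet in the hunk above describes fused expert weights as a single 3D parameter of shape `[num_experts, in, out]`. A minimal sketch of how one might flag such parameters from a name-to-shape mapping follows. The function name `find_fused_expert_candidates` and the parameter names in the example are hypothetical and are not part of ModelOpt. A 3D shape is only a hint, since other layers (for example 1D convolutions) also have 3D weights.

```python
from typing import Dict, List, Tuple


def find_fused_expert_candidates(shapes: Dict[str, Tuple[int, ...]]) -> List[str]:
    """Return the names of parameters with a 3D shape.

    A 3D parameter may be a fused expert weight laid out as
    [num_experts, in, out]; this is a heuristic, not a definitive check.
    """
    return sorted(name for name, shape in shapes.items() if len(shape) == 3)


# Hypothetical parameter names and shapes, for illustration only.
example_shapes = {
    "layers.0.mlp.experts.gate_up_proj": (8, 1024, 4096),  # [num_experts, in, out]
    "layers.0.mlp.experts.down_proj": (8, 2048, 1024),     # [num_experts, in, out]
    "layers.0.self_attn.q_proj.weight": (1024, 1024),      # ordinary 2D weight
}

candidates = find_fused_expert_candidates(example_shapes)
print(candidates)
```

For a real PyTorch model, the mapping could be built with `{name: tuple(p.shape) for name, p in model.named_parameters()}` and any flagged names then reviewed by hand.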
