Skip to content

[PyT] Reduce test sizes in fused attn fp8 vs fp16 to avoid OOM #3020

Merged
cyanguwa merged 3 commits into
NVIDIA:mainfrom
vedaanta:vedaanta/te-fp8-vs-f16-shrink-b1
Jun 4, 2026
Merged

[PyT] Reduce test sizes in fused attn fp8 vs fp16 to avoid OOM #3020
cyanguwa merged 3 commits into
NVIDIA:mainfrom
vedaanta:vedaanta/te-fp8-vs-f16-shrink-b1

tests/attention: black format fp8_13 ModelConfig

56b1837
Select commit
Loading
Failed to load commit list.