Skip to content

Remove redundant workers in Qwen3 Q8_0 FFN layers.

664f160
Select commit
Loading
Failed to load commit list.
Sign in for the full log view
Merged

[FP16] Improved performance by fusing dequantize with compute in kernels: 20-30% Inference Speedup #78

Remove redundant workers in Qwen3 Q8_0 FFN layers.
664f160
Select commit
Loading
Failed to load commit list.