Add Triton INT4 dense kernels with dequant prefill path for Qwen3.5 MoE #617
| Job | Run time |
|---|---|
| 36m 43s | |
| 32m 4s | |
| 38m 13s | |
| 8m 59s | |
| 9m 8s | |
| 12m 18s | |
| 33m 16s | |
| 12m 2s | |
| 10m 42s | |
| 10m 54s | |
| 10m 40s | |
| 10m 53s | |
| 10m 31s | |
| 10m 16s | |
| 10m 14s | |
| 10m 16s | |
| 11m 9s | |
| 9m 53s | |
| 10m 12s | |
| 10m 32s | |
| 10m 23s | |
| 5h 19m 18s |