[ET Device Support] CUDA-native Qwen 3.5 MoE inference with device tensor pipeline #30122
| Job | Run time |
|---|---|
| | 8m 25s |
| | 7m 52s |
| | 7m 55s |
| | 12m 18s |
| | 10m 36s |
| | 8m 39s |
| | 6m 40s |
| | 6m 21s |
| | 7m 43s |
| | 7m 22s |
| | 7m 16s |
| | 7m 43s |
| | 6m 58s |
| | 8m 35s |
| | 12m 20s |
| Total | 2h 6m 43s |