Fix TEGroupedLinear quantization for expert parallelism (EP > 1) #1727
| Job | Run time |
|---|---|
| | 12s |
| | 0s |
| | 0s |
| | 0s |
| | 0s |
| | 10m 43s |
| | 10m 46s |
| | 13m 42s |
| | 22m 48s |
| | 11m 49s |
| | 30m 15s |
| | 38m 3s |
| | 28m 16s |
| | 3s |
| Total | 2h 46m 37s |