Add GPU-side Gumbel-max sampling for CUDA graph compatibility #31556
| Job | Run time |
|---|---|
| 19m 28s | |
| 23s | |
| 14m 10s | |
| 10m 25s | |
| 7m 11s | |
| 7m 46s | |
| 7m 27s | |
| 7m 54s | |
| 5m 59s | |
| 6m 38s | |
| 8m 39s | |
| 9m 27s | |
| 7m 58s | |
| 8m 10s | |
| 7m 16s | |
| 2h 8m 51s |
| Job | Run time |
|---|---|
| 19m 28s | |
| 23s | |
| 14m 10s | |
| 10m 25s | |
| 7m 11s | |
| 7m 46s | |
| 7m 27s | |
| 7m 54s | |
| 5m 59s | |
| 6m 38s | |
| 8m 39s | |
| 9m 27s | |
| 7m 58s | |
| 8m 10s | |
| 7m 16s | |
| 2h 8m 51s |