Commit 23f04b8
Enable Rust frontend (VLLM_USE_RUST_FRONTEND=1) (#1634)
* Enable Rust frontend (VLLM_USE_RUST_FRONTEND=1)
With Rust frontend, we don't change kernel, attention, MoE GEMM, or
KV cache. So it won't change the Througput and TPOT. But it benefits
TTFT as it helps to decrease the frontend CUP time cost from the moment
of requesting to generate the first token.
* Update per-changelog
---------
Co-authored-by: functionstackx <47992694+functionstackx@users.noreply.github.com>1 parent 21aa356 commit 23f04b8
2 files changed
Lines changed: 9 additions & 0 deletions
Lines changed: 1 addition & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
25 | 25 | | |
26 | 26 | | |
27 | 27 | | |
| 28 | + | |
28 | 29 | | |
29 | 30 | | |
30 | 31 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
3523 | 3523 | | |
3524 | 3524 | | |
3525 | 3525 | | |
| 3526 | + | |
| 3527 | + | |
| 3528 | + | |
| 3529 | + | |
| 3530 | + | |
| 3531 | + | |
| 3532 | + | |
| 3533 | + | |
0 commit comments