Skip to content

Commit 2e3262c

Browse files
kyuyeunkYijia J
authored andcommitted
Add batched rpa e2e test using qwen coder (#2694)
Signed-off-by: Kyuyeun Kim <kyuyeunk@google.com>
1 parent 3ce54ef commit 2e3262c

1 file changed

Lines changed: 2 additions & 0 deletions

File tree

tests/e2e/benchmarking/bm_qwen3_coder.sh

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -140,6 +140,8 @@ start_time=$(date +%s)
140140
# If needed, replace "--async-scheduling" with "--no-async-scheduling"
141141
export USE_MOE_EP_KERNEL=${use_moe_ep_kernel}
142142
export MODEL_IMPL_TYPE=vllm
143+
# TODO: Remove this once batched rpa is the default
144+
export USE_BATCHED_RPA_KERNEL=1
143145

144146
echo "bench_serving commit: $(git -C bench_serving rev-parse HEAD)"
145147

0 commit comments

Comments
 (0)