Add routing microbenchmark for choose_replica + dispatch pattern by jeffreywang-anyscale · Pull Request #63293 · ray-project/ray

jeffreywang-anyscale · 2026-05-12T03:10:44Z

Description

Following up on #63255 (comment), we'd like to show the delta between choose_replica + dispatch and remote in our DB dashboards.

Release test results -- latencies in ms (https://buildkite.com/ray-project/release/builds/92517/canvas?sid=019e1a3c-642f-4a42-867b-2818577272b6&tab=output)

Percentile	`remote`	`choose_replica + dispatch`	Delta
p50	1.040	1.784	+0.74
p90	1.149	1.922	+0.77
p95	1.199	1.970	+0.77
p99	1.437	2.137	+0.70

Related issues

Link related issues: "Fixes #1234", "Closes #1234", or "Related to #1234".

Additional information

Optional: Add implementation details, API changes, usage examples, screenshots, etc.

jeffreywang-anyscale · 2026-05-12T03:16:12Z

Kicking off release tests to populate data to verify databricks query.

gemini-code-assist

Code Review

This pull request introduces a new benchmarking mode, 'choose_dispatch', to evaluate the latency of the choose_replica and dispatch pattern in Ray Serve. The changes include adding a new benchmarking method to the Benchmarker class, updating the handle_noop_latency script with a new CLI option, and expanding the microbenchmark workloads to include this new mode. A review comment suggests refactoring the run_latency_benchmark method to avoid duplicating the definition of the internal benchmark function, which would improve code maintainability.

gemini-code-assist · 2026-05-12T03:16:14Z

+        if mode == "remote":
+
+            async def f():
+                await self.do_single_request(payload)
+
+        elif mode == "choose_dispatch":
+
+            async def f():
+                await self.do_single_choose_dispatch(payload)
+
+        else:
+            raise ValueError(f"Unknown mode {mode!r}")


The current implementation defines the local function f twice within different branches of the if statement. This logic can be refactored to be more concise and maintainable by assigning the target method to a variable first, and then defining f once.

if mode == "remote": func = self.do_single_request elif mode == "choose_dispatch": func = self.do_single_choose_dispatch else: raise ValueError(f"Unknown mode {mode!r}") async def f(): await func(payload)

Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>

jeffreywang-anyscale requested a review from a team as a code owner May 12, 2026 03:10

jeffreywang-anyscale mentioned this pull request May 12, 2026

[serve][3/3] Expose choose_replica/dispatch on deployment handles #63255

Merged

jeffreywang-anyscale added the go add ONLY when ready to merge, run all tests label May 12, 2026

Base automatically changed from decouple-routing-primitives-3 to master May 12, 2026 03:16

gemini-code-assist Bot reviewed May 12, 2026

View reviewed changes

ray-gardener Bot added the serve Ray Serve Related Issue label May 12, 2026

Add routing microbenchmark for choose_replica + dispatch pattern

7da35fb

Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>

jeffreywang-anyscale force-pushed the decouple-routing-primitives-benchmark branch from 4ae7060 to 7da35fb Compare May 22, 2026 06:30

iamjustinhsu approved these changes May 22, 2026

View reviewed changes

abrarsheikh approved these changes Jun 1, 2026

View reviewed changes

abrarsheikh merged commit 02ba033 into master Jun 1, 2026
6 checks passed

abrarsheikh deleted the decouple-routing-primitives-benchmark branch June 1, 2026 18:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add routing microbenchmark for choose_replica + dispatch pattern#63293

Add routing microbenchmark for choose_replica + dispatch pattern#63293
abrarsheikh merged 1 commit into
masterfrom
decouple-routing-primitives-benchmark

jeffreywang-anyscale commented May 12, 2026 •

edited

Loading

Uh oh!

jeffreywang-anyscale commented May 12, 2026 •

edited

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 12, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

jeffreywang-anyscale commented May 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Related issues

Additional information

Uh oh!

jeffreywang-anyscale commented May 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 12, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jeffreywang-anyscale commented May 12, 2026 •

edited

Loading

jeffreywang-anyscale commented May 12, 2026 •

edited

Loading