[core] feat: support async scheduling with structured outputs by AlpinDale · Pull Request #1582 · dphnAI/aphrodite-engine

AlpinDale · 2025-11-04T06:46:39Z

No description provided.

Signed-off-by: AlpinDale <alpindale@gmail.com>

gemini-code-assist

Code Review

This pull request introduces a significant refactoring to support asynchronous scheduling with structured outputs. The core change involves splitting the model execution into a two-step process: execute_model for the forward pass and sample_tokens for sampling. This is a clean approach that allows for grammar bitmask computation between the two steps. The changes are consistently applied across the scheduler, engine core, executors, and model runners. I've found one critical issue that could lead to a crash in distributed setups.

gemini-code-assist · 2025-11-04T06:48:42Z

+            assert model_runner_output is not None
            kv_output = model_runner_output.kv_connector_output
            if not kv_output:
                continue


The assertion assert model_runner_output is not None can fail. The outputs list can contain None values from workers that did not produce an output, which is a valid scenario with the new changes. This assertion will cause a crash in distributed setups. You should handle None values gracefully by skipping them.

Suggested change

assert model_runner_output is not None

kv_output = model_runner_output.kv_connector_output

if not kv_output:

continue

if model_runner_output is None:

continue

kv_output = model_runner_output.kv_connector_output

if not kv_output:

continue

…Runner Signed-off-by: AlpinDale <alpindale@gmail.com>

[core] feat: support async scheduling with structured outputs

18118d6

Signed-off-by: AlpinDale <alpindale@gmail.com>

gemini-code-assist Bot reviewed Nov 4, 2025

View reviewed changes

[cleanup] remove unused variable num_scheduled_tokens from GPUModel…

db31fed

…Runner Signed-off-by: AlpinDale <alpindale@gmail.com>

AlpinDale merged commit a3d7b9d into main Nov 4, 2025
1 check passed

AlpinDale deleted the async-sched-structured branch November 4, 2025 07:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[core] feat: support async scheduling with structured outputs#1582

[core] feat: support async scheduling with structured outputs#1582
AlpinDale merged 2 commits into
mainfrom
async-sched-structured

AlpinDale commented Nov 4, 2025

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Nov 4, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Uh oh!

Conversation

AlpinDale commented Nov 4, 2025

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Nov 4, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant