feat: add configurable scheduling policy by Alise-svg · Pull Request #119 · sgl-project/mini-sglang

Alise-svg · 2026-04-20T02:57:33Z

Add --schedule-policy command line argument to allow users to choose
between different scheduling strategies for batch formation.

Supported policies:

prefill_first (default): Prioritizes prefill requests, reducing
Time To First Token (TTFT) for online serving scenarios where
users wait for the first token.
decode_first: Prioritizes decode requests, improving throughput
for offline batch inference where maximizing token generation
rate is more important than latency.

Usage:
python -m minisgl --model "Qwen/Qwen3-0.6B" --schedule-policy decode_first

DarkSharpness · 2026-05-10T10:22:44Z

-            self.prefill_manager.schedule_next_batch(self.prefill_budget)
-            or self.decode_manager.schedule_next_batch()
-        )
+        if self.schedule_policy == "decode_first":


A decode first policy should be:

Form a decode batch first.

Try to schedule a prefill batch with the remaining token budget (prefill_budget - decode_tokens).
This is actually mix prefill-decode style batching.

- Add --schedule-policy command line argument - Support 'prefill_first' (default) and 'decode_first' policies - prefill_first reduces TTFT for online serving - decode_first improves throughput for offline inference

DarkSharpness added the enhancement New feature or request label May 10, 2026

DarkSharpness requested changes May 10, 2026

View reviewed changes

Alise-svg added 2 commits May 12, 2026 22:45

feat: add configurable scheduling policy

ee7d6ae

- Add --schedule-policy command line argument - Support 'prefill_first' (default) and 'decode_first' policies - prefill_first reduces TTFT for online serving - decode_first improves throughput for offline inference

feat: add schedule_policy attribute to Scheduler

3d221e8

Alise-svg force-pushed the feature/configurable-scheduling-policy branch from 20dbc44 to 3d221e8 Compare May 12, 2026 14:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add configurable scheduling policy#119

feat: add configurable scheduling policy#119
Alise-svg wants to merge 2 commits into
sgl-project:mainfrom
Alise-svg:feature/configurable-scheduling-policy

Alise-svg commented Apr 20, 2026

Uh oh!

DarkSharpness May 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Alise-svg commented Apr 20, 2026

Uh oh!

DarkSharpness May 10, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants