Add bounded generation batch scheduling by fcogidi · Pull Request #2 · VectorInstitute/infermesh

fcogidi · 2026-04-17T18:38:15Z

Summary

Add a bounded in-flight scheduler for generate_batch and agenerate_batch when max_parallel_requests is set.
Keep the uncapped path unchanged for backward-compatible broad fan-out behavior.
Preserve ordered batch results, callback semantics, cancellation behavior, and queue-wait timing.
Refactor generation logic into a private helper module so LMClient keeps the public API surface explicit.
Update docs to explain that explicit max_parallel_requests is required for large or memory-sensitive Python batch runs.

Testing

uv run pytest tests/test_client_batch.py tests/test_client_limits.py tests/test_cli_bench.py -q
uv run pre-commit run -a

…Client

….2.0

Copilot

Pull request overview

Adds bounded in-flight scheduling for generation batches when LMClient(max_parallel_requests=...) is set, to avoid creating one task per batch item up front while preserving ordering, callbacks, and cancellation semantics.

Changes:

Refactors async generation logic into src/infermesh/_generation.py and routes LMClient.agenerate* through it.
Implements bounded-window scheduling for agenerate_batch (and therefore generate_batch, via the sync runner) and plumbs queue-admission timing into request metrics.
Updates docs and adds tests covering bounded ordering, strict-failure cancellation, and queue-wait including scheduler delay.

Reviewed changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
`src/infermesh/client.py`	Routes `agenerate`/`agenerate_batch` through new private helpers; updates docstrings to document bounded batching behavior.
`src/infermesh/_generation.py`	New helper module implementing bounded-window scheduling and shared generation request logic.
`src/infermesh/_client_runtime.py`	Adds optional `queue_started_at` to include scheduler delay in queue-wait metrics.
`tests/test_client_batch.py`	Adds bounded-window tests for ordering, concurrency cap, and strict failure behavior.
`tests/test_client_limits.py`	Adds test asserting queue-wait includes bounded-scheduler delay.
`docs/guide.md`	Documents using `max_parallel_requests` for large batches; updates examples.
`README.md`	Documents bounded batch behavior; updates examples to set `max_parallel_requests`.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

fcogidi added 4 commits April 17, 2026 13:51

feat: extract generation helpers; add bounded batch execution

0011f1d

docs: update README and guide to include max_parallel_requests for LM…

0c64e60

…Client

chore: update infermesh package details and dependencies to version 0…

8ce81a2

….2.0

Merge branch 'main' into feat/bounded_runner

5d6e686

fcogidi requested a review from Copilot April 17, 2026 18:41

Copilot started reviewing on behalf of fcogidi April 17, 2026 18:41 View session

fcogidi marked this pull request as ready for review April 17, 2026 18:42

Copilot AI reviewed Apr 17, 2026

View reviewed changes

Comment thread src/infermesh/_generation.py

fix: validate max_parallel_requests at client construction

11d7e4b

fcogidi merged commit 2354908 into main Apr 17, 2026
8 checks passed

fcogidi deleted the feat/bounded_runner branch April 17, 2026 18:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add bounded generation batch scheduling#2

Add bounded generation batch scheduling#2
fcogidi merged 5 commits intomainfrom
feat/bounded_runner

fcogidi commented Apr 17, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

fcogidi commented Apr 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Testing

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

fcogidi commented Apr 17, 2026 •

edited

Loading