[data][llm] Promote `max_tasks_in_flight_per_actor` to a first-class config field and adjust defaults by jeffreywang-anyscale · Pull Request #63214 · ray-project/ray

jeffreywang-anyscale · 2026-05-08T00:46:18Z

Why

Ray Data LLM hardcoded DEFAULT_MAX_TASKS_IN_FLIGHT = 16 instead of using Ray Data's actor-pool fallback, which (a) didn't track max_concurrent_batches when users tuned it and (b) bypassed both DataContext.max_tasks_in_flight_per_actor and the env-var override of the factor.

What changes?

New top-level field OfflineProcessorConfig.max_tasks_in_flight_per_actor: Optional[int] = None.
Removed the DEFAULT_MAX_TASKS_IN_FLIGHT = 16 constant; engine processors pass config.max_tasks_in_flight_per_actor straight through to ActorPoolStrategy (including None).
Default in-flight cap: hardcoded 16 → max_concurrent_batches × FACTOR, resolved by Ray Data's actor pool.
DataContext.max_tasks_in_flight_per_actor and RAY_DATA_ACTOR_DEFAULT_MAX_TASKS_IN_FLIGHT_TO_MAX_CONCURRENCY_FACTOR are now honored (previously bypassed by the explicit 16).
experimental["max_tasks_in_flight_per_actor"] is deprecated: migrated to the new field at construction with a logger.warning. Top-level field wins if both are set.

Original API

OfflineProcessorConfig(
    ...,
    experimental={"max_tasks_in_flight_per_actor": 32},  # only knob
)

New API

OfflineProcessorConfig(
    ...,
    max_concurrent_batches=8,           # unchanged
    max_tasks_in_flight_per_actor=32,   # new top-level field, Optional[int]
)

Behavior changes

Users who set RAY_DATA_ACTOR_DEFAULT_MAX_TASKS_IN_FLIGHT_TO_MAX_CONCURRENCY_FACTOR now have their override honored by Ray Data LLM (previously ignored).
Setting via experimental still works but logs a deprecation warning. The top-level field overrides experimental if both are set.

`max_concurrent_batches`	`max_tasks_in_flight_per_actor`	Ray actor `max_concurrency`	Effective in-flight cap
unset (default 8)	unset (None)	8	16
16	unset (None)	16	32
unset (default 8)	50	8	50
16	50	16	50

Related issues

Link related issues: "Fixes #1234", "Closes #1234", or "Related to #1234".

Additional information

Optional: Add implementation details, API changes, usage examples, screenshots, etc.

Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>

gemini-code-assist

Code Review

This pull request promotes max_tasks_in_flight_per_actor from an experimental configuration to a top-level field in OfflineProcessorConfig and its subclasses. It introduces deprecation warnings for the experimental key and implements a resolution strategy that defaults to a calculated value based on max_concurrent_batches. Feedback identifies a potential type mismatch where a float could be assigned to an integer field when bypassing Pydantic validation, suggesting an explicit integer cast to ensure compatibility with Ray Data's actor pool.

gemini-code-assist · 2026-05-08T00:59:31Z

+                * DEFAULT_ACTOR_MAX_TASKS_IN_FLIGHT_TO_MAX_CONCURRENCY_FACTOR,
+            )
+            # Bypass `validate_assignment=True` so we don't re-fire the deprecation warning
+            object.__setattr__(self, "max_tasks_in_flight_per_actor", resolved)


The resolved value for max_tasks_in_flight_per_actor can be a float, particularly when calculated using DEFAULT_ACTOR_MAX_TASKS_IN_FLIGHT_TO_MAX_CONCURRENCY_FACTOR, which is defined as a float. The max_tasks_in_flight_per_actor field is typed as Optional[int], but using object.__setattr__ bypasses Pydantic's type coercion. This could result in a float value being passed to ray.data.ActorPoolStrategy, which expects an integer and may lead to unexpected behavior or a runtime error.

To ensure an integer is always assigned, the resolved value should be explicitly cast to int. This would also align with the original logic in Ray Data's actor pool, which performs this integer conversion.

Suggested change

object.__setattr__(self, "max_tasks_in_flight_per_actor", resolved)

object.__setattr__(self, "max_tasks_in_flight_per_actor", int(resolved))

Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>

Aydin-ab

making a bit more explicit in the doc that it's 2 * max_concurrent_batches

kouroshHakha

The approach is sound — using a Pydantic mode="after" validator to eagerly resolve the None sentinel is clean, and the resolution order (explicit > experimental > formula) is implemented correctly. The behavioral no-op for default users (8×2=16) is a good property.

Note

This review was co-written with AI assistance (Claude Code).

Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>

…config field and adjust defaults (ray-project#63214) Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>

…config field and adjust defaults (ray-project#63214) Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com> Signed-off-by: anindyam1969 <amukherjee@kinetica.com>

…config field and adjust defaults (ray-project#63214) Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>

…config field and adjust defaults (ray-project#63214) Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com> Signed-off-by: Alexandr Plashchinsky <alexandr.plashchinsky@alexandrplashchinsky-H765G66H9V.local>

[data][llm] Promote to a first-class config field and adjust defaults

806378f

Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>

jeffreywang-anyscale requested a review from a team as a code owner May 8, 2026 00:46

jeffreywang-anyscale added the go add ONLY when ready to merge, run all tests label May 8, 2026

jeffreywang-anyscale changed the title ~~[data][llm] Promote to a first-class config field and adjust defaults~~ [data][llm] Promote max_tasks_in_flight_per_actor to a first-class config field and adjust defaults May 8, 2026

gemini-code-assist Bot reviewed May 8, 2026

View reviewed changes

Claude feedback

606cc2d

Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>

ray-gardener Bot added the data Ray Data-related issues label May 8, 2026

Aydin-ab approved these changes May 8, 2026

View reviewed changes

Comment thread python/ray/data/llm.py Outdated

Comment thread python/ray/data/llm.py Outdated

Comment thread python/ray/llm/_internal/batch/processor/base.py Outdated

kouroshHakha reviewed May 8, 2026

View reviewed changes

Comment thread python/ray/llm/_internal/batch/processor/base.py

CR feedback

1e909c0

Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>

kouroshHakha enabled auto-merge (squash) May 8, 2026 21:09

kouroshHakha approved these changes May 8, 2026

View reviewed changes

kouroshHakha merged commit 75f55e3 into master May 8, 2026
7 checks passed

kouroshHakha deleted the data-llm-max-tasks-in-flight branch May 8, 2026 21:42

chillCode404 pushed a commit to chillCode404/ray-contrib that referenced this pull request May 9, 2026

[data][llm] Promote max_tasks_in_flight_per_actor to a first-class …

b570d38

…config field and adjust defaults (ray-project#63214) Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>

dancingactor pushed a commit to dancingactor/ray that referenced this pull request May 13, 2026

[data][llm] Promote max_tasks_in_flight_per_actor to a first-class …

2bdd0be

…config field and adjust defaults (ray-project#63214) Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>

Lucas61000 pushed a commit to Lucas61000/ray that referenced this pull request May 15, 2026

[data][llm] Promote max_tasks_in_flight_per_actor to a first-class …

afee413

…config field and adjust defaults (ray-project#63214) Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[data][llm] Promote `max_tasks_in_flight_per_actor` to a first-class config field and adjust defaults#63214

[data][llm] Promote `max_tasks_in_flight_per_actor` to a first-class config field and adjust defaults#63214
kouroshHakha merged 3 commits into
masterfrom
data-llm-max-tasks-in-flight

jeffreywang-anyscale commented May 8, 2026 •

edited

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 8, 2026

Uh oh!

Aydin-ab left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kouroshHakha left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	object.__setattr__(self, "max_tasks_in_flight_per_actor", resolved)
	object.__setattr__(self, "max_tasks_in_flight_per_actor", int(resolved))

Conversation

jeffreywang-anyscale commented May 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why

What changes?

Original API

New API

Behavior changes

Related issues

Additional information

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 8, 2026

Choose a reason for hiding this comment

Uh oh!

Aydin-ab left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kouroshHakha left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jeffreywang-anyscale commented May 8, 2026 •

edited

Loading