Add support for batched tasks. by bosilca · Pull Request #668 · ICLDisco/parsec

bosilca · 2024-09-10T03:51:56Z

The idea is the following:

- task incarnations (aka. BODY) can be marked with the "batch" property, allowing the runtime to provide the task with the entire list of ready tasks of the execution stream instead of just extracting the head.
- this list of ready tasks is in fact a ring, which the kernel can trim and divide into the tasks to be batched and the rest. The batch group is submitted for execution (user responsibility), and the rest of the tasks are added back to the stream pending list in the order in which they were provided in the ring. This mechanism also allows the user to reorder the tasks based on user-level criteria.
- the kernel also needs to provide a callback in the gpu_task complete_stage, so that the runtime can call the specialized function able to complete all batched tasks.
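The ring split described above can be pictured with a small standalone sketch. It does not use the PaRSEC list or GPU task types: `task_t`, `ring_push_back`, `ring_split_batch`, and the `size` criterion are all hypothetical names chosen for illustration, and the selection rule (same size as the head, up to `max_batch` tasks) is only one possible user-level criterion.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for a ready task; NOT the PaRSEC task type. */
typedef struct task_s {
    struct task_s *next, *prev;   /* ring links */
    int id;
    int size;                     /* illustrative user-level batching criterion */
} task_t;

/* Append t at the tail of ring (ring may be NULL); returns the ring head. */
static task_t *ring_push_back(task_t *ring, task_t *t)
{
    t->next = t->prev = t;        /* t starts as a one-element ring */
    if (NULL == ring) return t;
    t->prev = ring->prev;
    t->next = ring;
    ring->prev->next = t;
    ring->prev = t;
    return ring;
}

/* Split *ring in two: tasks with the same size as the head (at most
 * max_batch of them) are moved to the returned batch ring; all other
 * tasks stay in *ring, in the order in which they were provided. */
static task_t *ring_split_batch(task_t **ring, int max_batch)
{
    task_t *batch = NULL, *rest = NULL, *cur = *ring;
    int n = 1, taken = 0, want;

    assert(max_batch > 0);
    if (NULL == cur) return NULL;
    want = cur->size;

    /* Count the elements first, because the links change while we walk. */
    for (task_t *t = cur->next; t != cur; t = t->next) n++;

    for (int i = 0; i < n; i++) {
        task_t *next = cur->next;         /* save before relinking cur */
        if (cur->size == want && taken < max_batch) {
            batch = ring_push_back(batch, cur);
            taken++;
        } else {
            rest = ring_push_back(rest, cur);
        }
        cur = next;
    }
    *ring = rest;                          /* goes back to the pending list */
    return batch;                          /* submitted by the user kernel */
}
```

With five tasks of sizes 64, 32, 64, 64, 32 and `max_batch` set to 2, the batch ring holds tasks 0 and 2, and tasks 1, 3, 4 remain in the original ring in that order.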

abouteiller · 2025-02-26T19:33:00Z

    if( NULL != type_property) {
-        if (!strcasecmp(type_property->expr->jdf_var, "cuda")
+        if (!strncasecmp(type_property->expr->jdf_var, "cuda", 4)  /* for batched */

This looks like a leftover from a prior iteration of the patchset, which used type=cuda_batched instead of adding a new batched property.

I assume the expectation is that batched and non-batched CUDA bodies can coexist. Did you test that this works?
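To make the difference between the two comparisons concrete, here is a small sketch using the POSIX `strcasecmp` and `strncasecmp` functions. The helper names `is_cuda_exact` and `is_cuda_prefix` are hypothetical and do not appear in the patch; the point is only that a 4-character prefix comparison accepts any type string that begins with "cuda".

```c
#include <assert.h>
#include <stddef.h>
#include <strings.h>   /* strcasecmp, strncasecmp (POSIX) */

/* Exact, case-insensitive match: accepts "cuda" and "CUDA" only. */
static int is_cuda_exact(const char *type)
{
    assert(NULL != type);
    return 0 == strcasecmp(type, "cuda");
}

/* Case-insensitive match on the first 4 characters: also accepts
 * "cuda_batched" and any other string that starts with "cuda". */
static int is_cuda_prefix(const char *type)
{
    assert(NULL != type);
    return 0 == strncasecmp(type, "cuda", 4);
}
```

If the batched variant is now selected through a separate property, the prefix form is no longer needed and also matches unintended type names.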

The idea is the following:

- task incarnations (aka. BODY) can be marked with the "batch" property, allowing the runtime to provide the task with the entire list of ready tasks of the execution stream instead of just extracting the head.
- this list of ready tasks is in fact a ring, which can then be trimmed by the kernel and divided into the batch and the rest. The rest of the tasks will be left in the ring, while the batch group will be submitted for execution.
- the kernel also needs to provide a callback in the gpu_task complete_stage, so that the runtime can call the specialized function able to complete all batched tasks.

Signed-off-by: George Bosilca <gbosilca@nvidia.com>

Replace the CUDA-specific batch build switch with PARSEC_HAVE_DEV_CAPABILITY_BATCH so batching is a runtime capability shared by all supported device types. Export the new option through parsec_options and PaRSECConfig.

Add per-device MCA parameters to disable batching for CPU, recursive, CUDA, HIP, and Level Zero devices. Use shared helpers to sanitize batch chore types in DTD and to gate GPU task-ring batching on the selected device.

Teach PTG to accept batch=true for CPU/default bodies as well as typed device bodies, and add CPU batch examples for both PTG and DTD with ctest coverage for the enabled and CPU-disabled DTD paths.

Signed-off-by: George Bosilca <gbosilca@nvidia.com>
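One way to picture the per-device gating described in this commit message is the sketch below. Every name in it is an assumption made for illustration (`dev_kind_t`, `batch_config_t`, `CHORE_FLAG_BATCH`, `batch_enabled`, `sanitize_chore_flags`); none of them is taken from the PaRSEC sources, and the array of flags merely stands in for the per-device MCA parameters.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical device kinds, mirroring the list in the commit message. */
typedef enum {
    DEV_CPU, DEV_RECURSIVE, DEV_CUDA, DEV_HIP, DEV_LEVEL_ZERO, DEV_COUNT
} dev_kind_t;

/* Hypothetical flag marking a chore (task incarnation) as batch-capable. */
#define CHORE_FLAG_BATCH 0x1u

/* Stand-in for the per-device "disable batching" parameters. */
typedef struct {
    int batch_disabled[DEV_COUNT];   /* nonzero: batching is off for that device */
} batch_config_t;

/* Returns nonzero when batching is allowed on the given device kind. */
static int batch_enabled(const batch_config_t *cfg, dev_kind_t dev)
{
    assert(dev >= 0 && dev < DEV_COUNT);
    return !cfg->batch_disabled[dev];
}

/* Drop the batch flag from a chore when its device has batching disabled,
 * so the chore falls back to regular one-task-at-a-time execution. */
static uint32_t sanitize_chore_flags(const batch_config_t *cfg,
                                     dev_kind_t dev, uint32_t flags)
{
    if (!batch_enabled(cfg, dev))
        flags &= ~CHORE_FLAG_BATCH;
    return flags;
}
```

In this sketch a chore marked as batch-capable keeps the flag on devices where batching is enabled and loses it elsewhere, which keeps the decision in one shared helper rather than in each device module.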

bosilca requested a review from a team as a code owner September 10, 2024 03:51

bosilca force-pushed the topic/batched_tasks branch from 2c004b1 to fffc3ec Compare September 10, 2024 04:02

bosilca mentioned this pull request Sep 10, 2024

Add support for batched tasks and for CUDA-aware communications bosilca/parsec#4

Open

bosilca force-pushed the topic/batched_tasks branch from fffc3ec to 9998554 Compare September 10, 2024 05:33

abouteiller self-requested a review October 11, 2024 15:38

abouteiller reviewed Feb 26, 2025

bosilca added 3 commits May 9, 2026 02:24

bosilca force-pushed the topic/batched_tasks branch from 88bbbd3 to 41fa201 Compare May 9, 2026 06:32

bosilca requested a review from abouteiller May 9, 2026 06:42

bosilca wants to merge 3 commits into ICLDisco:master from bosilca:topic/batched_tasks
