Revert PR #580 streaming workaround (CCCL #1422 resolved) by PointKernel · Pull Request #810 · NVIDIA/cuCollections

PointKernel · 2026-05-01T16:57:01Z

This PR reverts the #580 streaming workaround as large size type is now supported by CUB.

sleeepyjack · 2026-05-04T23:53:23Z

+    CUCO_CUDA_TRY(cub::DeviceSelect::If(nullptr,
+                                        temp_storage_bytes,
+                                        begin,
+                                        output_begin,
+                                        d_num_out,
+                                        this->capacity(),
+                                        is_filled,
+                                        stream.get()));
+
+    auto d_temp_storage = temp_allocator.allocate(temp_storage_bytes, stream);
+
+    CUCO_CUDA_TRY(cub::DeviceSelect::If(d_temp_storage,
+                                        temp_storage_bytes,
+                                        begin,
+                                        output_begin,
+                                        d_num_out,
+                                        this->capacity(),
+                                        is_filled,
+                                        stream.get()));


nit: Instead of the classical two-phase approach, we could use CUB's new single-phase API that takes an allocator: https://nvidia.github.io/cccl/unstable/cub/api_docs/device_wide.html#environment-api-single-phase

PointKernel requested a review from sleeepyjack as a code owner May 1, 2026 16:57

Revert PR NVIDIA#580 streaming workaround (CCCL #1422 resolved)

7e00710

PointKernel force-pushed the revert-pr-580-streaming branch from a4b8172 to 7e00710 Compare May 1, 2026 17:00

NVIDIA deleted a comment from copy-pr-bot Bot May 1, 2026

PointKernel added the type: improvement Improvement / enhancement to an existing function label May 1, 2026

sleeepyjack approved these changes May 4, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Revert PR #580 streaming workaround (CCCL #1422 resolved)#810

Revert PR #580 streaming workaround (CCCL #1422 resolved)#810
PointKernel wants to merge 1 commit intoNVIDIA:devfrom
PointKernel:revert-pr-580-streaming

PointKernel commented May 1, 2026

Uh oh!

sleeepyjack May 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

PointKernel commented May 1, 2026

Uh oh!

sleeepyjack May 4, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants