SortingKernels: use int64_t type for num_tile by BartoszKokoszko · Pull Request #3555 · intel/torch-xpu-ops

BartoszKokoszko · 2026-05-05T09:35:46Z

To avoid num_tile overflow, it should be declared as int64_t.

Copilot

Pull request overview

Updates the XPU SYCL sorting kernel to compute num_tiles in 64-bit to avoid overflow during the ceil-division calculation for large num_elements.

Changes:

Switches num_tiles in segmented_radix_sort_pairs_kernel from int to int64_t.
Performs the num_elements + TILE_PROCESSING_LENGTH - 1 arithmetic in 64-bit via static_cast<int64_t>(num_elements).
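The ceil-division described in the overview can be sketched as below. This is a minimal standalone illustration, not the code from SortingKernels.h: the helper names `num_tiles_64` and `sum_overflows_int` are hypothetical, and the value 4096 for TILE_PROCESSING_LENGTH is an assumption chosen only for the example.

```cpp
#include <cstdint>
#include <limits>

// Assumed tile length for illustration; the real value depends on the kernel configuration.
constexpr int TILE_PROCESSING_LENGTH = 4096;

// Ceil-division carried out in 64-bit: the cast widens num_elements first,
// so num_elements + TILE_PROCESSING_LENGTH - 1 cannot wrap for any int input.
constexpr int64_t num_tiles_64(int num_elements) {
  return (static_cast<int64_t>(num_elements) + TILE_PROCESSING_LENGTH - 1) /
      TILE_PROCESSING_LENGTH;
}

// True when the same intermediate sum would not fit in a 32-bit signed int,
// i.e. when evaluating it in plain int arithmetic would be undefined behaviour.
constexpr bool sum_overflows_int(int num_elements) {
  return static_cast<int64_t>(num_elements) + TILE_PROCESSING_LENGTH - 1 >
      std::numeric_limits<int>::max();
}
```

With these assumptions, an input of `std::numeric_limits<int>::max() - 10` makes the intermediate sum exceed the int range, while the 64-bit version still yields a well-defined tile count.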

Skill files read:

.github/skills/xpu-ops-pr-review/SKILL.md
.github/skills/xpu-ops-pr-review/references/torch-xpu-ops-review-notes.md
.github/skills/xpu-ops-pr-review/references/review-checklist.md
.github/skills/xpu-ops-pr-review/references/bc-guidelines.md

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 1 comment.


Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 2 comments.

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated no new comments.

Comments suppressed due to low confidence (2)

src/ATen/native/xpu/sycl/SortingKernels.h:194

sycl_kernel_submit(num_segments * num_tiles * GROUP_SIZE, ...) multiplies int values first, so the product can overflow in 32-bit signed arithmetic before being passed to the int64_t overload. Please promote to int64_t (and ideally check for overflow) when computing the global range.

This issue also appears on line 328 of the same file.

  auto caller = SegmentedRadixSortPairsUpsweepFunctor<method_t, key_t, value_t>(
      keys_in, counts, num_elements, begin_bit, end_bit);
  sycl_kernel_submit(
      num_segments * num_tiles * GROUP_SIZE,
      GROUP_SIZE,
      at::xpu::getCurrentSYCLQueue(),
      caller);

src/ATen/native/xpu/sycl/SortingKernels.h:342

sycl_kernel_submit(num_segments * num_tiles * GROUP_SIZE, ...) is still computed in int arithmetic here, which can overflow before conversion to the int64_t parameter type. Compute the global range using int64_t (and consider an overflow check) to avoid launching an incorrect ND-range.

  auto caller =
      SegmentedRadixSortPairsDownsweepFunctor<method_t, key_t, value_t>(
          keys_in,
          keys_out,
          values_in,
          values_out,
          num_elements,
          begin_bit,
          end_bit,
          count);
  sycl_kernel_submit(
      num_segments * num_tiles * GROUP_SIZE,
      GROUP_SIZE,
      at::xpu::getCurrentSYCLQueue(),
      caller);
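The two suppressed comments above both ask for the global range to be computed in 64-bit, ideally with an overflow check. A minimal sketch of that idea follows, assuming C++14 or later. The helpers `checked_mul` and `global_range` and the GROUP_SIZE value of 256 are hypothetical and do not come from the PR; standard exceptions are used here only to keep the example self-contained.

```cpp
#include <cstdint>
#include <limits>
#include <stdexcept>

// Assumed work-group size for illustration only.
constexpr int64_t GROUP_SIZE = 256;

// Multiplies two non-negative 64-bit values and throws instead of wrapping
// when the product would exceed the int64_t range.
constexpr int64_t checked_mul(int64_t a, int64_t b) {
  if (a < 0 || b < 0) {
    throw std::invalid_argument("operands must be non-negative");
  }
  if (a != 0 && b > std::numeric_limits<int64_t>::max() / a) {
    throw std::overflow_error("global range does not fit in int64_t");
  }
  return a * b;
}

// Widens num_segments before the first multiplication, so no part of
// num_segments * num_tiles * GROUP_SIZE is evaluated in 32-bit arithmetic.
constexpr int64_t global_range(int num_segments, int64_t num_tiles) {
  return checked_mul(
      checked_mul(static_cast<int64_t>(num_segments), num_tiles), GROUP_SIZE);
}
```

The resulting value could then be passed as the first argument of a launch call that accepts int64_t. With the assumed GROUP_SIZE, 1024 segments of 16384 tiles already give 4294967296 work-items, which is beyond the 32-bit signed range.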


BBBela

Looks good to me.
Thank you! 😉

pawel-olejniczak

LGTM

CuiYifeng

Do we have any test cases for such overflow? The remaining part of this PR LGTM.

BartoszKokoszko · 2026-05-29T15:28:56Z

@CuiYifeng Not directly. It was taken from #3426.

CuiYifeng · 2026-06-01T08:23:58Z

Please add a test case in torch-xpu-ops, modeled on test_topk_large_k in test/test_sort_and_select.py.

SortingKernels: use int64_t type for num_tile

02f3081

In order to avoid num_tile overflow it should be declared as int64_t type.

Copilot AI review requested due to automatic review settings May 5, 2026 09:35

Copilot started reviewing on behalf of BartoszKokoszko May 5, 2026 09:36

Copilot AI reviewed May 5, 2026

Comment thread src/ATen/native/xpu/sycl/SortingKernels.h Outdated

BartoszKokoszko added 2 commits May 5, 2026 10:54

fix review

65ec758

Merge branch 'main' into dev/bkokoszx/fix-outofmemory-error

1b968fe

Copilot AI review requested due to automatic review settings May 5, 2026 10:56

Copilot AI reviewed May 5, 2026

Comment thread src/ATen/native/xpu/sycl/SortingKernels.h Outdated

BartoszKokoszko added 2 commits May 5, 2026 11:49

address review comments

e87a74c

Merge branch 'main' into dev/bkokoszx/fix-outofmemory-error

52c81c7

Copilot AI review requested due to automatic review settings May 6, 2026 08:19

Copilot started reviewing on behalf of BartoszKokoszko May 6, 2026 08:22

Copilot AI reviewed May 6, 2026

Comment thread src/ATen/native/xpu/sycl/SortingKernels.h Outdated

BartoszKokoszko added 2 commits May 6, 2026 09:38

cast to size_t before allocate call

0f36a2e

Merge remote-tracking branch 'origin/main' into dev/bkokoszx/fix-outofmemory-error

488b40b

Copilot AI review requested due to automatic review settings May 6, 2026 09:39

Copilot started reviewing on behalf of BartoszKokoszko May 6, 2026 09:46

Copilot AI reviewed May 6, 2026

Comment thread src/ATen/native/xpu/sycl/SortingKernels.h

Comment thread src/ATen/native/xpu/sycl/SortingKernels.h Outdated

BartoszKokoszko added 2 commits May 7, 2026 09:17

Merge branch 'main' into dev/bkokoszx/fix-outofmemory-error

6839470

Merge branch 'main' into dev/bkokoszx/fix-outofmemory-error

2dd4195

Copilot AI review requested due to automatic review settings May 11, 2026 07:51

Copilot started reviewing on behalf of BartoszKokoszko May 11, 2026 07:51

Copilot AI reviewed May 11, 2026

Comment thread src/ATen/native/xpu/sycl/SortingKernels.h Outdated

Comment thread src/ATen/native/xpu/sycl/SortingKernels.h

BartoszKokoszko added 2 commits May 12, 2026 10:22

Merge branch 'main' into dev/bkokoszx/fix-outofmemory-error

197cff6

Merge branch 'main' into dev/bkokoszx/fix-outofmemory-error

9103624

Copilot AI review requested due to automatic review settings May 13, 2026 07:38

Copilot started reviewing on behalf of BartoszKokoszko May 13, 2026 07:39

Copilot AI reviewed May 13, 2026

Comment thread src/ATen/native/xpu/sycl/SortingKernels.h Outdated

Comment thread src/ATen/native/xpu/sycl/SortingKernels.h

BartoszKokoszko added 2 commits May 13, 2026 11:12

fix num_tiles logic also in upsweep and downsweep kernels

01f3d44

Merge branch 'main' into dev/bkokoszx/fix-outofmemory-error

2580046

Copilot AI review requested due to automatic review settings May 13, 2026 13:02

Copilot started reviewing on behalf of BartoszKokoszko May 13, 2026 13:02

Copilot AI reviewed May 13, 2026

BartoszKokoszko added 2 commits May 14, 2026 08:47

Merge branch 'main' into dev/bkokoszx/fix-outofmemory-error

791452a

Merge branch 'main' into dev/bkokoszx/fix-outofmemory-error

a552dde

Copilot AI review requested due to automatic review settings May 19, 2026 12:31

Copilot AI reviewed May 19, 2026

BBBela reviewed May 20, 2026

Comment thread src/ATen/native/xpu/sycl/SortingKernels.h Outdated

Comment thread src/ATen/native/xpu/sycl/SortingKernels.h Outdated

Comment thread src/ATen/native/xpu/sycl/SortingKernels.h Outdated

Comment thread src/ATen/native/xpu/sycl/SortingKernels.h Outdated

BartoszKokoszko added 4 commits May 20, 2026 13:53

review: use ceil_dev

b1bca17

review: pass num_tiles to functors without recalculating it

8a29dca

review: checked_num_tiles overflow

087352a

Merge branch 'main' into dev/bkokoszx/fix-outofmemory-error

8e125b3

BBBela reviewed May 21, 2026

Comment thread src/ATen/native/xpu/sycl/SortingKernels.h Outdated

address review comments

30a13cf

BBBela approved these changes May 21, 2026

BartoszKokoszko added 2 commits May 22, 2026 16:59

Merge branch 'main' into dev/bkokoszx/fix-outofmemory-error

79bd3d5

Merge branch 'main' into dev/bkokoszx/fix-outofmemory-error

95c9f4d

pponikox approved these changes May 25, 2026

pawel-olejniczak reviewed May 25, 2026

Merge branch 'main' into dev/bkokoszx/fix-outofmemory-error

8b7642e

BartoszKokoszko requested a review from CuiYifeng May 26, 2026 15:14

Merge branch 'main' into dev/bkokoszx/fix-outofmemory-error

5a985da

Silv3S approved these changes May 27, 2026

Merge branch 'main' into dev/bkokoszx/fix-outofmemory-error

d885ccf

CuiYifeng reviewed May 28, 2026

Comment thread src/ATen/native/xpu/sycl/SortingKernels.h Outdated

BartoszKokoszko added 2 commits May 29, 2026 15:26

review: remove checked_num_tiles()

6ecce37

Merge branch 'main' into dev/bkokoszx/fix-outofmemory-error

1e2ba67

Merge branch 'main' into dev/bkokoszx/fix-outofmemory-error

a47b2e5

7 participants
