FIX, MAINT: Implement 'everything follows X' and namespace checks for KNN by david-cortes-intel · Pull Request #3127 · uxlfoundation/scikit-learn-intelex

david-cortes-intel · 2026-04-24T14:22:24Z

Description

This PR:

Implements the logic of 'everything follows X' for KNN classes.
Implements the logic where class predictions from classifiers follow the 'y' namespace (e.g. so that it can return strings when fitting on GPU).
Adds array API namespace and device checks for KNN methods that come after .fit() in order to throw informative Python exceptions instead of segfaults, the same way scikit-learn would do.
Fixes some issues with internal functions that were not working with array_api_strict.

Includes the changes from #3117 since they are also necessary for KNN classes.

Checklist:

Completeness and readability

I have commented my code, particularly in hard-to-understand areas.
Git commit message contains an appropriate signed-off-by string (see CONTRIBUTING.md for details).
I have resolved any merge conflicts that might occur with the base branch.

Testing

I have run it locally and tested the changes extensively.
All CI jobs are green or I have provided justification why they aren't.
I have extended testing suite if new functionality was introduced in this PR.

david-cortes-intel · 2026-04-24T14:22:37Z

/intelci: run

david-cortes-intel · 2026-04-24T14:22:48Z

/azp run Nightly

azure-pipelines · 2026-04-24T14:22:59Z

Azure Pipelines successfully started running 1 pipeline(s).

david-cortes-intel · 2026-04-24T15:19:46Z

CI failures in BasicStatistics are unrelated to the changes here and should be solved with this PR:
#3128

david-cortes-intel · 2026-04-24T15:26:14Z

/intelci: run

david-cortes-intel · 2026-04-24T15:26:21Z

/azp run Nightly

azure-pipelines · 2026-04-24T15:26:46Z

Azure Pipelines successfully started running 1 pipeline(s).

codecov · 2026-04-24T16:04:16Z

Codecov Report

❌ Patch coverage is 28.94737% with 54 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
sklearnex/neighbors/common.py	42.85%	12 Missing and 4 partials ⚠️
sklearnex/neighbors/knn_classification.py	23.80%	9 Missing and 7 partials ⚠️
sklearnex/neighbors/knn_regression.py	17.64%	8 Missing and 6 partials ⚠️
sklearnex/neighbors/knn_unsupervised.py	20.00%	5 Missing and 3 partials ⚠️

Flag	Coverage Δ
azure	`77.50% <28.94%> (-1.55%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Files with missing lines	Coverage Δ
sklearnex/neighbors/_lof.py	`98.01% <ø> (-1.99%)`	⬇️
sklearnex/neighbors/knn_unsupervised.py	`86.17% <20.00%> (-8.02%)`	⬇️
sklearnex/neighbors/knn_regression.py	`79.36% <17.64%> (-9.93%)`	⬇️
sklearnex/neighbors/common.py	`85.83% <42.85%> (-4.53%)`	⬇️
sklearnex/neighbors/knn_classification.py	`84.76% <23.80%> (-10.05%)`	⬇️

... and 37 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

david-cortes-intel · 2026-04-30T10:02:03Z

/azp run Nightly

azure-pipelines · 2026-04-30T10:02:14Z

Azure Pipelines successfully started running 1 pipeline(s).

david-cortes-intel · 2026-04-30T10:02:17Z

/intelci: run

avolkov-intel · 2026-05-11T09:33:58Z

            device = getattr(y_train, "device", None)
-            neigh_dist = xp.asarray(neigh_dist, device=device)
-            neigh_ind = xp.asarray(neigh_ind, device=device)
+            if not _is_numpy_namespace(xp):


What is the logic of this if-statement? As I understand it previously it was that neigh_dist and neigh_ind originally have numpy type and we only need to convert them if y is not a numpy. Is it correct? If yes do we need the same logic after the array-api update?

This is how it was before if you look at the changes. I guess the purpose is to have them work with the other arrays.

avolkov-intel · 2026-05-11T09:43:03Z

            device = getattr(y_train, "device", None)
-            neigh_dist = xp.asarray(neigh_dist, device=device)
-            neigh_ind = xp.asarray(neigh_ind, device=device)
+            if not _is_numpy_namespace(xp):


I think we also need some explanation about why do we need this numpy check

It has a different codepath for numpy with operations that are not supported by array API.

avolkov-intel · 2026-05-11T09:52:19Z

Left a few comments mostly related to clarification about the current logic. Also PR needs to be rebased to fix some CI failures

david-cortes-intel added the Array API label Apr 24, 2026

namespace checks and movements for KNN

1507c24

david-cortes-intel force-pushed the knn_y branch from efd866f to 1507c24 Compare April 30, 2026 09:50

more tests

7b9077c

david-cortes-intel marked this pull request as ready for review April 30, 2026 10:14

david-cortes-intel requested review from Vika-F, ahuber21, ethanglaser, icfaust and yuejiaointel as code owners April 30, 2026 10:14

avolkov-intel reviewed May 11, 2026

View reviewed changes

Comment thread sklearnex/neighbors/_lof.py Outdated

avolkov-intel reviewed May 11, 2026

View reviewed changes

Comment thread sklearnex/neighbors/knn_classification.py

remove unused import

bb35944

avolkov-intel approved these changes May 11, 2026

View reviewed changes

david-cortes-intel merged commit 895535d into uxlfoundation:main May 11, 2026
24 of 31 checks passed

Conversation

david-cortes-intel commented Apr 24, 2026

Description

Uh oh!

david-cortes-intel commented Apr 24, 2026

Uh oh!

david-cortes-intel commented Apr 24, 2026

Uh oh!

azure-pipelines Bot commented Apr 24, 2026

Uh oh!

david-cortes-intel commented Apr 24, 2026

Uh oh!

david-cortes-intel commented Apr 24, 2026

Uh oh!

david-cortes-intel commented Apr 24, 2026

Uh oh!

azure-pipelines Bot commented Apr 24, 2026

Uh oh!

codecov Bot commented Apr 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

david-cortes-intel commented Apr 30, 2026

Uh oh!

azure-pipelines Bot commented Apr 30, 2026

Uh oh!

david-cortes-intel commented Apr 30, 2026

Uh oh!

avolkov-intel May 11, 2026

Choose a reason for hiding this comment

Uh oh!

david-cortes-intel May 11, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

avolkov-intel May 11, 2026

Choose a reason for hiding this comment

Uh oh!

david-cortes-intel May 11, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

avolkov-intel commented May 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov Bot commented Apr 24, 2026 •

edited

Loading