Skip to content

ci: Add scikit-learn to downstream tests#3609

Merged
FBruzzesi merged 9 commits into
mainfrom
ci/sklearn
Jun 28, 2026
Merged

ci: Add scikit-learn to downstream tests#3609
FBruzzesi merged 9 commits into
mainfrom
ci/sklearn

Conversation

@FBruzzesi

@FBruzzesi FBruzzesi commented May 9, 2026

Copy link
Copy Markdown
Member

Description

Opening as draft and in downstram-tests (not slow version) to see how long it does take. From a local attempt it's still below 5 mins

What type of PR is this? (check all applicable)

  • 💾 Refactor
  • ✨ Feature
  • 🐛 Bug Fix
  • 🔧 Optimization
  • 📝 Documentation
  • ✅ Test
  • 🐳 Other

Related issues

@FBruzzesi FBruzzesi added the ci label May 9, 2026
@MarcoGorelli

Copy link
Copy Markdown
Member

10 mins in ci, that's fine with me!

@FBruzzesi

Copy link
Copy Markdown
Member Author

Thanks Marco, I will add a comment in scikit-learn PR asking if there is a better way to do this, and/or if we can use a pytest marker

@dangotbanned

dangotbanned commented Jun 8, 2026

Copy link
Copy Markdown
Member

Might it make sense to reintroduce 1 this workflow for scikit-learn?

To me it looks like pointblank (7m 21s) and scikit-learn (11m 16s) are pretty big outliers here.
Conceptually, it is odd to group them with Test Downstream Libraries - Fast 😄

I wonder if we could speed up some runs by doing things like (#2910)?
For example:

  • we don't need to run the workflow if a PR only touches docs/**
  • if they test a specific set of backends, could we look for changes in those packages to trigger?

Footnotes

  1. I'm not sure why vegafusion was removed, but assume there was a reason?

@FBruzzesi

Copy link
Copy Markdown
Member Author

Thanks for taking a look @dangotbanned - According to scikit-learn/scikit-learn#31127 (comment) we might run a subset of the tests.

In general I am a bit concerned/skeptical in running hardcoded tests, I would prefer having pytest markers. Happy to hear your thoughts

@dangotbanned

dangotbanned commented Jun 8, 2026

Copy link
Copy Markdown
Member

In general I am a bit concerned/skeptical in running hardcoded tests, I would prefer having pytest markers. Happy to hear your thoughts

I'm not sure we're talking about the same thing.
I was interested in using filters in the workflow(s).

Is a set of globs considered hardcoding?
Polars documents it here for the same motivation https://docs.pola.rs/development/contributing/ci/#design

Only run checks when they need to be run. A change to the Rust code does not warrant a linting check of the Python code, for example.

@FBruzzesi

Copy link
Copy Markdown
Member Author

I'm not sure we're talking about the same thing.

We are not 🤣

I was interested in using filters in the workflow(s).

We can have both approaches! Filters in workflows will allow to skip workflows given specific paths. What I mentioned will enable much faster runtime when it runs

@dangotbanned

Copy link
Copy Markdown
Member

We can have both approaches!

Perfect!
Yeah both works for me

@FBruzzesi

Copy link
Copy Markdown
Member Author

I know it's a partial solution, but it might be worth merging to start running (fast) tests against scikit-learn.

@FBruzzesi FBruzzesi changed the title ci: Add scikit-learn in downstream tests ci: Add scikit-learn in downstream tests Jun 21, 2026
@dangotbanned dangotbanned added the high priority Your PR will be reviewed very quickly if you address this label Jun 27, 2026
@dangotbanned dangotbanned changed the title ci: Add scikit-learn in downstream tests ci: Add scikit-learn to downstream tests Jun 27, 2026

@dangotbanned dangotbanned left a comment

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @FBruzzesi, just one comment from me

squeeze cheeks narwhal

Comment on lines +707 to +708
scikit-learn:
if: github.head_ref != 'bump-version'

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we want to use anything like #3680 here?

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The settings introduces there are all at workflow level, nothing job specific. We should be fine for now

@FBruzzesi FBruzzesi merged commit 1695c43 into main Jun 28, 2026
24 of 25 checks passed
@FBruzzesi FBruzzesi deleted the ci/sklearn branch June 28, 2026 08:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ci high priority Your PR will be reviewed very quickly if you address this

Projects

None yet

Development

Successfully merging this pull request may close these issues.

CI: test scikit-learn downstream

3 participants