Skip to content

[Release / Train] Fix Train tutorials#63225

Merged
matthewdeng merged 2 commits into
ray-project:masterfrom
pseudo-rnd-thoughts:fix-ray-train-workloads
May 8, 2026
Merged

[Release / Train] Fix Train tutorials#63225
matthewdeng merged 2 commits into
ray-project:masterfrom
pseudo-rnd-thoughts:fix-ray-train-workloads

Conversation

@pseudo-rnd-thoughts
Copy link
Copy Markdown
Member

@pseudo-rnd-thoughts pseudo-rnd-thoughts commented May 8, 2026

Description

doc/source/train/tutorials were raising an error that ImportError: Dataset requires pyarrow >= 17.0.0, but 14.0.2 is installed. Reinstall with 'pip install -U "pyarrow"'.
Therefore, this PR fixes the tutorials by updating all pyarrows to 17.0.0 and at the same time removing unnecessary import statements

Signed-off-by: Mark Towers <mark@anyscale.com>
@pseudo-rnd-thoughts pseudo-rnd-thoughts requested a review from a team as a code owner May 8, 2026 09:43
@pseudo-rnd-thoughts pseudo-rnd-thoughts added train Ray Train Related Issue release-test release test labels May 8, 2026
Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request updates the pyarrow dependency to version 17.0.0 and performs extensive cleanup by removing unused imports across several Ray Train tutorial scripts. Feedback identifies a critical issue in 04d1_generative_cv_pattern.py where the removal of the Checkpoint import will result in a NameError during the training loop. Additionally, it is recommended to remove the datasets package from the installation list in 04c_time_series_workload_pattern.py as it is no longer utilized by the script.

Signed-off-by: Mark Towers <mark@anyscale.com>
Copy link
Copy Markdown
Contributor

@JasonLi1909 JasonLi1909 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks!

@matthewdeng matthewdeng enabled auto-merge (squash) May 8, 2026 21:39
@github-actions github-actions Bot added the go add ONLY when ready to merge, run all tests label May 8, 2026
@matthewdeng matthewdeng merged commit bcdf33e into ray-project:master May 8, 2026
8 checks passed
chillCode404 pushed a commit to chillCode404/ray-contrib that referenced this pull request May 9, 2026
## Description
`doc/source/train/tutorials` were raising an error that `ImportError:
Dataset requires pyarrow >= 17.0.0, but 14.0.2 is installed. Reinstall
with 'pip install -U "pyarrow"'.`
Therefore, this PR fixes the tutorials by updating all pyarrows to
17.0.0 and at the same time removing unnecessary import statements

---------

Signed-off-by: Mark Towers <mark@anyscale.com>
Co-authored-by: Mark Towers <mark@anyscale.com>
dancingactor pushed a commit to dancingactor/ray that referenced this pull request May 13, 2026
## Description
`doc/source/train/tutorials` were raising an error that `ImportError:
Dataset requires pyarrow >= 17.0.0, but 14.0.2 is installed. Reinstall
with 'pip install -U "pyarrow"'.`
Therefore, this PR fixes the tutorials by updating all pyarrows to
17.0.0 and at the same time removing unnecessary import statements

---------

Signed-off-by: Mark Towers <mark@anyscale.com>
Co-authored-by: Mark Towers <mark@anyscale.com>
am-kinetica pushed a commit to kineticadb/ray that referenced this pull request May 14, 2026
## Description
`doc/source/train/tutorials` were raising an error that `ImportError:
Dataset requires pyarrow >= 17.0.0, but 14.0.2 is installed. Reinstall
with 'pip install -U "pyarrow"'.`
Therefore, this PR fixes the tutorials by updating all pyarrows to
17.0.0 and at the same time removing unnecessary import statements

---------

Signed-off-by: Mark Towers <mark@anyscale.com>
Co-authored-by: Mark Towers <mark@anyscale.com>
Signed-off-by: anindyam1969 <amukherjee@kinetica.com>
Lucas61000 pushed a commit to Lucas61000/ray that referenced this pull request May 15, 2026
## Description
`doc/source/train/tutorials` were raising an error that `ImportError:
Dataset requires pyarrow >= 17.0.0, but 14.0.2 is installed. Reinstall
with 'pip install -U "pyarrow"'.`
Therefore, this PR fixes the tutorials by updating all pyarrows to
17.0.0 and at the same time removing unnecessary import statements

---------

Signed-off-by: Mark Towers <mark@anyscale.com>
Co-authored-by: Mark Towers <mark@anyscale.com>
alexandrplashchinsky pushed a commit to alexandrplashchinsky/ray-alex that referenced this pull request May 29, 2026
## Description
`doc/source/train/tutorials` were raising an error that `ImportError:
Dataset requires pyarrow >= 17.0.0, but 14.0.2 is installed. Reinstall
with 'pip install -U "pyarrow"'.`
Therefore, this PR fixes the tutorials by updating all pyarrows to
17.0.0 and at the same time removing unnecessary import statements

---------

Signed-off-by: Mark Towers <mark@anyscale.com>
Co-authored-by: Mark Towers <mark@anyscale.com>
Signed-off-by: Alexandr Plashchinsky <alexandr.plashchinsky@alexandrplashchinsky-H765G66H9V.local>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

go add ONLY when ready to merge, run all tests release-test release test train Ray Train Related Issue

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants