Skip to content

Improve dataset splitting#1589

Open
arteymix wants to merge 4 commits intohotfix-1.32.7from
feature-improve-dataset-splitting
Open

Improve dataset splitting#1589
arteymix wants to merge 4 commits intohotfix-1.32.7from
feature-improve-dataset-splitting

Conversation

@arteymix
Copy link
Copy Markdown
Collaborator

Make splitting the experiment transactional by isolating the relevant operations in a helper service.

Use generic BulkDataSlicerUtils and SingleCellSlicerUtils for slicing samples in splits which handle all the possible fixed-size encoding cases and single-cell data slicing.

Add options to the splitExperiment CLI to skip post-processing and delete the original experiment automatically.

@arteymix arteymix added the enhancement Enhance the code or user experience label Jan 14, 2026
@arteymix arteymix self-assigned this Jan 14, 2026
@arteymix arteymix added this to the 1.32.6 milestone Jan 14, 2026
@arteymix
Copy link
Copy Markdown
Collaborator Author

The only missing part is to persist the single-cell stuff in the Persister.

@arteymix arteymix force-pushed the feature-improve-dataset-splitting branch from 9824fd1 to 1f0011a Compare January 14, 2026 19:13
@arteymix arteymix added the testing Feature or fix is being tested label Jan 14, 2026
@arteymix
Copy link
Copy Markdown
Collaborator Author

Involving @cyan864 since we'll need some examples to test this on.

@arteymix arteymix linked an issue Feb 11, 2026 that may be closed by this pull request
@arteymix arteymix force-pushed the feature-improve-dataset-splitting branch from 1f0011a to 2052ad8 Compare February 13, 2026 19:54
@arteymix arteymix marked this pull request as ready for review February 14, 2026 20:16
Make splitting the experiment transactional by isolating the relevant
operations in a helper service.

Use generic BulkDataSlicerUtils and SingleCellSlicerUtils for slicing
samples in splits which handle all the possible fixed-size encoding
cases and single-cell data slicing.

Add options to the splitExperiment CLI to skip post-processing and
delete the original experiment automatically.
@arteymix arteymix force-pushed the feature-improve-dataset-splitting branch from 2052ad8 to f26e61e Compare February 14, 2026 20:23
@arteymix arteymix modified the milestones: 1.32.6, 1.32.7 Feb 27, 2026
@arteymix arteymix changed the base branch from hotfix-1.32.6 to hotfix-1.32.7 February 27, 2026 04:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement Enhance the code or user experience testing Feature or fix is being tested

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Splitting an experiment should be annotated with @Transactional

2 participants