Add acdc-data CLI for BioIO metadata, convert, and restructure. by keejkrej · Pull Request #1108 · SchmollerLab/Cell_ACDC

keejkrej · 2026-05-25T11:58:59Z

Introduce a separate headless entry point with thin cli.py kernels so GUI-only modules like dataStruct stay untouched. Co-authored-by: Cursor <cursoragent@cursor.com>

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Adds a new headless acdc-data command-line interface for metadata extraction, raw microscopy conversion to ACDC structure, and restructuring pre-processed image files, alongside docs and unit tests.

Changes:

Introduce cellacdc/data_cli.py implementing metadata, convert, and restructure subcommands.
Register the acdc-data console entry point and wire the CLI into existing _run/kernel infrastructure.
Add getting-started documentation and unit tests for key parsing/state helpers.

Reviewed changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 14 comments.

Show a summary per file

File	Description
tests/test_data_cli.py	Adds pytest coverage for core helpers in the new data CLI module.
pyproject.toml	Registers `acdc-data` entry point.
cellacdc/docs/source/getting-started.rst	Documents new headless data commands and common usage patterns.
cellacdc/data_cli.py	Implements the new headless data CLI and conversion/restructure logic.
cellacdc/cli.py	Adds workflow kernels that invoke the new data CLI operations.
cellacdc/_run.py	Adds runner functions for `acdc-data` commands with logging setup.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

+    for p, filename in enumerate(raw_filenames):
+        pos_n = p + start_pos_n


+        if raw_data_struct == 0:
+            num_pos = state.size_s
+            num_pos_digits = len(str(num_pos))
+            for in_file_p in range(state.size_s):
+                _save_to_pos_folder(
+                    state, raw_src_path, exp_dst_path, filename,
+                    in_file_p, pos_n, num_pos_digits, raw_data_struct,
+                    overwrite_pos, create_new, lazy_load,
+                    logger_func=logger_func,
+                )
+        elif raw_data_struct == 1:
+            num_pos = len(raw_filenames)
+            num_pos_digits = len(str(num_pos))
+            _save_to_pos_folder(
+                state, raw_src_path, exp_dst_path, filename,
+                0, pos_n, num_pos_digits, raw_data_struct,
+                overwrite_pos, create_new, lazy_load,


+            images_path = os.path.join(dst_path, f'Position_{p + 1}', 'Images')
+            os.makedirs(images_path, exist_ok=True)
+
+            video_data = None


+            for frame_i, file_info in enumerate(sorted_files_list):
+                file, _ = file_info
+                src_img_file_path = os.path.join(src_path, file)
+                try:
+                    img = load.imread(src_img_file_path)
+                    if video_data is None:
+                        video_data = np.zeros((size_t, *img.shape), dtype=img.dtype)
+                    video_data[frame_i] = img
+                    frame_number_match = re.findall(frame_name_pattern, file)[0][1]
+                    frame_numbers.append(int(frame_number_match))
+                except Exception:
+                    continue


+        for frame_number, img in zip(frame_numbers, video_data):
+            frame_i = frame_number - min_frame_number
+            padded_video_data[frame_i] = img


+        if raw_data_struct == 0:
+            num_pos = state.size_s
+            num_pos_digits = len(str(num_pos))
+            for in_file_p in range(state.size_s):
+                _save_to_pos_folder(


+        elif raw_data_struct == 1:
+            num_pos = len(raw_filenames)
+            num_pos_digits = len(str(num_pos))
+            _save_to_pos_folder(


+    )
+
+
+def restructure_multi_timepoint(


+            src_segm_paths = [''] * size_t
+            frame_numbers = []
+            size_z = 1
+            for frame_i, file_info in enumerate(sorted_files_list):


+        frame_numbers = img_data_info['frameNumbers']
+        padded_shape = (max_size_t, *video_data.shape[1:])
+        padded_video_data = np.zeros(padded_shape, dtype=video_data.dtype)
+        for frame_number, img in zip(frame_numbers, video_data):


Add acdc-data CLI for BioIO metadata, convert, and restructure.

dd26870

Introduce a separate headless entry point with thin cli.py kernels so GUI-only modules like dataStruct stay untouched. Co-authored-by: Cursor <cursoragent@cursor.com>

Copilot AI review requested due to automatic review settings May 25, 2026 11:59

Copilot AI reviewed May 25, 2026

View reviewed changes

keejkrej closed this May 25, 2026

keejkrej deleted the cursor/99204775 branch May 25, 2026 14:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add acdc-data CLI for BioIO metadata, convert, and restructure.#1108

Add acdc-data CLI for BioIO metadata, convert, and restructure.#1108
keejkrej wants to merge 1 commit into
SchmollerLab:mainfrom
keejkrej:cursor/99204775

keejkrej commented May 25, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		for p, filename in enumerate(raw_filenames):
		pos_n = p + start_pos_n

		)


		def restructure_multi_timepoint(

Conversation

keejkrej commented May 25, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants