Improved study and experiment handling, paired end data handling, multiple experiment handling by bedroesb · Pull Request #142 · elixir-europe/MARS

bedroesb · 2026-05-27T20:31:38Z

Moved part of the ENA-specific study metadata into assay comments, while keeping the core study title and description on the ISA Study itself. The adapter now reads study.title and study.description for the submitted ENA study title and description, while assay comments still carry ENA-specific fields such as STUDY_ABSTRACT, STUDY_TYPE, and new_study_type. Closes "Include extra metadata for target repositories" (#3).
Updated the ENA study adapter so assay comments are no longer the source of truth for everything: core descriptive metadata now comes from the ISA study, and only ENA-specific study fields are read from the assay level.
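To illustrate the split described above, here is a minimal Python sketch. It is not the adapter's actual code: the function names and the lowercase output keys are hypothetical, and it assumes ISA comments are objects with `name` and `value` fields.

```python
def comment_value(comments, name):
    """Return the value of the first ISA comment with the given name, or None."""
    for comment in comments:
        if comment.get("name") == name:
            return comment.get("value")
    return None


def build_ena_study_fields(study, assay):
    """Take title/description from the ISA study and ENA-only fields from assay comments."""
    fields = {
        # Core descriptive metadata comes from the ISA Study itself.
        "title": study.get("title"),
        "description": study.get("description"),
    }
    # ENA-specific fields are still looked up at the assay level.
    assay_comments = assay.get("comments", [])
    for name in ("STUDY_ABSTRACT", "STUDY_TYPE", "new_study_type"):
        value = comment_value(assay_comments, name)
        if value is not None:
            fields[name] = value
    return fields
```

Fields that are absent from the assay comments are simply left out of the result, so a caller can decide separately whether a missing value is an error.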
Removed the explicit ENA PROJECT_SET submission from our adapter flow and deleted the unused project adapter. We now submit STUDY_SET, EXPERIMENT_SET, and RUN_SET; any linked PRJEB... project is created by ENA/Webin rather than by MARS. Closes "ISA Investigation is being submitted as a ENA Study/Project" (#80).
Switched ENA experiment generation to read ENA-native process parameter names directly from the assay workflow, instead of relying on hardcoded platform values or custom parameter-name mappings. Closes "assays > processSequence > parameterValues are not being parsed by the ENA endpoint" (#73).
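The idea of reading parameters straight from the workflow can be sketched roughly as follows. This is an illustration only: `collect_parameters` and the `parameter_names_by_id` lookup are hypothetical, and the sketch assumes each `parameterValues` entry points at its protocol parameter through `category["@id"]` and holds either a plain value or an object with `annotationValue`.

```python
def value_of(raw):
    """Unwrap an ISA value that may be a plain scalar or an ontology annotation."""
    if isinstance(raw, dict):
        return raw.get("annotationValue")
    return raw


def collect_parameters(process, parameter_names_by_id):
    """Map parameter names to values for one process, without any renaming table."""
    params = {}
    for entry in process.get("parameterValues", []):
        category_id = entry.get("category", {}).get("@id")
        name = parameter_names_by_id.get(category_id)
        if name is not None:
            # The name is used as-is; no hardcoded platform value is substituted.
            params[name] = value_of(entry.get("value"))
    return params
```

Entries whose category cannot be resolved are skipped rather than guessed at.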
Updated experiment-to-sample linking so each ENA experiment resolves the BioSamples accession from the specific study sample its library derives from, rather than reusing one global sample accession.
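A minimal sketch of per-library sample resolution, under simplifying assumptions: derivation links are given as a plain `derived_from` dictionary (child id to parent id) and accessions as a `sample_accessions` dictionary. Both structures and the function name are hypothetical, not the adapter's data model.

```python
def resolve_sample_accession(library_id, derived_from, sample_accessions):
    """Walk derivation links upward from a library until a sample with an accession is found."""
    current = library_id
    seen = set()
    while current not in sample_accessions:
        # Stop on a dead end or a cycle instead of falling back to a global accession.
        if current in seen or current not in derived_from:
            raise LookupError(f"no BioSamples accession reachable from {library_id}")
        seen.add(current)
        current = derived_from[current]
    return sample_accessions[current]
```

Because the lookup starts from each library, two experiments built from different source/sample chains end up with different sample accessions.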
Preserved and clarified the original bottom-up experiment-building flow from data files to sequencing process to library to experiment.
Updated ENA run generation so one sequencing process produces one ENA RUN, allowing paired-end runs to contain both FASTQ files in a single DATA_BLOCK while single-end runs still produce one file per run.
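The one-process-one-run rule can be illustrated with a short sketch. The dictionary shapes (`name`, `outputs`, `alias`, `data_block`) and the function name are assumptions made for this example, not the adapter's actual types.

```python
def build_runs(sequencing_processes):
    """Create one run per sequencing process, grouping all of its output files together."""
    runs = []
    for process in sequencing_processes:
        files = [output["name"] for output in process.get("outputs", [])]
        if not files:
            continue  # a process without data files yields no run
        # Paired-end: two files share one data block. Single-end: one file.
        runs.append({"alias": process["name"], "data_block": files})
    return runs


processes = [
    {"name": "sequencing-1", "outputs": [{"name": "a_R1.fastq.gz"}, {"name": "a_R2.fastq.gz"}]},
    {"name": "sequencing-2", "outputs": [{"name": "b.fastq.gz"}]},
]
runs = build_runs(processes)
```

Here `runs` holds two entries: the first carries both paired FASTQ files, the second carries the single-end file.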
Updated ENA receipt mapping so top-level ENA study accessions resolve back to the assay path in the MARS receipt, rather than the parent study/investigation path.
Updated ENA run receipt mapping so grouped runs are expanded back onto the corresponding ISA dataFiles, including assigning one paired-end ERR... accession to both paired FASTQ entries.
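Expanding grouped runs back onto individual files might look like the sketch below. The `runs` structure, the alias-keyed accession dictionary, and the placeholder accession strings are all hypothetical; the real receipt format is not shown here.

```python
def expand_run_accessions(runs, accessions_by_alias):
    """Assign each run's accession to every data file grouped under that run."""
    accession_by_file = {}
    for run in runs:
        accession = accessions_by_alias[run["alias"]]
        for file_name in run["data_block"]:
            # Both files of a paired-end run receive the same accession.
            accession_by_file[file_name] = accession
    return accession_by_file


example_runs = [
    {"alias": "sequencing-1", "data_block": ["a_R1.fastq.gz", "a_R2.fastq.gz"]},
    {"alias": "sequencing-2", "data_block": ["b.fastq.gz"]},
]
example_accessions = {"sequencing-1": "RUN-ACC-1", "sequencing-2": "RUN-ACC-2"}
file_accessions = expand_run_accessions(example_runs, example_accessions)
```

A missing alias raises `KeyError` in this sketch, which surfaces a run that ENA did not acknowledge instead of silently dropping its files.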
Refreshed the example ISA JSON files with more realistic valid ENA metadata values, moved study title/description back onto the ISA Study, and added a richer multi-file example covering two source/sample chains, two experiments, one paired-end run, and one single-end run.

bedroesb added 17 commits May 27, 2026 18:55

Improve study and experiment handling

8e8e5fa

remove unecessary code

e3357d8

add a multi experiment example

826d09d

correct ilumina model

7ae7d53

add support for paired end reads

7e4caae

retuen all run accessions in the receipt

c389838

add support for multiple experiments/samples

a98589b

add receipt to logging

c01abdd

add more logs

5b938cd

MORE!

c6a3b84

final logging

053862f

fix run accession mapping

96f76a7

read study title and descr from study + remove unused project xml generator

e54a397

add missing function

5e719c2

align all example files

9752f65

format ISA JSON

717da18

fix end of lines

f0f6104

bedroesb wants to merge 17 commits into main from study-level
