Skip to content

Commit 6981bdc

Browse files
authored
Fix docs and new metro map
1 parent 6064f40 commit 6981bdc

16 files changed

Lines changed: 409 additions & 1391 deletions

CHANGELOG.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -50,7 +50,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
5050

5151
## v2.0.2
5252

53-
- Drop unsed params
53+
- Drop unused params
5454
- Set aligner to `star` for RNA-seq
5555
- Finetune resources
5656
- Fix some bugs for different input tags

README.md

Lines changed: 18 additions & 33 deletions
Original file line numberDiff line numberDiff line change
@@ -18,17 +18,23 @@ It also performs basic QC and coverage analysis.
1818

1919
The pipeline is built using Nextflow, a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It comes with docker containers making installation trivial and results highly reproducible.
2020

21-
Steps inlcude:
21+
Steps include:
2222

23-
1. Demultiplexing using [`BCLconvert`](https://emea.support.illumina.com/sequencing/sequencing_software/bcl-convert.html)
24-
2. Read QC and trimming using [`fastp`](https://github.com/OpenGene/fastp)
25-
3. Alignment using either [`bwa`](https://github.com/lh3/bwa), [`bwa-mem2`](https://github.com/bwa-mem2/bwa-mem2), [`bowtie2`](https://github.com/BenLangmead/bowtie2), [`dragmap`](https://github.com/Illumina/DRAGMAP), [`snap`](https://github.com/amplab/snap) or [`strobe`](https://github.com/ksahlin/strobealign) for DNA-seq and [`STAR`](https://github.com/alexdobin/STAR) for RNA-seq
26-
4. Duplicate marking using [`bamsormadup`](https://gitlab.com/german.tischler/biobambam2) or [`samtools markdup`](http://www.htslib.org/doc/samtools-markdup.html)
27-
5. Coverage analysis using [`mosdepth`](https://github.com/brentp/mosdepth) and [`samtools coverage`](http://www.htslib.org/doc/samtools-coverage.html)
28-
6. Alignment QC using [`samtools flagstat`](http://www.htslib.org/doc/samtools-flagstat.html), [`samtools stats`](http://www.htslib.org/doc/samtools-stats.html), [`samtools idxstats`](http://www.htslib.org/doc/samtools-idxstats.html) and [`picard CollecHsMetrics`](https://broadinstitute.github.io/picard/command-line-overview.html#CollectHsMetrics), [`picard CollectWgsMetrics`](https://broadinstitute.github.io/picard/command-line-overview.html#CollectWgsMetrics), [`picard CollectMultipleMetrics`](https://broadinstitute.github.io/picard/command-line-overview.html#CollectMultipleMetrics)
29-
7. QC aggregation using [`multiqc`](https://multiqc.info/)
23+
- Demultiplexing using [`BCLconvert`](https://emea.support.illumina.com/sequencing/sequencing_software/bcl-convert.html)
24+
- Run QC using [`MultiQC SAV`](https://github.com/MultiQC/MultiQC_SAV)
25+
- Read QC and trimming using [`fastp`](https://github.com/OpenGene/fastp) or [`falco`](https://github.com/smithlabcode/falco)
26+
- Alignment using either [`bwa`](https://github.com/lh3/bwa), [`bwa-mem2`](https://github.com/bwa-mem2/bwa-mem2), [`bowtie2`](https://github.com/BenLangmead/bowtie2), [`dragmap`](https://github.com/Illumina/DRAGMAP), [`snap`](https://github.com/amplab/snap) or [`strobe`](https://github.com/ksahlin/strobealign) for DNA-seq and [`STAR`](https://github.com/alexdobin/STAR) for RNA-seq
27+
- Duplicate marking using [`bamsormadup`](https://gitlab.com/german.tischler/biobambam2) or [`samtools markdup`](http://www.htslib.org/doc/samtools-markdup.html)
28+
- Coverage analysis using [`mosdepth`](https://github.com/brentp/mosdepth) and [`samtools coverage`](http://www.htslib.org/doc/samtools-coverage.html)
29+
- Alignment QC using [`samtools flagstat`](http://www.htslib.org/doc/samtools-flagstat.html), [`samtools stats`](http://www.htslib.org/doc/samtools-stats.html), [`samtools idxstats`](http://www.htslib.org/doc/samtools-idxstats.html) and [`picard CollectHsMetrics`](https://broadinstitute.github.io/picard/command-line-overview.html#CollectHsMetrics), [`picard CollectWgsMetrics`](https://broadinstitute.github.io/picard/command-line-overview.html#CollectWgsMetrics), [`picard CollectMultipleMetrics`](https://broadinstitute.github.io/picard/command-line-overview.html#CollectMultipleMetrics)
30+
- QC aggregation using [`multiqc`](https://multiqc.info/)
3031

31-
![metro map](docs/images/metro_map.png)
32+
<picture>
33+
34+
<source media="(prefers-color-scheme: dark)" srcset="docs/images/metro_map_dark.svg">
35+
<source media="(prefers-color-scheme: light)" srcset="docs/images/metro_map_light.svg">
36+
<img alt="Fallback image description" src="docs/images/metro_map_light.svg">
37+
</picture>
3238

3339
## Usage
3440

@@ -37,36 +43,15 @@ Steps inlcude:
3743
3844
The full documentation can be found [here](docs/README.md)
3945

40-
First, prepare a samplesheet with your input data that looks as follows:
41-
42-
`samplesheet.csv` for fastq inputs:
43-
44-
```csv
45-
id,samplename,organism,library,aligner,fastq_1,fastq_2
46-
sample1,sample1,Homo sapiens,Library_Name,bwamem,reads1.fq.gz,reads2.fq.gz
47-
```
48-
49-
`samplesheet.csv` for flowcell inputs:
50-
51-
```csv
52-
id,samplesheet,lane,flowcell,sample_info
53-
flowcell_id,/path/to/illumina_samplesheet.csv,1,/path/to/sequencer_uploaddir,/path/to/sampleinfo.csv
54-
```
55-
56-
`sampleinfo.csv` for use with flowcell inputs:
57-
58-
```csv
59-
samplename,library,organism,tag,aligner
60-
fc_sample1,test,Homo sapiens,WES,bwamem
61-
```
46+
First, prepare a samplesheet with your input data. Check the [usage docs](docs/usage.md) for details on the required format and example files.
6247

6348
Now, you can run the pipeline using:
6449

6550
```bash
6651
nextflow run nf-cmgg/preprocessing \
67-
-profile <docker/singularity/.../institute> \
52+
-profile <docker/singularity/...> \
6853
--igenomes_base /path/to/genomes \
69-
--input samplesheet.csv \
54+
--input samplesheet.<csv|yaml|json> \
7055
--outdir <OUTDIR>
7156
```
7257

assets/schema_input.json

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -160,10 +160,10 @@
160160
},
161161
"anyOf": [
162162
{
163-
"required": ["id", "samplename", "organism", "aligner", "tag", "fastq_1", "fastq_2"]
163+
"required": ["id", "samplename", "organism", "aligner", "fastq_1", "fastq_2"]
164164
},
165165
{
166-
"required": ["id", "samplename", "genome", "aligner", "tag", "fastq_1", "fastq_2"]
166+
"required": ["id", "samplename", "genome", "aligner", "fastq_1", "fastq_2"]
167167
},
168168
{
169169
"required": ["id", "samplesheet", "sample_info", "flowcell"]

docs/images/metro_map.png

-208 KB
Binary file not shown.

0 commit comments

Comments
 (0)