Skip to content

Commit 2aa4434

Browse files
authored
Merge pull request #13 from heuermh/add-seqkit-stats
Add nf-core seqkit/stats module
2 parents 49e32ca + 6bf2a51 commit 2aa4434

12 files changed

Lines changed: 506 additions & 20 deletions

File tree

CHANGELOG.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -9,6 +9,8 @@ Initial release of nf-core/proteinannotator, created with the [nf-core](https://
99

1010
### `Added`
1111

12+
- [[PR #13](https://github.com/nf-core/proteinannotator/pull/13)] Add nf-core seqkit/stats module
13+
1214
### `Fixed`
1315

1416
### `Dependencies`

conf/modules.config

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -27,4 +27,7 @@ process {
2727
]
2828
}
2929

30+
withName: SEQKIT_STATS {
31+
ext.args = ' ' // turn off --all default argument
32+
}
3033
}

docs/output.md

Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -13,6 +13,7 @@ The directories listed below will be created in the results directory after the
1313
The pipeline is built using [Nextflow](https://www.nextflow.io/) and processes data using the following steps:
1414

1515
- [MultiQC](#multiqc) - Aggregate report describing results and QC from the whole pipeline
16+
- [SeqKit stats](#seqkit_stats) - Simple statistics for protein FASTA files
1617
- [Pipeline information](#pipeline-information) - Report metrics generated during the workflow execution
1718

1819
### MultiQC
@@ -31,6 +32,18 @@ The pipeline is built using [Nextflow](https://www.nextflow.io/) and processes d
3132

3233
Results generated by MultiQC collate pipeline QC from supported tools e.g. FastQC. The pipeline has special steps which also allow the software versions to be reported in the MultiQC output for future traceability. For more information about how to use MultiQC reports, see <http://multiqc.info>.
3334

35+
### SeqKit stats
36+
37+
<details markdown="1">
38+
<summary>Output files</summary>
39+
40+
- `seqkit/`
41+
- `{prefix}.tsv`: output of `seqkit stats` command on `{prefix}.fasta` input file, in tab-delimited text format.
42+
43+
</details>
44+
45+
[SeqKit stats](https://bioinf.shenwei.me/seqkit/usage/#stats) generates simple statistics for protein FASTA files, such as number of residues, minimal sequence length, average sequence length, and maximal sequence length.
46+
3447
### Pipeline information
3548

3649
<details markdown="1">

modules.json

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -9,6 +9,11 @@
99
"branch": "master",
1010
"git_sha": "f0719ae309075ae4a291533883847c3f7c441dad",
1111
"installed_by": ["modules"]
12+
},
13+
"seqkit/stats": {
14+
"branch": "master",
15+
"git_sha": "81880787133db07d9b4c1febd152c090eb8325dc",
16+
"installed_by": ["modules"]
1217
}
1318
}
1419
},

modules/nf-core/seqkit/stats/environment.yml

Lines changed: 7 additions & 0 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

modules/nf-core/seqkit/stats/main.nf

Lines changed: 45 additions & 0 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

modules/nf-core/seqkit/stats/meta.yml

Lines changed: 50 additions & 0 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

modules/nf-core/seqkit/stats/tests/main.nf.test

Lines changed: 141 additions & 0 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

0 commit comments

Comments
 (0)