Skip to content

Commit 76e1a3e

Browse files
authored
Merge pull request #60 from nf-core/add-s4pred
added s4pred module to pipeline
2 parents 7d17d6c + 0269bdd commit 76e1a3e

16 files changed

Lines changed: 365 additions & 6 deletions

File tree

CHANGELOG.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,8 @@ Initial release of nf-core/proteinannotator, created with the [nf-core](https://
99

1010
### `Added`
1111

12-
- [#59](https://github.com/nf-core/proteinannotator/pull/59) - Added nf-core qc and pre-processing subworkflow for amino acid sequences `FAA_SEQFU_SEQKIT`. (by @vagkaratzas). (by @vagkaratzas)
12+
- [#60](https://github.com/nf-core/proteinannotator/pull/60) - Added nf-core module `S4PRED_RUNMODEL` for secondary structure prediction (i.e., α-helix, a β-strand or a coil). (by @vagkaratzas)
13+
- [#59](https://github.com/nf-core/proteinannotator/pull/59) - Added nf-core qc and pre-processing subworkflow for amino acid sequences `FAA_SEQFU_SEQKIT`. (by @vagkaratzas)
1314
- [#57](https://github.com/nf-core/proteinannotator/pull/57) - nf-core tools template update to 3.5.1. (by @vagkaratzas)
1415
- [#52](https://github.com/nf-core/proteinannotator/pull/52) - Add option to turn off InterProScan for testing
1516
- [#51](https://github.com/nf-core/proteinannotator/pull/51) - Update to nf-core/tools v3.3.1

CITATIONS.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -22,6 +22,10 @@
2222

2323
> Zdobnov, Evgeni M., and Rolf Apweiler. “InterProScan – an Integration Platform for the Signature-Recognition Methods in InterPro.” Bioinformatics 17, no. 9 (September 1, 2001): 847–48. https://doi.org/10.1093/bioinformatics/17.9.847.
2424
25+
- [s4pred](https://pubmed.ncbi.nlm.nih.gov/34213528/)
26+
27+
> Moffat L, Jones DT. Increasing the accuracy of single sequence prediction methods using a deep semi-supervised learning framework. Bioinformatics. 2021 Nov 1;37(21):3744-51. doi: 10.1093/bioinformatics/btab491. PubMed PMID: 34213528; PubMed Central PMCID: PMC8570780.
28+
2529
<!-- - [MMseqs2](https://www.nature.com/articles/nbt.3988)
2630
2731
> Zdobnov, Evgeni M., and Rolf Apweiler. “InterProScan – an Integration Platform for the Signature-Recognition Methods in InterPro.” Bioinformatics 17, no. 9 (September 1, 2001): 847–48. https://doi.org/10.1093/bioinformatics/17.9.847.

conf/modules.config

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -74,6 +74,16 @@ process {
7474
]
7575
}
7676

77+
withName: 'NFCORE_PROTEINANNOTATOR:PROTEINANNOTATOR:S4PRED_RUNMODEL' {
78+
ext.prefix = { params.s4pred_outfmt }
79+
ext.args = { "--outfmt ${params.s4pred_outfmt}" }
80+
publishDir = [
81+
path: { "${params.outdir}/s4pred/${meta.id}/" },
82+
mode: params.publish_dir_mode,
83+
saveAs: { filename -> filename.equals('versions.yml') ? null : filename }
84+
]
85+
}
86+
7787
withName: 'MULTIQC' {
7888
ext.args = { params.multiqc_title ? "--title \"$params.multiqc_title\"" : '' }
7989
publishDir = [

docs/output.md

Lines changed: 19 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -18,6 +18,9 @@ The pipeline is built using [Nextflow](https://www.nextflow.io/) and processes d
1818

1919
- [Functional Annotation](#functional-annotation) Annotate proteins with functional domains
2020
- [InterProScan](#Interproscan) - Search the InterPro database for functional domains
21+
22+
- [s4pred](#s4pred) - Predict secondary structures of sequences, producing per amino acid probabilities of being an α-helix, a β-strand or a coil.
23+
2124
- [MultiQC](#multiqc) - Aggregate report describing results and QC from the whole pipeline
2225
- [Pipeline information](#pipeline-information) - Report metrics generated during the workflow execution
2326

@@ -307,6 +310,22 @@ The XML Schema Definition (XSD) is available [here](http://ftp.ebi.ac.uk/pub/sof
307310

308311
</details>
309312

313+
#### s4pred
314+
315+
<details markdown="1">
316+
<summary>Output files</summary>
317+
318+
- `s4pred/`
319+
- `<samplename>/`
320+
- `<s4pred_outfmt>/`
321+
- `<samplename>.<s4pred_outfmt>`: The probability of each amino acid to be an α-helix, a β-strand or a coil, in the chosen output format (i.e., 'ss2', 'fas', or 'horiz').
322+
323+
</details>
324+
325+
The `s4pred` module is used to predict secondary structures of amino acid sequences.
326+
327+
[s4pred](https://github.com/psipred/s4pred) is a tool for accurate prediction of a protein's secondary structure from only it's amino acid sequence.
328+
310329
### MultiQC
311330

312331
<details markdown="1">

main.nf

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -39,7 +39,8 @@ workflow NFCORE_PROTEINANNOTATOR {
3939
//
4040
PROTEINANNOTATOR (
4141
samplesheet,
42-
params.skip_preprocessing
42+
params.skip_preprocessing,
43+
params.skip_s4pred
4344
)
4445
emit:
4546
multiqc_report = PROTEINANNOTATOR.out.multiqc_report // channel: /path/to/multiqc_report.html

modules.json

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -20,6 +20,11 @@
2020
"git_sha": "af27af1be706e6a2bb8fe454175b0cdf77f47b49",
2121
"installed_by": ["modules"]
2222
},
23+
"s4pred/runmodel": {
24+
"branch": "master",
25+
"git_sha": "41dfa3f7c0ffabb96a6a813fe321c6d1cc5b6e46",
26+
"installed_by": ["modules"]
27+
},
2328
"seqfu/stats": {
2429
"branch": "master",
2530
"git_sha": "e753770db613ce014b3c4bc94f6cba443427b726",

modules/nf-core/s4pred/runmodel/environment.yml

Lines changed: 7 additions & 0 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

modules/nf-core/s4pred/runmodel/main.nf

Lines changed: 52 additions & 0 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

modules/nf-core/s4pred/runmodel/meta.yml

Lines changed: 52 additions & 0 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

modules/nf-core/s4pred/runmodel/tests/main.nf.test

Lines changed: 58 additions & 0 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

0 commit comments

Comments
 (0)