Skip to content

Commit 89f5a13

Browse files
committed
added s4pred module to pipeline
1 parent 6386634 commit 89f5a13

13 files changed

Lines changed: 342 additions & 0 deletions

File tree

CITATIONS.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -22,6 +22,10 @@
2222

2323
> Zdobnov, Evgeni M., and Rolf Apweiler. “InterProScan – an Integration Platform for the Signature-Recognition Methods in InterPro.” Bioinformatics 17, no. 9 (September 1, 2001): 847–48. https://doi.org/10.1093/bioinformatics/17.9.847.
2424
25+
- [s4pred](https://pubmed.ncbi.nlm.nih.gov/34213528/)
26+
27+
> Moffat L, Jones DT. Increasing the accuracy of single sequence prediction methods using a deep semi-supervised learning framework. Bioinformatics. 2021 Nov 1;37(21):3744-51. doi: 10.1093/bioinformatics/btab491. PubMed PMID: 34213528; PubMed Central PMCID: PMC8570780.
28+
2529
<!-- - [MMseqs2](https://www.nature.com/articles/nbt.3988)
2630
2731
> Zdobnov, Evgeni M., and Rolf Apweiler. “InterProScan – an Integration Platform for the Signature-Recognition Methods in InterPro.” Bioinformatics 17, no. 9 (September 1, 2001): 847–48. https://doi.org/10.1093/bioinformatics/17.9.847.

conf/modules.config

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -74,6 +74,16 @@ process {
7474
]
7575
}
7676

77+
withName: 'NFCORE_PROTEINANNOTATOR:PROTEINANNOTATOR:S4PRED_RUNMODEL' {
78+
ext.prefix = { params.s4pred_outfmt }
79+
ext.args = { "--outfmt ${params.s4pred_outfmt}" }
80+
publishDir = [
81+
path: { "${params.outdir}/s4pred/${meta.id}/" },
82+
mode: params.publish_dir_mode,
83+
saveAs: { filename -> filename.equals('versions.yml') ? null : filename }
84+
]
85+
}
86+
7787
withName: 'MULTIQC' {
7888
ext.args = { params.multiqc_title ? "--title \"$params.multiqc_title\"" : '' }
7989
publishDir = [

docs/output.md

Lines changed: 19 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -18,6 +18,9 @@ The pipeline is built using [Nextflow](https://www.nextflow.io/) and processes d
1818

1919
- [Functional Annotation](#functional-annotation) Annotate proteins with functional domains
2020
- [InterProScan](#Interproscan) - Search the InterPro database for functional domains
21+
22+
- [s4pred](#s4pred) - Predict secondary structures of sequences, producing per amino acid probabilities of being an α-helix, a β-strand or a coil.
23+
2124
- [MultiQC](#multiqc) - Aggregate report describing results and QC from the whole pipeline
2225
- [Pipeline information](#pipeline-information) - Report metrics generated during the workflow execution
2326

@@ -307,6 +310,22 @@ The XML Schema Definition (XSD) is available [here](http://ftp.ebi.ac.uk/pub/sof
307310

308311
</details>
309312

313+
#### s4pred
314+
315+
<details markdown="1">
316+
<summary>Output files</summary>
317+
318+
- `s4pred/`
319+
- `<samplename>/`
320+
- `<s4pred_outfmt>/`
321+
- `<samplename>.<s4pred_outfmt>`: The probability of each amino acid to be an α-helix, a β-strand or a coil, in the chosen output format (i.e., 'ss2', 'fas', or 'horiz').
322+
323+
</details>
324+
325+
The `s4pred` module is used to predict secondary structures of amino acid sequences.
326+
327+
[s4pred](https://github.com/psipred/s4pred) is a tool for accurate prediction of a protein's secondary structure from only it's amino acid sequence.
328+
310329
### MultiQC
311330

312331
<details markdown="1">

modules.json

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -20,6 +20,11 @@
2020
"git_sha": "af27af1be706e6a2bb8fe454175b0cdf77f47b49",
2121
"installed_by": ["modules"]
2222
},
23+
"s4pred/runmodel": {
24+
"branch": "master",
25+
"git_sha": "41dfa3f7c0ffabb96a6a813fe321c6d1cc5b6e46",
26+
"installed_by": ["modules"]
27+
},
2328
"seqfu/stats": {
2429
"branch": "master",
2530
"git_sha": "e753770db613ce014b3c4bc94f6cba443427b726",

modules/nf-core/s4pred/runmodel/environment.yml

Lines changed: 7 additions & 0 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

modules/nf-core/s4pred/runmodel/main.nf

Lines changed: 52 additions & 0 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

modules/nf-core/s4pred/runmodel/meta.yml

Lines changed: 52 additions & 0 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

modules/nf-core/s4pred/runmodel/tests/main.nf.test

Lines changed: 58 additions & 0 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

modules/nf-core/s4pred/runmodel/tests/main.nf.test.snap

Lines changed: 98 additions & 0 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

nextflow.config

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -24,6 +24,10 @@ params {
2424
interproscan_database = null
2525
interproscan_database_version = null
2626

27+
// Secondary structure prediction (s4pred)
28+
skip_s4pred = false
29+
s4pred_outfmt = 'ss2' // ["ss2", "fas", "horiz"]
30+
2731
// MultiQC options
2832
multiqc_config = null
2933
multiqc_title = null

0 commit comments

Comments
 (0)