You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: CHANGELOG.md
+1Lines changed: 1 addition & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -9,6 +9,7 @@ Initial release of nf-core/proteinannotator, created with the [nf-core](https://
9
9
10
10
### `Added`
11
11
12
+
-[#61](https://github.com/nf-core/proteinannotator/pull/61) - Added nf-core modules `ARIA2` and `HMMER_HMMSEARCH` to download latest Pfam HMM library (or use path to existing one) and match domains to input sequences. (by @vagkaratzas)
12
13
-[#60](https://github.com/nf-core/proteinannotator/pull/60) - Added nf-core module `S4PRED_RUNMODEL` for secondary structure prediction (i.e., α-helix, a β-strand or a coil). (by @vagkaratzas)
13
14
-[#59](https://github.com/nf-core/proteinannotator/pull/59) - Added nf-core qc and pre-processing subworkflow for amino acid sequences `FAA_SEQFU_SEQKIT`. (by @vagkaratzas)
14
15
-[#57](https://github.com/nf-core/proteinannotator/pull/57) - nf-core tools template update to 3.5.1. (by @vagkaratzas)
Copy file name to clipboardExpand all lines: CITATIONS.md
+4Lines changed: 4 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -18,6 +18,10 @@
18
18
19
19
> Shen W, Sipos B, Zhao L. SeqKit2: A Swiss army knife for sequence and alignment processing. iMeta. 2024 Apr 5:e191. doi: 10.1002/imt2.191. PubMed PMID: 38898985; PubMed Central PMCID: PMC11183193.
> Zdobnov, Evgeni M., and Rolf Apweiler. “InterProScan – an Integration Platform for the Signature-Recognition Methods in InterPro.” Bioinformatics 17, no. 9 (September 1, 2001): 847–48. https://doi.org/10.1093/bioinformatics/17.9.847.
Copy file name to clipboardExpand all lines: docs/output.md
+35-2Lines changed: 35 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -16,7 +16,11 @@ The pipeline is built using [Nextflow](https://www.nextflow.io/) and processes d
16
16
-[SeqFu](#seqfu) for input amino acid sequences quality check (QC)
17
17
-[SeqKit](#seqkit) for preprocessing input amino acid sequences (i.e., gap removal, convert to upper case, validate, filter by length, replace special characters such as `/`, and remove duplicate sequences)
18
18
19
-
-[Functional Annotation](#functional-annotation) Annotate proteins with functional domains
19
+
-[Domain annotation](#domain-annotation) Annotate proteins with domains from established repositories.
20
+
-[aria2](#aria2) - To optionally download the latest Pfam database through the pipeline.
21
+
-[hmmer](#hmmer) - To optionally match the input sequence to known Pfam domains through `hmmer/hmmsearch`
22
+
23
+
-[Functional annotation](#functional-annotation) Annotate proteins with functional domains
20
24
-[InterProScan](#Interproscan) - Search the InterPro database for functional domains
21
25
22
26
-[s4pred](#s4pred) - Predict secondary structures of sequences, producing per amino acid probabilities of being an α-helix, a β-strand or a coil.
@@ -60,7 +64,36 @@ The `seqkit` module is used for initial preprocessing (i.e., gap removal, conver
60
64
61
65
[SeqKit](https://github.com/shenwei356/seqkit) is a cross-platform and ultrafast toolkit for FASTA/Q file manipulation.
62
66
63
-
### Functional Annotation
67
+
### Domain annotation
68
+
69
+
#### aria2
70
+
71
+
<detailsmarkdown="1">
72
+
<summary>Output files</summary>
73
+
74
+
-`downloaded_dbs/`
75
+
-`Pfam-A*.hmm.gz`: (optional) The latest full, or a minimal test, Pfam-A HMM database that can be downloaded through the pipeline.
76
+
77
+
</details>
78
+
79
+
[aria2](https://github.com/aria2/aria2/) is a lightweight multi-protocol & multi-source, cross platform download utility operated in command-line. It supports HTTP/HTTPS, FTP, SFTP, BitTorrent and Metalink.
80
+
81
+
#### hmmer
82
+
83
+
<detailsmarkdown="1">
84
+
<summary>Output files</summary>
85
+
86
+
-`domain_annotation/`
87
+
-`pfam/`
88
+
-`<samplename>.domtbl.gz`: `hmmer/hmmsearch` results along parameters info.
89
+
90
+
</details>
91
+
92
+
The `domain_annotation/pfam` folder contains a `.domtbl.gz` annotation file per input sample.
93
+
94
+
[hmmer](https://github.com/EddyRivasLab/hmmer) is a fast and flexible alignment trimming tool that keeps phylogenetically informative sites and removes others.
0 commit comments