You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: CHANGELOG.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,7 +7,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
7
7
8
8
### `Added`
9
9
10
-
-[#90](https://github.com/nf-core/proteinannotator/pull/90) - Added the option to download and use the latest `MetagRoot` HMM library (or use path to an existing one) for domain annotation. (by @angelphanth)
10
+
-[#90](https://github.com/nf-core/proteinannotator/pull/90) - Added the option to download and use the latest `metagRoot` HMM library (or use path to an existing one) for domain annotation. (by @angelphanth)
11
11
-[#87](https://github.com/nf-core/proteinannotator/pull/87) - Added the option to download and use the latest `NMPFams` HMM library (or use path to an existing one) for domain annotation. (by @npechl)
12
12
-[#85](https://github.com/nf-core/proteinannotator/pull/85) - Added zenodo doi in `nextflow.config`. (by @vagkaratzas)
1. Conserved domain annotation with ([`hmmer`](https://github.com/EddyRivasLab/hmmer/)) against databases
40
-
such as [Pfam](https://ftp.ebi.ac.uk/pub/databases/Pfam/), [FunFam](https://download.cathdb.info/cath/releases/all-releases/), and [NMPFams and MetagRoot](https://pavlopoulos-lab.org/envofams/databases/hmmer/)
40
+
such as [Pfam](https://ftp.ebi.ac.uk/pub/databases/Pfam/), [FunFam](https://download.cathdb.info/cath/releases/all-releases/), and [NMPFams and metagRoot](https://pavlopoulos-lab.org/envofams/databases/hmmer/)
41
41
2. Functional annotation:
42
42
- ([`InterProScan`](https://interproscan-docs.readthedocs.io/en/v5/)) a software tool used to analyze protein sequences by scanning them against the signatures of protein families, domains, and sites in the [InterPro](https://www.ebi.ac.uk/interpro/) database, helping to identify their functional characteristics.
43
43
3. Predict secondary structure compositional features such as α-helices, β-strands and coils with ([`s4pred`](https://github.com/psipred/s4pred))
Copy file name to clipboardExpand all lines: docs/output.md
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -14,9 +14,9 @@ The pipeline is built using [Nextflow](https://www.nextflow.io/) and processes d
14
14
-[SeqFu](#seqfu) for input amino acid sequences quality control (QC)
15
15
-[SeqKit](#seqkit) for preprocessing input amino acid sequences (i.e., gap removal, convert to upper case, validate, filter by length, replace special characters such as `/`, and remove duplicate sequences)
16
16
-[Database download](#database-download) Optionally download selected databases for annotation.
17
-
-[aria2](#aria2) - To optionally download the Pfam, FunFam, NMPFams, MetagRoot and/or InterProScan databases through the pipeline.
17
+
-[aria2](#aria2) - To optionally download the Pfam, FunFam, NMPFams, metagRoot and/or InterProScan databases through the pipeline.
18
18
-[Domain annotation](#domain-annotation) Annotate proteins with domains from established repositories.
19
-
-[hmmer](#hmmer) - To optionally match the input sequence to known Pfam, FunFam, NMPFams and/or MetagRoot domains through `hmmer/hmmsearch`
19
+
-[hmmer](#hmmer) - To optionally match the input sequence to known Pfam, FunFam, NMPFams and/or metagRoot domains through `hmmer/hmmsearch`
20
20
-[Functional annotation](#functional-annotation) Annotate proteins with functional domains
21
21
-[InterProScan](#Interproscan) - Search the InterProScan database for functional domains
22
22
-[s4pred](#s4pred) - Predict secondary structures of sequences, producing amino acid level probabilities of forming an α-helix, a β-strand or a coil.
@@ -73,7 +73,7 @@ The `seqkit` module is used for initial preprocessing (i.e., gap removal, conver
73
73
-`interproscan_test.tar.gz`: (optional) the downloaded InterProScan archive of member databases according to the optional user-provided url
74
74
-`funfam-hmm3-v4_3_0*.lib.gz`: (optional) The latest (v4_3_0) full, or a minimal test, FunFam HMM database that can be downloaded through the pipeline.
75
75
-`nmpfamsdb.hmm.gz`: (optional) The latest full, or a minimal test, NMPFams HMM database that can be downloaded through the pipeline.
76
-
-`metagroot.hmm.gz`: (optional) The latest full, or a minimal test, MetagRoot HMM database that can be downloaded through the pipeline.
76
+
-`metagroot.hmm.gz`: (optional) The latest full, or a minimal test, metagRoot HMM database that can be downloaded through the pipeline.
Copy file name to clipboardExpand all lines: docs/usage.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,7 +7,7 @@
7
7
## Introduction
8
8
9
9
**nf-core/proteinannotator** is a bioinformatics pipeline that computes statistics and generates sequence-level annotations for amino acid sequences.
10
-
It takes a protein FASTA file as input and performs conserved domain annotation (using Pfam, FunFam, NMPFams and MetagRoot HMM databases), functional annotation (using InterProScan), and secondary structure prediction (using s4pred).
10
+
It takes a protein FASTA file as input and performs conserved domain annotation (using Pfam, FunFam, NMPFams and metagRoot HMM databases), functional annotation (using InterProScan), and secondary structure prediction (using s4pred).
11
11
Optionally, paths to pre-downloaded databases can be provided to skip the automatic download steps and speed up repeated runs.
0 commit comments