Skip to content

Commit b4af633

Browse files
authored
Merge pull request #67 from nf-core/nf-core-interproscan
Use nf-core interproscan instead of local version
2 parents c51c2d1 + 22f5e2f commit b4af633

144 files changed

Lines changed: 968 additions & 325168 deletions

File tree

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

.github/workflows/nf-test.yml

Lines changed: 0 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -51,7 +51,6 @@ jobs:
5151
NFT_VER: ${{ env.NFT_VER }}
5252
with:
5353
max_shards: 7
54-
tags: CI
5554

5655
- name: debug
5756
run: |
@@ -101,7 +100,6 @@ jobs:
101100
profile: ${{ matrix.profile }}
102101
shard: ${{ matrix.shard }}
103102
total_shards: ${{ env.TOTAL_SHARDS }}
104-
tags: CI
105103

106104
- name: Report test status
107105
if: ${{ always() }}

.prettierignore

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -16,4 +16,3 @@ modules/nf-core/
1616
subworkflows/nf-core/
1717
**/Makefile
1818
Makefile
19-
modules/local/interproscan/tests/fixtures/interproscan_db/data

CHANGELOG.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -9,6 +9,8 @@ Initial release of nf-core/proteinannotator, created with the [nf-core](https://
99

1010
### `Added`
1111

12+
- [#68](https://github.com/nf-core/proteinannotator/pull/68) - Using the `ARIA2` and `UNTAR` nf-core modules to download and decompress the InterProScan database. (by @vagkaratzas)
13+
- [#67](https://github.com/nf-core/proteinannotator/pull/67) - Swapped to the updated, non-buggy, nf-core version of `INTERPROSCAN`. (by @vagkaratzas)
1214
- [#65](https://github.com/nf-core/proteinannotator/pull/65) - Converted the pipeline schematic to nf-core metromap. (by @vagkaratzas)
1315
- [#62](https://github.com/nf-core/proteinannotator/pull/62) - Added the option to download and use the latest FunFam HMM library (or use path to an existing one) for domain annotation. (by @vagkaratzas)
1416
- [#61](https://github.com/nf-core/proteinannotator/pull/61) - Added nf-core modules `ARIA2` and `HMMER_HMMSEARCH` to download latest Pfam HMM library (or use path to existing one) and match domains to input sequences. (by @vagkaratzas)
@@ -20,7 +22,7 @@ Initial release of nf-core/proteinannotator, created with the [nf-core](https://
2022
- [#47](https://github.com/nf-core/proteinannotator/pull/47) - Update metromap with more tools added from [May 2025 Hackathon](https://nf-co.re/events/2025/hackathon-boston)
2123
- [#42](https://github.com/nf-core/proteinannotator/pull/42) - Updated to `nf-test` on GitHub Actions and in the `PULL_REQUEST_TEMPLATE.md`
2224
- [#13](https://github.com/nf-core/proteinannotator/pull/13) - Add nf-core seqkit/stats module
23-
- [#9](https://github.com/nf-core/proteinannotator/pull/9) - Add [InterProScan](https://interproscan-docs.readthedocs.io/) module
25+
- [#9](https://github.com/nf-core/proteinannotator/pull/9) - Added [InterProScan](https://interproscan-docs.readthedocs.io/) module - local version
2426

2527
### `Fixed`
2628

CITATIONS.md

Lines changed: 1 addition & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -24,20 +24,12 @@
2424
2525
- [InterProScan](https://academic.oup.com/bioinformatics/article/17/9/847/206564)
2626

27-
> Zdobnov, Evgeni M., and Rolf Apweiler. “InterProScan – an Integration Platform for the Signature-Recognition Methods in InterPro.” Bioinformatics 17, no. 9 (September 1, 2001): 847–48. https://doi.org/10.1093/bioinformatics/17.9.847.
27+
> Jones P, Binns D, Chang HY, Fraser M, Li W, McAnulla C, McWilliam H, Maslen J, Mitchell A, Nuka G, Pesseat S, Quinn, A. F, Sangrador-Vegas A, Scheremetjew M, Yong S-Y, Lopez R, Hunter S. InterProScan 5: genome-scale protein function classification. Bioinformatics. 2014 May 1;30(9):1236-40. doi: 10.1093/bioinformatics/btu031. PubMed PMID: 24451626; PubMed Central PMCID: PMC3998142.
2828
2929
- [s4pred](https://pubmed.ncbi.nlm.nih.gov/34213528/)
3030

3131
> Moffat L, Jones DT. Increasing the accuracy of single sequence prediction methods using a deep semi-supervised learning framework. Bioinformatics. 2021 Nov 1;37(21):3744-51. doi: 10.1093/bioinformatics/btab491. PubMed PMID: 34213528; PubMed Central PMCID: PMC8570780.
3232
33-
<!-- - [MMseqs2](https://www.nature.com/articles/nbt.3988)
34-
35-
> Zdobnov, Evgeni M., and Rolf Apweiler. “InterProScan – an Integration Platform for the Signature-Recognition Methods in InterPro.” Bioinformatics 17, no. 9 (September 1, 2001): 847–48. https://doi.org/10.1093/bioinformatics/17.9.847.
36-
37-
- [mTM-align](https://academic.oup.com/bioinformatics/article/34/10/1719/4769500)
38-
39-
> Dong, Runze, Zhenling Peng, Yang Zhang, and Jianyi Yang. “mTM-Align: An Algorithm for Fast and Accurate Multiple Protein Structure Alignment.” Bioinformatics 34, no. 10 (May 15, 2018): 1719–25. https://doi.org/10.1093/bioinformatics/btx828. -->
40-
4133
- [MultiQC](https://pubmed.ncbi.nlm.nih.gov/27312411/)
4234

4335
> Ewels P, Magnusson M, Lundin S, Käller M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics. 2016 Oct 1;32(19):3047-8. doi: 10.1093/bioinformatics/btw354. Epub 2016 Jun 16. PubMed PMID: 27312411; PubMed Central PMCID: PMC5039924.

conf/modules.config

Lines changed: 29 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -110,6 +110,35 @@ process {
110110
]
111111
}
112112

113+
withName: 'NFCORE_PROTEINANNOTATOR:PROTEINANNOTATOR:FUNCTIONAL_ANNOTATION:ARIA2' {
114+
publishDir = [
115+
path: { "${params.outdir}/downloaded_dbs/" },
116+
mode: params.publish_dir_mode,
117+
saveAs: { filename -> filename.equals('versions.yml') ? null : filename }
118+
]
119+
}
120+
121+
withName: 'NFCORE_PROTEINANNOTATOR:PROTEINANNOTATOR:FUNCTIONAL_ANNOTATION:UNTAR' {
122+
publishDir = [
123+
path: { "${params.outdir}/downloaded_dbs/" },
124+
mode: params.publish_dir_mode,
125+
saveAs: { filename -> filename.equals('versions.yml') ? null : filename },
126+
]
127+
}
128+
129+
withName: 'NFCORE_PROTEINANNOTATOR:PROTEINANNOTATOR:FUNCTIONAL_ANNOTATION:INTERPROSCAN' {
130+
publishDir = [
131+
path: { "${params.outdir}/functional_annotation/interproscan/${meta.id}/" },
132+
mode: params.publish_dir_mode,
133+
saveAs: { filename -> filename.equals('versions.yml') ? null : filename },
134+
]
135+
ext.args = [
136+
"--applications ${params.interproscan_applications}",
137+
params.interproscan_enableprecalc ? '' : '--disable-precalc'
138+
].join(' ').trim()
139+
cpus = 1 // bugs with more
140+
}
141+
113142
withName: 'NFCORE_PROTEINANNOTATOR:PROTEINANNOTATOR:S4PRED_RUNMODEL' {
114143
ext.prefix = { params.s4pred_outfmt }
115144
ext.args = { "--outfmt ${params.s4pred_outfmt}" }

conf/test.config

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -23,10 +23,11 @@ params {
2323
config_profile_description = 'Minimal test dataset to check pipeline function'
2424

2525
// Input data
26-
input = params.pipelines_testdata_base_path + 'proteinfold/testdata/samplesheet/v1.2/samplesheet.csv'
26+
input = params.pipelines_testdata_base_path + 'proteinannotator/samplesheet/samplesheet.csv'
2727
// Domain annotation
2828
pfam_latest_link = params.pipelines_testdata_base_path + 'proteinannotator/testdata/pfam/Pfam-A_test.hmm.gz'
2929
funfam_latest_link = params.pipelines_testdata_base_path + 'proteinannotator/testdata/funfam/funfam-hmm3-v4_3_0_test.lib.gz'
3030
// Functional annotation
31-
skip_interproscan = true
31+
interproscan_db_url = params.pipelines_testdata_base_path + 'proteinannotator/testdata/interproscan/interproscan_test.tar.gz'
32+
interproscan_applications = 'Hamap,TIGRFAM,sfld'
3233
}

conf/test_full.config

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -15,8 +15,11 @@ params {
1515
config_profile_description = 'Full test dataset to check pipeline function'
1616

1717
// Input data for full size test
18-
input = params.pipelines_testdata_base_path + 'proteinannotator/samplesheet/snap25-isoforms.csv'
18+
input = params.pipelines_testdata_base_path + 'proteinannotator/testdata/samplesheet.csv'
1919
// Domain annotation
2020
pfam_latest_link = params.pipelines_testdata_base_path + 'proteinannotator/testdata/pfam/Pfam-A_test.hmm.gz'
2121
funfam_latest_link = params.pipelines_testdata_base_path + 'proteinannotator/testdata/funfam/funfam-hmm3-v4_3_0_test.lib.gz'
22+
// Functional annotation
23+
interproscan_db_url = params.pipelines_testdata_base_path + 'proteinannotator/testdata/interproscan_test.tar.gz'
24+
interproscan_applications = 'Hamap,TIGRFAM,sfld'
2225
}
104 KB
Loading

docs/images/proteinannotator_metromap_dark.svg

Lines changed: 1 addition & 1 deletion
Loading
106 KB
Loading

0 commit comments

Comments
 (0)