Skip to content

qc and preprocessing nf-core subworkflow added#59

Merged
vagkaratzas merged 3 commits into
devfrom
qc-subworkflow
Dec 2, 2025
Merged

qc and preprocessing nf-core subworkflow added#59
vagkaratzas merged 3 commits into
devfrom
qc-subworkflow

Conversation

@vagkaratzas

@vagkaratzas vagkaratzas commented Dec 1, 2025

Copy link
Copy Markdown
Collaborator

Closes #7

Combination of seqkit and seqfu tools similar to the proteinfamilies pipeline.

seqfu_stats output example:

File	#Seq	Total bp	Avg	N50	N75	N90	auN	Min	Max
T1024.fasta	1	408	408.00	408	408	408	408.00	408	408

seqkit pre-processing option examples (found in modules.config):

"--remove-gaps",
"--upper-case",
"--validate-seq",
"--min-len ${params.min_seq_length}",
"--max-len ${params.max_seq_length}"
special character replacement
duplicates removal (by id or by sequence)

PR checklist

  • This comment contains a description of changes (with reason).
  • If you've fixed a bug or added code that should be tested, add tests!
  • If you've added a new tool - have you followed the pipeline conventions in the contribution docs
  • If necessary, also make a PR on the nf-core/proteinannotator branch on the nf-core/test-datasets repository.
  • Make sure your code lints (nf-core pipelines lint).
  • Ensure the test suite passes (e.g. nf-test test */local --profile=~test,docker for all new local tests).
  • Check for unexpected warnings in debug mode (nf-test test */local --profile=~test,docker,debug).
  • Usage Documentation in docs/usage.md is updated.
  • Output Documentation in docs/output.md is updated.
  • CHANGELOG.md is updated.
  • README.md is updated (including new tool citations and authors/contributors).
    • docs/images/proteinannotator-metromap.light.excalidraw.png and docs/images/proteinannotator-metromap.light.excalidraw.png (edit the light version only, then export and turn on dark mode) are both updated (use the excalidraw website or VS Code plugin to edit)

@github-actions

github-actions Bot commented Dec 1, 2025

Copy link
Copy Markdown

nf-core pipelines lint overall result: Passed ✅ ⚠️

Posted for pipeline commit 6386634

+| ✅ 201 tests passed       |+
#| ❔   7 tests were ignored |#
!| ❗   1 tests had warnings |!
Details

❗ Test warnings:

  • readme - README contains the placeholder zenodo.XXXXXXX. This should be replaced with the zenodo doi (after the first release).

❔ Tests ignored:

  • files_exist - File is ignored: .github/workflows/ci.yml
  • files_exist - File is ignored: conf/igenomes.config
  • files_exist - File is ignored: conf/igenomes_ignored.config
  • files_unchanged - File ignored due to lint config: .github/PULL_REQUEST_TEMPLATE.md
  • files_unchanged - File ignored due to lint config: assets/nf-core-proteinannotator_logo_light.png
  • files_unchanged - File ignored due to lint config: docs/images/nf-core-proteinannotator_logo_light.png
  • files_unchanged - File ignored due to lint config: docs/images/nf-core-proteinannotator_logo_dark.png

✅ Tests passed:

Run details

  • nf-core/tools version 3.5.1
  • Run at 2025-12-02 08:47:45

@Joon-Klaps Joon-Klaps left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Minor detail regarding the MMseqs modules

Comment on lines +13 to +16
// TODO remove if unused
// include { MMSEQS_SEARCH } from '../modules/nf-core/mmseqs/search/main'
// include { MTMALIGN_ALIGN } from '../modules/nf-core/mtmalign/align/main'

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Think u forgot to remove this?

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looking more in depth, the modules themselves aren't even used in the pipeline?

Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Other people have left them there...so for now, until I figure out what they want to do with them for the first release, I've just commented ;)

@vagkaratzas

Copy link
Copy Markdown
Collaborator Author

Thanks @Joon-Klaps !

@vagkaratzas vagkaratzas merged commit 7d17d6c into dev Dec 2, 2025
13 checks passed
@vagkaratzas vagkaratzas deleted the qc-subworkflow branch December 2, 2025 09:22
@vagkaratzas vagkaratzas mentioned this pull request Feb 4, 2026
11 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add quality control / input file statistics

2 participants