You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: LICENSE
+1-1Lines changed: 1 addition & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -2,7 +2,7 @@
2
2
3
3
MIT License
4
4
5
-
Copyright (c) 2024, AUTHORS
5
+
Copyright (c) 2025, Rina Ahmed-Begrich
6
6
7
7
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
A Snakemake workflow for the preprocessing of short read rnaseq data in bacteria.
11
+
A Snakemake workflow for the processing of short read rnaseq data in bacteria.
11
12
12
-
-[Bacterial RNAseq preprocessing](#anchortitle)
13
+
-[Bacterial RNAseq processing](#anchortitle)
13
14
-[Usage](#usage)
14
15
-[Workflow overview](#workflow-overview)
15
16
-[Installation](#installation)
@@ -22,13 +23,13 @@ A Snakemake workflow for the preprocessing of short read rnaseq data in bacteria
22
23
23
24
## Usage
24
25
25
-
The usage of this workflow is described in the [Snakemake Workflow Catalog](https://snakemake.github.io/snakemake-workflow-catalog/?usage=MPUSP%2Fsnakemake-bacterial-rnaseq-preprocessing).
26
+
The usage of this workflow is described in the [Snakemake Workflow Catalog](https://snakemake.github.io/snakemake-workflow-catalog/?usage=MPUSP%2Fsnakemake-bacterial-rnaseq-processing).
26
27
27
28
If you use this workflow in a paper, don't forget to give credits to the authors by citing the URL of this (original) <repo>sitory and its DOI (see above).
28
29
29
30
## Workflow overview
30
31
31
-
This workflow is a best-practice workflow for the preprocessing of short read sequencing data in bacteria. The workflow is built using [snakemake](https://snakemake.readthedocs.io/en/stable/) and consists of the following steps:
32
+
This workflow is a best-practice workflow for the processing of short read sequencing data in bacteria. The workflow is built using [snakemake](https://snakemake.readthedocs.io/en/stable/) and consists of the following steps:
32
33
33
34
1. Obtain genome database in `fasta` and `gff` format (`python`, [NCBI Datasets](https://www.ncbi.nlm.nih.gov/datasets/docs/v2/))
34
35
1. Using automatic download from NCBI with a `RefSeq` ID
@@ -39,8 +40,9 @@ This workflow is a best-practice workflow for the preprocessing of short read se
39
40
5. Map reads to the reference genome ([STAR aligner](https://github.com/alexdobin/STAR))
40
41
6. Sort and index aligned rnaseq data ([Samtools](http://www.htslib.org/))
41
42
7. Deduplicate reads by unique molecular identifier (UMI, [UMI-tools](https://umi-tools.readthedocs.io/en/latest/))
42
-
8. Quantify biotype features ([featureCounts](https://subread.sourceforge.net/featureCounts.html))
43
-
9. Generate summary report for all processing steps ([MultiQC](https://seqera.io/multiqc/))
@@ -117,7 +119,7 @@ Currently, we support example configurations for three different sequencing prot
117
119
To run the workflow from command line, change the working directory.
118
120
119
121
```bash
120
-
cd snakemake-bacterial-rnaseq-preprocessing
122
+
cd snakemake-bacterial-rnaseq-processing
121
123
```
122
124
123
125
To run the complete workflow with test files using **`conda`**, execute the following command. The definition of the number of compute cores is mandatory.
0 commit comments