Skip to content

Commit 5675023

Browse files
authored
Merge pull request #29 from MPUSP/dev
docs: updating pipeline documentation and dag graph
2 parents 3aa74c8 + 390587a commit 5675023

3 files changed

Lines changed: 18 additions & 16 deletions

File tree

LICENSE

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@
22

33
MIT License
44

5-
Copyright (c) 2024, AUTHORS
5+
Copyright (c) 2025, Rina Ahmed-Begrich
66

77
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
88

README.md

Lines changed: 17 additions & 15 deletions
Original file line numberDiff line numberDiff line change
@@ -1,15 +1,16 @@
1-
# <a id="anchortitle" />Snakemake workflow: bacterial-rnaseq-preprocessing
1+
# <a id="anchortitle" />Snakemake workflow: bacterial-rnaseq-processing
22

3+
![Platform](https://img.shields.io/badge/platform-all-green)
34
[![Snakemake](https://img.shields.io/badge/snakemake-≥8.0.0-brightgreen.svg)](https://snakemake.github.io)
4-
[![Tests](https://github.com/MPUSP/snakemake-bacterial-rnaseq-preprocessing/actions/workflows/main.yml/badge.svg)](https://github.com/MPUSP/snakemake-bacterial-rnaseq-preprocessing/actions/workflows/main.yml)
5+
[![Tests](https://github.com/MPUSP/snakemake-bacterial-rnaseq-processing/actions/workflows/main.yml/badge.svg)](https://github.com/MPUSP/snakemake-bacterial-rnaseq-processing/actions/workflows/main.yml)
56
[![run with conda](http://img.shields.io/badge/run%20with-conda-3EB049?labelColor=000000&logo=anaconda)](https://docs.conda.io/en/latest/)
67
[![workflow catalog](https://img.shields.io/badge/Snakemake%20workflow%20catalog-darkgreen)](https://snakemake.github.io/snakemake-workflow-catalog)
78

89
------------------------------------------------------------------------
910

10-
A Snakemake workflow for the preprocessing of short read rnaseq data in bacteria.
11+
A Snakemake workflow for the processing of short read rnaseq data in bacteria.
1112

12-
- [Bacterial RNAseq preprocessing](#anchortitle)
13+
- [Bacterial RNAseq processing](#anchortitle)
1314
- [Usage](#usage)
1415
- [Workflow overview](#workflow-overview)
1516
- [Installation](#installation)
@@ -22,13 +23,13 @@ A Snakemake workflow for the preprocessing of short read rnaseq data in bacteria
2223

2324
## Usage
2425

25-
The usage of this workflow is described in the [Snakemake Workflow Catalog](https://snakemake.github.io/snakemake-workflow-catalog/?usage=MPUSP%2Fsnakemake-bacterial-rnaseq-preprocessing).
26+
The usage of this workflow is described in the [Snakemake Workflow Catalog](https://snakemake.github.io/snakemake-workflow-catalog/?usage=MPUSP%2Fsnakemake-bacterial-rnaseq-processing).
2627

2728
If you use this workflow in a paper, don't forget to give credits to the authors by citing the URL of this (original) <repo>sitory and its DOI (see above).
2829

2930
## Workflow overview
3031

31-
This workflow is a best-practice workflow for the preprocessing of short read sequencing data in bacteria. The workflow is built using [snakemake](https://snakemake.readthedocs.io/en/stable/) and consists of the following steps:
32+
This workflow is a best-practice workflow for the processing of short read sequencing data in bacteria. The workflow is built using [snakemake](https://snakemake.readthedocs.io/en/stable/) and consists of the following steps:
3233

3334
1. Obtain genome database in `fasta` and `gff` format (`python`, [NCBI Datasets](https://www.ncbi.nlm.nih.gov/datasets/docs/v2/))
3435
1. Using automatic download from NCBI with a `RefSeq` ID
@@ -39,8 +40,9 @@ This workflow is a best-practice workflow for the preprocessing of short read se
3940
5. Map reads to the reference genome ([STAR aligner](https://github.com/alexdobin/STAR))
4041
6. Sort and index aligned rnaseq data ([Samtools](http://www.htslib.org/))
4142
7. Deduplicate reads by unique molecular identifier (UMI, [UMI-tools](https://umi-tools.readthedocs.io/en/latest/))
42-
8. Quantify biotype features ([featureCounts](https://subread.sourceforge.net/featureCounts.html))
43-
9. Generate summary report for all processing steps ([MultiQC](https://seqera.io/multiqc/))
43+
8. Generate cpm normalized coverage files ([deepTools](https://deeptools.readthedocs.io/en/latest/))
44+
9. Quantify biotype features ([featureCounts](https://subread.sourceforge.net/featureCounts.html))
45+
10. Generate summary report for all processing steps ([MultiQC](https://seqera.io/multiqc/))
4446

4547
---
4648

@@ -52,8 +54,8 @@ This workflow is a best-practice workflow for the preprocessing of short read se
5254
**Step 1: Clone this repository**
5355

5456
``` bash
55-
git clone https://github.com/MPUSP/snakemake-bacterial-rnaseq-preprocessing.git
56-
cd snakemake-bacterial-rnaseq-preprocessing
57+
git clone https://github.com/MPUSP/snakemake-bacterial-rnaseq-processing.git
58+
cd snakemake-bacterial-rnaseq-processing
5759
```
5860

5961
**Step 2: Install dependencies**
@@ -62,12 +64,12 @@ It is recommended to install snakemake and run the workflow with `conda` or `mam
6264

6365
**Step 3: Create snakemake environment**
6466

65-
This step creates a new conda environment called `snakemake-bacterial-rnaseq-preprocessing`.
67+
This step creates a new conda environment called `snakemake-bacterial-rnaseq-processing`.
6668

6769
``` bash
6870
# create new environment with dependencies & activate it
69-
mamba create -c conda-forge -c bioconda -n snakemake-bacterial-rnaseq-preprocessing snakemake pandas python=3.12
70-
conda activate snakemake-bacterial-rnaseq-preprocessing
71+
mamba create -c conda-forge -c bioconda -n snakemake-bacterial-rnaseq-processing snakemake pandas python=3.12
72+
conda activate snakemake-bacterial-rnaseq-processing
7173
```
7274

7375
**Note:**
@@ -80,7 +82,7 @@ This step creates all conda environments specified in the snakemake rules. This
8082

8183
``` bash
8284
# activate new environment
83-
conda activate snakemake-bacterial-rnaseq-preprocessing
85+
conda activate snakemake-bacterial-rnaseq-processing
8486
snakemake -c 1 --sdm conda --conda-create-envs-only --conda-cleanup-pkgs cache
8587
```
8688

@@ -117,7 +119,7 @@ Currently, we support example configurations for three different sequencing prot
117119
To run the workflow from command line, change the working directory.
118120

119121
``` bash
120-
cd snakemake-bacterial-rnaseq-preprocessing
122+
cd snakemake-bacterial-rnaseq-processing
121123
```
122124

123125
To run the complete workflow with test files using **`conda`**, execute the following command. The definition of the number of compute cores is mandatory.

resources/images/dag.png

44.6 KB
Loading

0 commit comments

Comments
 (0)