These are the tutorials associated with downstream DARLIN data analysis, developed during our DARLIN paper L. Li,...,S.-W. Wang, F. Camargo, Cell (2023)
Single-cell lineage analysis includes
- Data loading
- Common allele filtering
- Allele integration across different target loci
- Lineage coupling or fate bias analysis
Bulk lineage analysis includes
- Data loading
- Common allele filtering
- Lineage coupling or shared clone fraction analysis
For allele integration across different loci, we used a heuristic approach that remains to be further improved. We use data associated with Figure 4 in paper for the single-cell tutorial, and use the data in Figure 5 for the bulk tutorial. In principle, the mutation patterns of the alleles can be further utilized for inferring lineage phylogeny. This is an advanced topic not covered by this tutorial.
To download the jupyter notebooks, please run
git clone https://github.com/ShouWenWang-Lab/DARLIN_tutorialThese notebooks cannot be directly executed without data. Please download the entire project file (~13G) from zendo, which includes all these notebooks, as well as all the raw and processed data needed for this tutorial, arranged in the correct folder structure that is runnable.
To download all raw or processed data, please go to GEO: GSE222486
To run this, you need to setup a proper environment, which we may call DARLIN_analysis. Particulary, it requires our in-house package called MosaicLineage as well as cospar
code_directory='.' # change it to the directory where you want to put the packages
cd $code_directory
kernel_name='DARLIN_analysis'
conda create -n $kernel_name python=3.8 --yes
conda activate $kernel_name
pip install cospar
pip install --user ipykernel
pip install pytest black ipywidgets
pip install pytest-datadir pytest-cov umi_tools toolz jupyterlab pyyaml scikit-bio biopython wand seaborn==0.11.2 scikit-learn==1.1.1 pandas==1.5.3
python -m ipykernel install --user --name=$kernel_name
pip uninstall cospar # uninstall the old version, and install the latest
cd $code_directory
git clone https://github.com/ShouWenWang-Lab/cospar --depth=1
cd cospar
python setup.py develop
cd $code_directory
git clone https://github.com/ShouWenWang-Lab/MosaicLineage --depth=1
cd MosaicLineage
python setup.py develop
cd $code_directory
git clone https://github.com/ShouWenWang-Lab/snakemake_DARLIN --depth=1
cd snakemake_DARLIN
python setup.py developL. Li, S. Bowling, S. E. McGeary, Q. Yu, B. Lemke, K. Alcedo, Y. Jia, X. Liu, M. Ferreira, A. M. Klein, S.-W. Wang*, F. D. Camargo*, A mouse model with high clonal barcode diversity for joint lineage, transcriptomic, and epigenomic profiling in single cells, Cell (2023). [* corresponding authors]
-
20231122_DARLIN_in_vivo_hematopoiesis.ipynbis also available from CoSpar package. -
MosaicLineage, A (mosaic) collection of python help functions related to lineage tracing data analysis, developed through the DARLIN project.