This repository was archived by the owner on Oct 14, 2025. It is now read-only.

Commit f110413

update vignette
1 parent 11a35bf commit f110413

1 file changed

Lines changed: 38 additions & 5 deletions

vignettes/Introduction.Rmd

Original file line number | Diff line number | Diff line change
@@ -7,6 +7,10 @@ vignette: >
77
%\VignetteEncoding{UTF-8}
88
---
99

10+
`CuratedAtlasQuery` is a query interface that allows the programmatic exploration and retrieval of the harmonised, curated and reannotated CELLxGENE single-cell human cell atlas. Data can be retrieved at the cell, sample, or dataset level based on filtering criteria.
11+
12+
# Query interface
13+
1014
```{r, include = FALSE}
1115
# Note: knit this to the repo readme file using:
1216
# rmarkdown::render("vignettes/readme.Rmd", output_format = "github_document", output_dir = getwd())
@@ -17,7 +21,7 @@ knitr::opts_chunk$set(
1721
```
1822

1923
```{r, echo=FALSE, out.height = "139px", out.width = "120px"}
20-
knitr::include_graphics("../inst/logo.png")
24+
knitr::include_graphics("inst/logo.png")
2125
```
2226

2327
## Load the package
@@ -168,10 +172,39 @@ get_metadata() |>
168172
```
169173

170174
```{r, echo=FALSE, message=FALSE, warning=FALSE}
171-
knitr::include_graphics("../inst/NCAM1_figure.png")
175+
knitr::include_graphics("inst/NCAM1_figure.png")
172176
```
173177

174-
```{r}
175-
sessionInfo()
176-
```
178+
# Cell metadata
179+
180+
Dataset-specific columns (definitions available at cellxgene.cziscience.com)
181+
182+
`cell_count`, `collection_id`, `created_at.x`, `created_at.y`, `dataset_deployments`, `dataset_id`, `file_id`, `filename`, `filetype`, `is_primary_data.y`, `is_valid`, `linked_genesets`, `mean_genes_per_cell`, `name`, `published`, `published_at`, `revised_at`, `revision`, `s3_uri`, `schema_version`, `tombstone`, `updated_at.x`, `updated_at.y`, `user_submitted`, `x_normalization`
183+
184+
Sample-specific columns (definitions available at cellxgene.cziscience.com)
185+
186+
`.sample`, `.sample_name`, `age_days`, `assay`, `assay_ontology_term_id`, `development_stage`, `development_stage_ontology_term_id`, `ethnicity`, `ethnicity_ontology_term_id`, `experiment___`, `organism`, `organism_ontology_term_id`, `sample_placeholder`, `sex`, `sex_ontology_term_id`, `tissue`, `tissue_harmonised`, `tissue_ontology_term_id`, `disease`, `disease_ontology_term_id`, `is_primary_data.x`
187+
188+
Cell-specific columns (definitions available at cellxgene.cziscience.com)
189+
190+
`.cell`, `cell_type`, `cell_type_ontology_term_id`, `cell_type_harmonised`, `confidence_class`, `cell_annotation_azimuth_l2`, `cell_annotation_blueprint_singler`
191+
192+
Through harmonisation and curation we introduced custom columns that are not present in the original CELLxGENE metadata:
193+
194+
- `tissue_harmonised`: a coarser tissue name for better filtering
195+
- `age_days`: the number of days corresponding to the age
196+
- `cell_type_harmonised`: the consensus cell identity call (for immune cells), derived from the original annotation and three novel annotations produced with Seurat Azimuth and SingleR
197+
- `confidence_class`: an ordinal class describing how confident the `cell_type_harmonised` call is: 1 is complete consensus, 2 is three out of four annotations in agreement, and so on
198+
- `cell_annotation_azimuth_l2`: Azimuth cell annotation
199+
- `cell_annotation_blueprint_singler`: SingleR cell annotation using Blueprint reference
200+
- `cell_annotation_blueprint_monaco`: SingleR cell annotation using Monaco reference
201+
- `sample_id_db`: Sample subdivision for internal use
202+
- `file_id_db`: File subdivision for internal use
203+
- `.sample`: Sample ID
204+
- `.sample_name`: How samples were defined
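
To illustrate how `cell_type_harmonised` and `confidence_class` relate, here is a minimal sketch of a majority vote over four annotation calls for one cell. The helper `consensus_call()` is hypothetical and not part of the package, and the mapping below class 2 (two of four agree gives 3, no agreement gives 4) is an assumption extrapolated from the "and so on" in the description above. The actual curation procedure may differ.

```r
# Hypothetical helper (not part of the package): derive a consensus label and
# an ordinal confidence class from the annotation calls for a single cell.
consensus_call <- function(calls) {
  votes <- table(calls)   # number of annotations supporting each label
  top <- max(votes)       # size of the largest agreeing group
  list(
    # Ties are resolved by taking the alphabetically first label (an assumption)
    label = names(votes)[which.max(votes)],
    # Assumed mapping: 4 of 4 agree -> 1, 3 of 4 -> 2, 2 of 4 -> 3, none -> 4
    confidence_class = length(calls) - top + 1L
  )
}

# Original annotation plus three novel annotations (made-up values)
consensus_call(c("b cell", "b cell", "b cell", "nk cell"))
```

With three of the four calls in agreement, the sketch returns the majority label together with confidence class 2, matching the definition given for `confidence_class`.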
205+
206+
# RNA abundance
207+
208+
The `raw` assay includes RNA abundance on the positive real scale (not transformed with non-linear functions, e.g. log or sqrt). The original CELLxGENE data include a mix of scales and transformations, specified in the `x_normalization` column.
177209

210+
The `cpm` assay includes counts per million.
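
For readers unfamiliar with the scale, the sketch below shows the standard counts-per-million definition in base R: each cell's counts are divided by that cell's total and multiplied by one million. The toy matrix and its values are made up for illustration, and this is not necessarily the exact code used to build the `cpm` assay.

```r
# Toy count matrix: genes in rows, cells in columns (made-up values)
counts <- matrix(
  c(10, 0, 90,   # cell_1
    5, 5, 40),   # cell_2
  nrow = 3,
  dimnames = list(c("gene_a", "gene_b", "gene_c"), c("cell_1", "cell_2"))
)

# Divide each column by its library size, then scale to one million
cpm <- sweep(counts, 2, colSums(counts), "/") * 1e6

# Every cell now sums to one million
colSums(cpm)
```

Because each column is rescaled by its own total, cells with different sequencing depth become comparable on the same scale.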
