Skip to content

Commit 73d68b2

Browse files
committed
raw text
1 parent 8ec3397 commit 73d68b2

1 file changed

Lines changed: 126 additions & 0 deletions

File tree

content/blog/2025-07-biomni.md

Lines changed: 126 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,126 @@
1+
+++
2+
title = "scverse × Biomni: Agentic Single-Cell Omics Analysis"
3+
date = 2025-07-28T00:00:00+00:00
4+
description = "tbd"
5+
author = "scverse team"
6+
draft = false
7+
+++
8+
9+
![][image1]
10+
11+
# scverse × Biomni: Democratizing Single-Cell Omics Analysis
12+
13+
Single-cell and spatial omics have unlocked unprecedented insights into cellular diversity, tissue architecture, and drug responses. Yet today, most researchers must stitch together fragmented code across multiple packages—writing custom scripts for data loading, normalization, clustering, spatial feature extraction, perturbation modeling, and visualization. This process demands deep programming expertise, detailed knowledge of each library’s API, and often takes days even weeks to build, debug, and reproduce.
14+
15+
**scverse** is a community-driven and open-source initiative behind many of the most widely adopted Python tools in single-cell biology, including scanpy, scvi-tools, squidpy, and more. Built on shared data structures, anndata, mudata, and SpatialData, scverse promotes modular, interoperable, and scalable analysis across modalities, from transcriptomics to spatial and immune profiling.
16+
17+
We’re excited to announce a partnership with **scverse** to fundamentally change how scientists interact with single-cell and spatial omics tools. Biomni now acts as an **intelligent agent for the scverse ecosystem**, capable of writing, executing, and managing code across ten core scverse packages on your behalf. Researchers can describe their analysis goals in plain English—e.g., *“cluster cells and identify markers”* or *“analyze perturbation effects between treatment groups”*—and Biomni automatically generates and runs the corresponding scanpy, pertpy, squidpy, or scvi-tools code. The agent understands biological context, handles parameters and dependencies, and returns reproducible results—no coding required. Notably, all codes written by agents are documented and available to users for reproduction or modification as a Jupyter notebook. This is an early-stage capability, and manual review of outputs is encouraged to ensure accuracy, and we invite constructive community feedback to improve.
18+
19+
---
20+
21+
## **Conduct Single-cell Analyses with Natural Language**
22+
23+
To illustrate, here are a few examples of how Biomni equipped with scverse packages can enable efficient analysis:
24+
25+
* *“Annotate cell types in this adata.h5ad, run Leiden clustering, and show a UMAP colored by CD3 and CD19 expression.”*
26+
27+
→ Biomni invokes scanpy for preprocessing, clustering, annotation, and visualization.
28+
29+
* *“Perform compositional analysis with scCODA in this perturb-seq dataset.”*
30+
31+
→ Biomni uses pertpy to perform scCODA compositional analysis.
32+
33+
* *“Detect spatially variable genes in this Visium dataset and overlay results on the H\&E image.”*
34+
35+
→ Biomni leverages squidpy and spatialdata for spatial feature detection and plotting.
36+
37+
---
38+
39+
## **Orchestrate Multi-Step Complex Single-Cell Workflows**
40+
41+
Moving beyond the agentic delegation of individual single-cell tasks, Biomni also enables scientists to run multi-step, cross-tool workflows using natural language. Instead of writing and debugging code across several libraries, you can now execute complex pipelines with a single prompt.
42+
43+
**Input**: A Visium spatial transcriptomics dataset in .h5ad format, aligned with histology images and metadata (e.g., treatment condition).
44+
45+
**Example prompt**:
46+
47+
*“Run a complete single‑cell + spatial workflow on the attached Visium .h5ad using the scverse stack—scanpy for QC/normalization, scvi‑tools for integration and clustering, celltypist for label transfer, scvelo for RNA‑velocity trajectories, squidpy for spatial interactions. Contrast treatment conditions, highlight driver pathways, and return key plots plus a brief HTML report.”*
48+
49+
**Plans composed by Biomni:**
50+
51+
- [] Load and explore the Visium .h5ad data
52+
- [] Scanpy QC and normalization (completed)
53+
- [] scvi-tools integration and clustering (completed)
54+
- [] Celltypist label transfer (completed)
55+
- [] scvelo RNA velocity trajectories (completed)
56+
- [] Squidpy spatial interactions (completed)
57+
- [] Driver pathway identification (completed)
58+
- [] Generate comprehensive HTML report (completed)
59+
60+
![][image2]
61+
62+
Biomni has successfully completed these steps in around 20 minutes. It preprocesses the data, identifies 21 distinct cell types from 17,771 high-quality cells, and maps them across spatial axes with clear anterior-posterior and dorsal-ventral organization. It uncovers major tissue compartments, quantifies gene expression variability (e.g., Slc4a1, T, Mesp2), and performs pathway enrichment, highlighting nervous system and cation channel activity. Biomni also infers cell-cell interactions, such as cardiomyocyte clustering and endothelial co-localization with hematopoietic progenitors. This entire sequence is automatically orchestrated by the agent, enabling reproducible and modular single-cell workflows without writing a single line of code.
63+
64+
---
65+
66+
### **Explore Single-cell Data to Generate New Hypotheses**
67+
68+
Biomni empowers researchers to go beyond standard analysis, treating single-cell and spatial omics datasets as hypothesis-generating engines. Through open-ended, natural language prompts, scientists can uncover patterns, outliers, and biological structure that spark new lines of inquiry.
69+
70+
Whether you’re examining cellular heterogeneity across disease states or discovering unexpected spatial niches, Biomni supports exploratory analysis guided by curiosity and domain expertise.
71+
72+
**Input:**
73+
74+
A spatial transcriptomics dataset in .h5ad format generated from seqFISH, with immune-related gene expression.
75+
76+
**Prompt:**
77+
78+
“Given the attached seqFISH data, explore it freely—analyze spatial and transcriptional patterns, identify interesting structures or cell populations, and generate hypotheses worth experimental follow-up.”
79+
80+
**What Biomni does:**
81+
82+
- [] Load and examine the seqFISH data structure
83+
- [] Explore basic data characteristics (genes, cells, spatial coordinates)
84+
- [] Visualize spatial distribution of cells
85+
- [] Perform quality control and filtering
86+
- [] Identify cell clusters and types
87+
- [] Analyze spatial organization patterns
88+
- [] Identify spatially variable genes
89+
- [] Generate hypotheses for experimental follow-up
90+
- [] Create comprehensive visualizations and summary
91+
92+
**Hypothesis generated by Biomni:**
93+
94+
| Hypothesis | Observation | Proposed Experiment | Expected Outcome |
95+
96+
|------------------------------------|-----------------------------------------------------------------------------|----------------------------------------------------------|----------------------------------------------------------------------|
97+
98+
| **Cardiac Development** | Cardiomyocytes show tight spatial clustering with high expression of Ttn, Hcn4 | Single-cell RNA-seq time course to track cardiac differentiation | Identify cardiac progenitor trajectories and developmental timing |
99+
100+
| **Neural Crest Migration** | Neural crest cells show dispersed distribution from CNS with high Sox10 expression | Live imaging of neural crest cell migration | Map migration routes and timing in real-time |
101+
102+
| **Hematopoietic Niche** | Endothelial cells co-localize with hematoendothelial progenitors | Lineage tracing of endothelial-hematopoietic transition | Identify hemogenic endothelium and transition timing |
103+
104+
| **Anterior-Posterior Patterning** | Clear spatial gradients in developmental gene expression | Perturbation of Wnt/FGF signaling with spatial readout | Identify morphogen sources and target regions |
105+
106+
| **Mesoderm Compartmentalization** | Multiple mesoderm subtypes show distinct spatial domains | Fate mapping of early mesoderm specification | Understand mesoderm subdivision mechanisms |
107+
108+
By combining data-driven discovery with biological reasoning, Biomni helps researchers ask better questions—and identify novel directions for experimental validation.
109+
110+
**[VIDEO: Spatial Hypothesis Generation](https://drive.google.com/file/d/1B7CLwDRLOor8WJNtL_MdUGEHZduIJIX5/view?usp=sharing)**
111+
112+
---
113+
114+
## **Access scverse through Biomni**
115+
116+
* **Biomni Web Platform**
117+
118+
Issue natural-language commands in [biomni.stanford.edu](http://biomni.stanford.edu) to load files, run workflows, and inspect results interactively—no code or infrastructure setup needed.
119+
120+
* **Biomni Open-Source Integration**
121+
122+
Clone the Biomni GitHub repository, add the new scverse dependencies, and run analyses on your own compute. This approach gives you full control over resources and data.
123+
124+
---
125+
126+
By wrapping scanpy, muon, anndata, mudata, scvi-tools, pertpy, decoupler, squidpy, spatialdata, and scirpy into an AI agent, Biomni × scverse removes the coding hurdle from single-cell and spatial omics research. Simply ask your question—and focus on biology, not boilerplate code. While the long-term vision is for agents to autonomously handle full experimental designs, human-in-the-loop remains critical today. We can’t wait to see the discoveries you make.

0 commit comments

Comments
 (0)