You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
title: "Visualizing Complexity: A Universal Ecosystem for Multi-Scale Scientific Discovery"
4
4
---
5
5
6
6
@@ -13,49 +13,90 @@ title: "Data Visualization"
13
13
</style>
14
14
15
15
16
-
## 1. A Grammar of Graphics for base plot
16
+
## Defining the Standards: A Universal Framework for Multi-Scale Scientific Visualization
17
17
18
-
plotbb
18
+
Modern scientific discovery relies on the ability to transform high-dimensional data into intuitive, accurate, and reproducible visual narratives. My work has established a **comprehensive visualization ecosystem** that spans from foundational plotting algebra to specialized interpretation frameworks for phylogeny and functional omics.
19
19
20
-
## 2. Align plots with axis reconciliation
20
+
## 1. Methodological Foundations: Bridging Systems and Logics
21
21
22
-
aplot
23
22
24
-
## 3. Supports image files and graphic objects in 'ggplot2'
25
23
26
-
ggimage
27
-
28
-
## 4. Text halos on plot lables
24
+
To address the fragmentation of the R plotting landscape, I developed a suite of tools that unify disparate plotting systems and introduce rigorous layout algebra.
29
25
26
+
***`plotbb` & `ggplotify` (System Interoperability):**`plotbb` brings the Grammar of Graphics to Base R, while `ggplotify` allows researchers to convert virtually any plot object (Base, Lattice, pheatmap, etc.) into a `ggplot2` compatible object, enabling unified themeing and assembly.
27
+
***`aplot` & `aplotExtra` (Layout Algebra):** These packages introduce an algebraic approach to plot alignment, allowing heterogeneous subplots to reconcile their coordinate systems automatically.
28
+
***`ggbreak` (Dynamic Range Logic):** Provides a seamless, non-destructive method for axis breaks, essential for visualizing datasets with extreme outliers or multi-scale distributions.
29
+
***`ggtangle` & `ggfun` (Network & UX):**`ggtangle` reimagines network visualization within the tidy framework, while `ggfun` provides foundational utilities that enhance the developer and user experience across the ecosystem.
<strong>Logic Unification:</strong> By treating plots as first-class algebraic objects, these tools allow for the "compositional" creation of complex figures that were previously impossible or required significant manual effort.
## 2. Specialized Academic Domains: Deep Integration
36
43
37
44
38
-
</td>
39
-
<tdstyle="border:none;">
40
45
41
-
shadowtext, meme
46
+
Beyond general-purpose utilities, I have pioneered visualization standards in specific high-impact biological domains.
42
47
48
+
***Evolutionary Narratives ([Phylogenetic Contribution](file:///e:/git/mybook/yulab-smu.github.io/content/contribution-tree-data.md)):** Led by **`ggtree`**, **`ggtreeExtra`**, **`ggtreeSpace`**, and **`ggtreeDendro`**, this suite has redefined how tree-structured data—ranging from deep evolutionary relationships to hierarchical clustering (dendrograms)—is integrated with multi-omics layers, becoming the de facto global standard.
49
+
***Functional Discovery ([Knowledge Mining Contribution](file:///e:/git/mybook/yulab-smu.github.io/content/contribution-knowledge-mining.md)):****`enrichplot`** transforms abstract enrichment results into statistically rigorous and biologically intuitive visual insights, enabling the automated interpretation of massive omics datasets.
50
+
***Sequence & Genomic Landscapes:****`seqcombo`** and **`ggmsa`** provide a modular grammar for multiple sequence alignment and the visualization of **genomic reassortment** events. These tools facilitate the visual exploration of segmental exchanges and complex evolutionary associations, ensuring that structural and genomic conservation is accessible at multiple scales.
51
+
***Single-Cell & Fine-Scale Omics:****`ggsc`** and **`ivolcano`** address the unique needs of high-resolution data, providing specialized geometries and interactive exploration for single-cell clusters and differential expression.
<strong>Domain Leadership:</strong> These tools are not mere plotting scripts but are **interpretative frameworks** cited in thousands of studies across <em>Nature</em>, <em>Science</em>, and <em>Cell</em>.
44
60
</td>
45
61
</tr>
46
62
</table>
47
63
64
+
## 3. Semantic Enrichment & Scientific Communication
48
65
49
66
50
67
68
+
To bridge the gap between abstract data and human intuition, I developed tools for semantic enrichment and professional branding.
51
69
70
+
***`ggimage` & `scatterpie`:** Extending the visual vocabulary of `ggplot2` to include external imagery and composite geometries (like pie-charts within coordinates), allowing for "presentation-ready" visuals.
71
+
***`emojifont`, `shadowtext`, & `meme`:** Enhancing semantic storytelling through advanced typography and cultural icons. These tools improve optical clarity through text halos and allow for creative, engaging data interaction, bridging the gap between formal analysis and impactful communication.
72
+
***`hexSticker`:** Revolutionizing how R developers brand their work. `hexSticker` has established the "Hex Logo" as the universal symbol of professional R package development.
<strong>Scientific Communication:</strong> These tools empower researchers to communicate complex data with clarity, impact, and professional polish, fostering broader adoption of open-source science.
81
+
</td>
82
+
</tr>
83
+
</table>
54
84
85
+
## Community Impact & Influence
86
+
87
+
My visualization frameworks have achieved global adoption, with **multi-million downloads** and integration into hundreds of third-party packages. This work serves as the foundational infrastructure for modern bioinformatics visualization workflows, cited in top-tier journals world-wide.
88
+
89
+
---
90
+
91
+
## Selected Publications
55
92
56
93
<fontsize="3">
57
94
58
-
+ S Xu<sup>#</sup>, M Chen<sup>#</sup>, T Feng, L Zhan, L Zhou, __G Yu__<sup>\*</sup>. [Use ggbreak to effectively utilize plotting space to deal with large datasets and outliers](https://www.frontiersin.org/articles/10.3389/fgene.2021.774846). __*Frontiers in Genetics*__. 2021, 12:774846.
95
+
+ S Xu, M Chen, T Feng, L Zhan, L Zhou, __G Yu__<sup>\*</sup>. [Use ggbreak to effectively utilize plotting space to deal with large datasets and outliers](https://www.frontiersin.org/articles/10.3389/fgene.2021.774846). __*Frontiers in Genetics*__. 2021, 12:774846.
96
+
97
+
+__G Yu__. Data Integration, Manipulation and Visualization of Phylogenetic Trees (1st edition). __*Chapman and Hall/CRC*__, 2022. doi: [10.1201/9781003279242](https://doi.org/10.1201/9781003279242)
59
98
99
+
+ S Xu, H Dai, X Bo, __G Yu__<sup>\*</sup>. [ggmsa: a visual exploration tool for multiple sequence alignment and associated data](https://doi.org/10.1093/bib/bbab222). __*Briefings in Bioinformatics*__. 2021, 22(6):bbab222.
0 commit comments