Skip to content

[scatter-embedding] t-SNE and UMAP Embedding Visualization #5236

@MarkusNeusinger

Description

@MarkusNeusinger

Description

A scatter-style visualization showing high-dimensional data projected into 2D space using t-SNE or UMAP dimensionality reduction. Points are colored by cluster or class label, revealing groupings and structure in the data. This is a standard tool in machine learning for exploring embeddings, single-cell RNA-seq data, and NLP document clustering.

Applications

  • Visualizing clusters in single-cell RNA-seq data (bioinformatics)
  • Exploring word or document embeddings from NLP models
  • Inspecting latent space structure of autoencoders or VAEs
  • Quality check after K-means or DBSCAN clustering

Data

  • x (float) — first embedding dimension
  • y (float) — second embedding dimension
  • label (categorical) — cluster or class assignment
  • Size: 500–5000 points typical

Notes

  • Show cluster labels as colored point groups with legend
  • Optionally annotate cluster centroids
  • Perplexity / n_neighbors parameter note in subtitle

Metadata

Metadata

Assignees

No one assigned

    Labels

    approvedApproved for implementationspec-readySpecification merged to main

    Projects

    Status

    Todo

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions