Skip to content

Commit 611b3b5

Browse files
spec: add parallel-categories-basic specification (#2505)
## New Specification: `parallel-categories-basic` Related to #2501 --- ### specification.md # parallel-categories-basic: Basic Parallel Categories Plot ## Description A parallel categories plot visualizes categorical data across multiple dimensions, with vertical axes representing each categorical variable and ribbons connecting categories to show observation flow. Unlike parallel coordinates (which use lines for numeric data), parallel categories use width-proportional ribbons to show counts or frequencies, making it ideal for understanding how categorical values co-occur and flow across multiple classification dimensions. ## Applications - Analyzing customer journey paths from acquisition channel through product category to purchase outcome - Visualizing survey response patterns across multiple demographic or preference questions - Exploring classification results showing predicted vs actual categories with feature breakdowns - Understanding multi-stage process flows like support ticket routing through departments ## Data - `dimension_1` through `dimension_n` (categorical) - Multiple categorical variables for each observation - `count` (numeric, optional) - Observation count or weight for aggregated data - Size: 3-6 categorical dimensions, each with 2-8 unique values - Example: Titanic survival data with class, sex, age group, and survival status ## Notes - Ribbon width should be proportional to observation count or frequency - Color by first dimension (source) or last dimension (outcome) for clarity - Interactive highlighting helps trace paths through multiple categories - Consider ordering categories within each dimension to minimize ribbon crossings - Too many categories per dimension reduces readability; aggregate rare values if needed --- **Next:** Add `approved` label to the issue to merge this PR. --- :robot: *[spec-create workflow](https://github.com/MarkusNeusinger/pyplots/actions/runs/20585343433)* Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
1 parent 19aa775 commit 611b3b5

2 files changed

Lines changed: 57 additions & 0 deletions

File tree

Lines changed: 27 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,27 @@
1+
# parallel-categories-basic: Basic Parallel Categories Plot
2+
3+
## Description
4+
5+
A parallel categories plot visualizes categorical data across multiple dimensions, with vertical axes representing each categorical variable and ribbons connecting categories to show observation flow. Unlike parallel coordinates (which use lines for numeric data), parallel categories use width-proportional ribbons to show counts or frequencies, making it ideal for understanding how categorical values co-occur and flow across multiple classification dimensions.
6+
7+
## Applications
8+
9+
- Analyzing customer journey paths from acquisition channel through product category to purchase outcome
10+
- Visualizing survey response patterns across multiple demographic or preference questions
11+
- Exploring classification results showing predicted vs actual categories with feature breakdowns
12+
- Understanding multi-stage process flows like support ticket routing through departments
13+
14+
## Data
15+
16+
- `dimension_1` through `dimension_n` (categorical) - Multiple categorical variables for each observation
17+
- `count` (numeric, optional) - Observation count or weight for aggregated data
18+
- Size: 3-6 categorical dimensions, each with 2-8 unique values
19+
- Example: Titanic survival data with class, sex, age group, and survival status
20+
21+
## Notes
22+
23+
- Ribbon width should be proportional to observation count or frequency
24+
- Color by first dimension (source) or last dimension (outcome) for clarity
25+
- Interactive highlighting helps trace paths through multiple categories
26+
- Consider ordering categories within each dimension to minimize ribbon crossings
27+
- Too many categories per dimension reduces readability; aggregate rare values if needed
Lines changed: 30 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,30 @@
1+
# Specification-level metadata for parallel-categories-basic
2+
# Auto-synced to PostgreSQL on push to main
3+
4+
spec_id: parallel-categories-basic
5+
title: Basic Parallel Categories Plot
6+
7+
# Specification tracking
8+
created: 2025-12-29T23:55:26Z
9+
updated: null
10+
issue: 2501
11+
suggested: MarkusNeusinger
12+
13+
# Classification tags (applies to all library implementations)
14+
# See docs/concepts/tagging-system.md for detailed guidelines
15+
tags:
16+
plot_type:
17+
- parallel-categories
18+
- flow
19+
- categorical
20+
data_type:
21+
- categorical
22+
- frequency
23+
domain:
24+
- general
25+
- business
26+
- statistics
27+
features:
28+
- basic
29+
- proportional
30+
- multi-dimensional

0 commit comments

Comments
 (0)