Commit e0e0016

Merge remote-tracking branch 'origin/main'
2 parents 3a044f5 + ebb6201 commit e0e0016

4 files changed

Lines changed: 110 additions & 0 deletions

plots/boxen-basic/specification.md

Lines changed: 27 additions & 0 deletions
@@ -0,0 +1,27 @@
# boxen-basic: Basic Boxen Plot (Letter-Value Plot)

## Description

A boxen plot (also known as letter-value plot) extends the traditional box plot to show more quantile information, making it ideal for large datasets with 1000+ observations. Instead of just displaying the median and quartiles, it shows additional "letter values" (eighths, sixteenths, etc.) as nested boxes, revealing the full shape of the distribution including tail behavior. This makes outlier detection more meaningful and distribution comparison more detailed.
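
The letter values mentioned above can be computed directly from the sorted data. The following is a minimal sketch, assuming the standard depth rule for letter values (the median sits at depth `(n + 1) / 2`, and each further depth is `(floor(previous) + 1) / 2`). The function name `letter_values` and the `max_levels` parameter are hypothetical and not part of this specification; plotting libraries typically choose the number of levels with their own rules.

```python
import math


def letter_values(values, max_levels=4):
    """Return [(label, lower, upper), ...], starting with the median.

    Labels follow the usual convention: M = median, F = fourths
    (quartiles), E = eighths, D = sixteenths, and so on.
    """
    xs = sorted(values)
    n = len(xs)
    if n == 0:
        raise ValueError("need at least one value")

    def at_depth(d):
        # Depth counts from 1 inward from either end of the sorted data.
        # A depth ending in .5 averages the two neighbouring order statistics.
        lo_i = math.floor(d) - 1
        hi_i = math.ceil(d) - 1
        lower = (xs[lo_i] + xs[hi_i]) / 2
        upper = (xs[n - 1 - lo_i] + xs[n - 1 - hi_i]) / 2
        return lower, upper

    labels = ["M", "F", "E", "D", "C", "B", "A"]
    depth = (n + 1) / 2
    result = []
    for label in labels[:max_levels]:
        lower, upper = at_depth(depth)
        result.append((label, lower, upper))
        if depth <= 1:
            break  # reached the extremes; no deeper level exists
        depth = (math.floor(depth) + 1) / 2
    return result
```

Each returned pair after the median corresponds to one nested box: the `F` pair bounds the innermost box, `E` the next one out, and so on.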

## Applications

- Analyzing response time distributions across server clusters with millions of requests
- Comparing gene expression levels in large-scale genomics studies
- Quality control in manufacturing with high-volume production data
- Exploring salary or income distributions in large census datasets

## Data

- `category` (string) - group labels for comparison (optional for single distribution)
- `value` (numeric) - numerical values to plot
- Size: 1000-100000+ points per category, 1-8 categories
- Example: Server response times by endpoint, test scores by school

## Notes

- Show nested boxes representing letter values (median, quartiles, eighths, sixteenths, etc.)
- Boxes should decrease in width for deeper quantiles
- Use contrasting colors or shading to distinguish quantile levels
- Display outliers beyond the deepest letter value as individual points
- Include clear axis labels and legend explaining quantile levels
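
Two of the notes above (shrinking box widths and outliers beyond the deepest letter value) can be sketched without any plotting library. This is an illustration only: the function name `boxen_geometry` and the parameters `base_width` and `shrink` are assumptions, and the geometric shrink factor is just one possible way to make deeper boxes narrower.

```python
def boxen_geometry(levels, values, base_width=0.8, shrink=0.75):
    """Derive box widths and outliers for a boxen plot.

    levels: [(lower, upper), ...] ordered from the quartile box outward,
            for example the quartile bounds followed by the eighth bounds.
    values: the raw observations for one category.
    """
    if not levels:
        raise ValueError("need at least one letter-value level")

    boxes = []
    for depth, (lower, upper) in enumerate(levels):
        # Each deeper level is drawn narrower than the one before it.
        boxes.append(
            {"lower": lower, "upper": upper, "width": base_width * shrink**depth}
        )

    # Points outside the deepest (outermost) box are drawn individually.
    deepest_lower, deepest_upper = levels[-1]
    outliers = [v for v in values if v < deepest_lower or v > deepest_upper]
    return boxes, outliers
```

A renderer would draw each entry in `boxes` as a rectangle centered on the category position and scatter `outliers` as individual markers.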
Lines changed: 28 additions & 0 deletions
@@ -0,0 +1,28 @@
# Specification-level metadata for boxen-basic
# Auto-synced to PostgreSQL on push to main

spec_id: boxen-basic
title: Basic Boxen Plot (Letter-Value Plot)

# Specification tracking
created: 2026-01-09T07:51:15Z
updated: null
issue: 3414
suggested: MarkusNeusinger

# Classification tags (applies to all library implementations)
# See docs/reference/tagging-system.md for detailed guidelines
tags:
  plot_type:
    - boxen
    - box
  data_type:
    - numeric
    - continuous
  domain:
    - statistics
    - general
  features:
    - basic
    - distribution
    - large-data
Lines changed: 27 additions & 0 deletions
@@ -0,0 +1,27 @@
# frequency-polygon-basic: Frequency Polygon for Distribution Comparison

## Description

A frequency polygon connects the midpoints of histogram bins with straight line segments, creating a continuous outline of the distribution shape. This visualization excels at comparing multiple distributions simultaneously since lines overlap without obscuring each other, unlike stacked or overlapping histogram bars. Frequency polygons reveal differences in central tendency, spread, skewness, and modality across groups with minimal visual clutter.
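
The construction described above (bin the values, then connect the bin midpoints) can be sketched in plain Python. The function name `frequency_polygon` is hypothetical, and the convention that the last bin includes its right edge is an assumption chosen to match common histogram behavior.

```python
def frequency_polygon(values, edges):
    """Return [(bin_midpoint, count), ...] for the given bin edges.

    edges must be sorted and contain at least two entries. Bins are
    half-open [left, right), except the last bin, which also includes
    its right edge.
    """
    n_bins = len(edges) - 1
    if n_bins < 1:
        raise ValueError("need at least two bin edges")

    counts = [0] * n_bins
    for v in values:
        for i in range(n_bins):
            is_last = i == n_bins - 1
            if edges[i] <= v < edges[i + 1] or (is_last and v == edges[-1]):
                counts[i] += 1
                break  # values outside all bins are silently ignored

    midpoints = [(edges[i] + edges[i + 1]) / 2 for i in range(n_bins)]
    return list(zip(midpoints, counts))
```

Plotting the returned points as a line, one line per group, gives the frequency polygon.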

## Applications

- Comparing test score distributions across multiple classes or exam sessions
- Analyzing response time distributions between experimental conditions in psychology research
- Visualizing age distributions across different customer segments or cohorts

## Data

- `values` (numeric) - The continuous variable to bin and display
- `group` (categorical) - The grouping variable distinguishing each distribution
- Size: 50-1000 observations per group recommended; works well with 2-5 groups
- Example: Heights by age group, salaries by department, reaction times by treatment

## Notes

- Use distinct line colors and/or styles (solid, dashed) to differentiate groups
- Include a legend clearly identifying each group
- Consider adding markers at data points for small datasets
- Extend lines to zero at both ends to close the polygon shape
- Align bin edges across all groups for accurate comparison
- Semi-transparent fill beneath lines can enhance visual appeal while preserving clarity
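
Two of the notes above, shared bin edges and closing the polygon at zero, can be sketched as follows. This is a minimal illustration assuming equal-width bins; the function names `shared_edges` and `closed_polygon` are hypothetical and not part of this specification.

```python
def shared_edges(groups, n_bins):
    """Compute one set of equal-width bin edges spanning all groups.

    groups: mapping of group name -> list of numeric values.
    """
    lo = min(min(vs) for vs in groups.values())
    hi = max(max(vs) for vs in groups.values())
    if hi == lo:
        raise ValueError("all values are identical; cannot build bins")
    width = (hi - lo) / n_bins
    # Use hi itself as the final edge to avoid floating-point drift.
    return [lo + i * width for i in range(n_bins)] + [hi]


def closed_polygon(values, edges):
    """Return polygon points padded with a zero count at each end."""
    n_bins = len(edges) - 1
    counts = [0] * n_bins
    for v in values:
        for i in range(n_bins):
            is_last = i == n_bins - 1
            if edges[i] <= v < edges[i + 1] or (is_last and v == edges[-1]):
                counts[i] += 1
                break

    width = edges[1] - edges[0]
    mids = [(edges[i] + edges[i + 1]) / 2 for i in range(n_bins)]
    # One extra midpoint on each side, at zero height, closes the shape.
    xs = [mids[0] - width] + mids + [mids[-1] + width]
    ys = [0] + counts + [0]
    return list(zip(xs, ys))
```

Calling `shared_edges` once and passing the same `edges` to `closed_polygon` for every group keeps the x positions identical across lines, so differences between lines reflect the data rather than the binning.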
Lines changed: 28 additions & 0 deletions
@@ -0,0 +1,28 @@
# Specification-level metadata for frequency-polygon-basic
# Auto-synced to PostgreSQL on push to main

spec_id: frequency-polygon-basic
title: Frequency Polygon for Distribution Comparison

# Specification tracking
created: 2026-01-09T07:51:35Z
updated: null
issue: 3418
suggested: MarkusNeusinger

# Classification tags (applies to all library implementations)
# See docs/reference/tagging-system.md for detailed guidelines
tags:
  plot_type:
    - line
    - histogram
  data_type:
    - numeric
    - continuous
  domain:
    - statistics
    - general
  features:
    - basic
    - distribution
    - comparison
