nimsync Performance Benchmarks

Official benchmark suite following industry best practices from Tokio, Go, and Rust crossbeam.

Quick Start

# Run all SPSC benchmarks
./tests/performance/run_all_benchmarks.sh

# Run MPSC benchmarks
nim c -d:danger --opt:speed --mm:orc tests/performance/benchmark_mpsc.nim
./tests/performance/benchmark_mpsc

# Or run SPSC benchmarks individually
nim c -d:danger --opt:speed --mm:orc tests/performance/benchmark_latency.nim
./tests/performance/benchmark_latency

Benchmark Suite

SPSC Benchmarks (Single Producer Single Consumer)

1. benchmark_spsc_simple.nim - Throughput (Baseline)

What it measures: Raw SPSC channel throughput
Industry reference: Standard practice for lock-free queue benchmarking
Results: 600M+ ops/sec peak, 593M+ average
Use case: Establishes baseline performance

nim c -d:danger --opt:speed --mm:orc tests/performance/benchmark_spsc_simple.nim
./tests/performance/benchmark_spsc_simple

2. benchmark_latency.nim - Latency Distribution

What it measures: p50, p95, p99, p99.9 latency percentiles
Industry reference: HdrHistogram approach (Tokio, Netty, Cassandra)
Results: 20ns p50, 31ns p99, 50ns p99.9
Use case: Understand tail latency for latency-sensitive applications

nim c -d:danger --opt:speed --mm:orc tests/performance/benchmark_latency.nim
./tests/performance/benchmark_latency

Key metrics:

p50 (median): Typical latency
p99: 99% of operations complete within this time
p99.9: Extreme tail latency

3. benchmark_burst.nim - Burst Load Patterns

What it measures: Performance under bursty workloads
Industry reference: Redis/Memcached burst testing methodology
Results: 408M ops/sec average, 16.6% variance
Use case: Real-world applications have bursty traffic patterns

nim c -d:danger --opt:speed --mm:orc tests/performance/benchmark_burst.nim
./tests/performance/benchmark_burst

Key metrics:

Average throughput: Overall performance
Variance: Stability across different burst sizes (lower is better)

4. benchmark_sizes.nim - Buffer Size Optimization

What it measures: Impact of channel buffer size on throughput
Industry reference: LMAX Disruptor ring buffer sizing
Results: Finds optimal buffer size for your workload
Use case: Tune channel size for memory vs performance tradeoff

nim c -d:danger --opt:speed --mm:orc tests/performance/benchmark_sizes.nim
./tests/performance/benchmark_sizes

Key metrics:

Optimal size: Best performing buffer size
Efficiency curve: Performance relative to optimal

5. benchmark_stress.nim - Maximum Sustainable Load

What it measures: System limits and contention rate
Industry reference: Apache JMeter/Gatling stress testing
Results: Identifies breaking point and contention behavior
Use case: Understand system limits before production

nim c -d:danger --opt:speed --mm:orc tests/performance/benchmark_stress.nim
./tests/performance/benchmark_stress

Key metrics:

Contention rate: Failed operations percentage (lower is better)
Sustainable throughput: Maximum load before degradation

6. benchmark_sustained.nim - Long-Duration Stability

What it measures: Performance consistency over time
Industry reference: Cassandra/ScyllaDB sustained load testing
Results: Verifies no performance degradation
Use case: Detect memory leaks, GC pressure, thermal throttling

nim c -d:danger --opt:speed --mm:orc tests/performance/benchmark_sustained.nim
./tests/performance/benchmark_sustained

Key metrics:

Variance: Stability over time (< 5% is excellent)
Min/Max throughput: Performance envelope

7. benchmark_concurrent.nim - Async Performance

What it measures: Real async send/recv overhead
Industry reference: Standard async runtime benchmarking
Results: 512K ops/sec (async wrapper overhead)
Use case: Understand cost of convenience (async) vs performance (trySend/tryReceive)

nim c -r tests/performance/benchmark_concurrent.nim

Key insight: Channel itself is 600M+ ops/sec, async wrapper adds polling overhead

MPSC Benchmarks (Multi-Producer Single Consumer)

8. benchmark_mpsc.nim - Multi-Producer Performance

What it measures: MPSC channel throughput, latency, and scalability Industry reference: JCTools MPSC queue benchmarking, Disruptor patterns Results: 15M ops/sec (2 producers), 8.5M (4 producers), 5.3M (8 producers) - wait-free algorithm Use case: Concurrent producer scenarios (worker threads, event aggregation)

nim c -d:danger --opt:speed --mm:orc tests/performance/benchmark_mpsc.nim
./tests/performance/benchmark_mpsc

Benchmark suite includes:

Throughput comparison: SPSC vs MPSC with 1/2/4/8 producers
Latency measurement: Average latency across different producer counts
Scalability analysis: Fixed items per producer, measuring scalability
Size impact: Performance across buffer sizes (64/256/1024/4096)
Burst workload: Handling bursty traffic patterns

Key findings:

2 producers: Optimal sweet spot (15M ops/sec)
4 producers: Good scalability (8.5M ops/sec)
8 producers: Memory bandwidth limited (5.3M ops/sec)
Wait-free algorithm: No CAS retry loops, predictable latency
SPSC advantage: 3.5× faster in realistic threaded workloads (35M vs 10M ops/sec)

Design Principles

✅ Non-redundant: Each benchmark measures a different aspect
✅ Fast execution: All complete in <30 seconds
✅ Industry standard: Based on proven methodologies
✅ Actionable metrics: Not just throughput numbers
✅ Reproducible: Clear instructions and minimal variance

Benchmark Categories

Category	Benchmarks	Purpose
Throughput	simple, concurrent, mpsc	Raw performance numbers
Latency	latency, mpsc	Tail latency analysis
Stability	burst, sustained, mpsc	Real-world behavior
Tuning	sizes, mpsc	Optimization guidance
Limits	stress, mpsc	Breaking point analysis
Scalability	mpsc	Multi-producer scaling

CI Integration

The benchmark_spsc_simple runs automatically on every commit via GitHub Actions:

View results: https://github.com/codenimja/nimsync/actions/workflows/benchmark.yml
Download artifacts for detailed analysis

Comparison with Other Systems

To fairly compare with Go channels, Rust crossbeam, etc:

Use same hardware: Run all tests on same machine
Equivalent operations: Same send/recv patterns
Release builds: Go with -ldflags, Rust with --release
Multiple runs: Average of 3-5 runs
Report variance: Include min/max/stddev

See BENCHMARKING_STANDARDS.md for our methodology.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

nimsync Performance Benchmarks

Quick Start

Benchmark Suite

SPSC Benchmarks (Single Producer Single Consumer)

1. benchmark_spsc_simple.nim - Throughput (Baseline)

2. benchmark_latency.nim - Latency Distribution

3. benchmark_burst.nim - Burst Load Patterns

4. benchmark_sizes.nim - Buffer Size Optimization

5. benchmark_stress.nim - Maximum Sustainable Load

6. benchmark_sustained.nim - Long-Duration Stability

7. benchmark_concurrent.nim - Async Performance

MPSC Benchmarks (Multi-Producer Single Consumer)

8. benchmark_mpsc.nim - Multi-Producer Performance

Design Principles

Benchmark Categories

CI Integration

Comparison with Other Systems

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

nimsync Performance Benchmarks

Quick Start

Benchmark Suite

SPSC Benchmarks (Single Producer Single Consumer)

1. benchmark_spsc_simple.nim - Throughput (Baseline)

2. benchmark_latency.nim - Latency Distribution

3. benchmark_burst.nim - Burst Load Patterns

4. benchmark_sizes.nim - Buffer Size Optimization

5. benchmark_stress.nim - Maximum Sustainable Load

6. benchmark_sustained.nim - Long-Duration Stability

7. benchmark_concurrent.nim - Async Performance

MPSC Benchmarks (Multi-Producer Single Consumer)

8. benchmark_mpsc.nim - Multi-Producer Performance

Design Principles

Benchmark Categories

CI Integration

Comparison with Other Systems