
""" examples/README.md

Worked examples and tutorials for StructureFunctions.jl """

Examples: Getting Started with StructureFunctions.jl

This directory contains runnable examples demonstrating different aspects of structure function computation in StructureFunctions.jl.

Quick Start

1. Simple 2D Calculation (`simple_2d.jl`)

Best for: First-time users, understanding basics

julia examples/simple_2d.jl

What it does:

Generates synthetic 2D turbulent velocity data (256×256 grid)
Computes 2nd-order structure functions
Validates K41 scaling ($S_2(r) \sim r^{2/3}$)
Produces visualization (plot saved)

Key concepts:

Regular grids and structure functions
Spectral scaling laws
Visualization of results
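
The scaling check can be sketched without the package at all. The snippet below is a minimal illustration using only the Statistics standard library; `structure_function_2` and `fit_exponent` are hypothetical helper names, not part of the StructureFunctions.jl API, and the signal is 1D rather than the 2D field used by `simple_2d.jl`. It fits the log-log slope of $S_2(r)$ for a linear ramp (exponent 2) and for an exact $r^{2/3}$ power law.

```julia
using Statistics

# Second-order structure function of a 1D signal on a uniform grid:
# S2(r) = mean over i of (u[i + r] - u[i])^2, for integer lags r.
function structure_function_2(u::AbstractVector, lags)
    return [mean(abs2, @view(u[1+r:end]) .- @view(u[1:end-r])) for r in lags]
end

# Least-squares slope of log(S2) against log(r), i.e. the scaling exponent.
function fit_exponent(r, s2)
    lx, ly = log.(r), log.(s2)
    return cov(lx, ly) / var(lx)
end

lags = 1:50

# Check 1: a linear ramp has increments proportional to r, so S2 ~ r^2.
ramp = collect(0.0:0.1:100.0)
exponent_ramp = fit_exponent(lags, structure_function_2(ramp, lags))

# Check 2: an exact K41 power law S2 = C r^(2/3) recovers the 2/3 exponent.
exponent_k41 = fit_exponent(lags, 2.0 .* lags .^ (2 / 3))

println("ramp exponent: ", round(exponent_ramp; digits = 3))
println("K41 exponent:  ", round(exponent_k41; digits = 3))
```

With real or synthetic turbulence data the fit should be restricted to the inertial range, since the exponent drifts away from 2/3 at the smallest and largest separations.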

Learn next: docs/theory.md for physics background

2. Threaded Parallelization (`threaded_calculation.jl`)

Best for: Medium datasets (10M–500M points), multi-core machines

# Set thread count
JULIA_NUM_THREADS=4 julia examples/threaded_calculation.jl

# Or let Julia auto-detect
JULIA_NUM_THREADS=auto julia examples/threaded_calculation.jl

What it does:

Generates 50M points (scalable based on CPU cores)
Compares serial vs threaded execution
Measures speedup and parallel efficiency
Demonstrates best practices

Key metrics:

Speedup vs serial: typically 2–8x on 4–32 cores
Efficiency: depends on memory bandwidth
Optimal for cache-resident data
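
As a rough sketch of what a serial-versus-threaded comparison involves, the code below bins squared increments with plain `Base.Threads`, giving each task its own accumulator so no locking is needed. It does not use ThreadedBackend or OhMyThreads; `accumulate_pairs!`, `sf2_serial`, and `sf2_threaded` are hypothetical names introduced for illustration, and the data set is tiny compared with the example script.

```julia
using Base.Threads

# Add squared increments of `u` to separation bins with edges `edges`.
# Each row i in `rows` is paired with every j > i.
function accumulate_pairs!(sums, counts, x, u, edges, rows)
    for i in rows, j in i+1:length(x)
        b = searchsortedlast(edges, abs(x[j] - x[i]))  # bin index of the separation
        if 1 <= b < length(edges)
            sums[b] += (u[j] - u[i])^2
            counts[b] += 1
        end
    end
    return sums, counts
end

function sf2_serial(x, u, edges)
    nb = length(edges) - 1
    sums, counts = accumulate_pairs!(zeros(nb), zeros(Int, nb), x, u, edges, eachindex(x))
    return sums ./ counts
end

function sf2_threaded(x, u, edges; ntasks = nthreads())
    nb = length(edges) - 1
    # Interleaved rows (t, t + ntasks, ...) give each task a similar number of pairs.
    tasks = map(1:ntasks) do t
        @spawn accumulate_pairs!(zeros(nb), zeros(Int, nb), x, u, edges, t:ntasks:length(x))
    end
    partials = fetch.(tasks)
    return sum(first, partials) ./ sum(last, partials)
end

x = collect(range(0, 1; length = 400))
u = sin.(2π .* x)
edges = 0:0.1:1

sf2_serial(x, u, edges); sf2_threaded(x, u, edges)   # warm-up (compilation)
t_serial = @elapsed s_serial = sf2_serial(x, u, edges)
t_threaded = @elapsed s_threaded = sf2_threaded(x, u, edges)

speedup = t_serial / t_threaded
efficiency = speedup / nthreads()
println("threads: ", nthreads(), "  speedup: ", round(speedup; digits = 2),
        "  efficiency: ", round(efficiency; digits = 2))
```

On a problem this small the task overhead can outweigh the gain, so the printed speedup is only meaningful for much larger inputs.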

Advanced: Try different dataset sizes; plot speedup curve vs thread count

Learn next: docs/backends.md for backend comparison

3. GPU Acceleration (`gpu_acceleration.jl`)

Best for: Large datasets (>500M points), with GPU hardware

# Requires CUDA.jl + KernelAbstractions.jl
julia --project -e 'using Pkg; Pkg.add(["CUDA", "KernelAbstractions"])'

# Run example (same project environment as the install step above)
julia --project examples/gpu_acceleration.jl

What it does:

Generates 1 billion points (or 10M if GPU unavailable)
Uses GPUBackend for computation
Measures memory efficiency of Float32 vs Float64
Estimates speedup

Expected results:

NVIDIA A100: 20–50x faster than single CPU
RTX 4090: 10–30x faster than CPU
AMD MI250X: 15–80x faster than CPU (AMD GPUs need AMDGPU.jl in place of CUDA.jl)

Key insights:

Float32 sufficient for turbulence (saves 50% memory)
GPU shines for >1B points
Kernel compilation amortized over many calls
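
The Float32 trade-off can be checked on the CPU before committing to a GPU run. The sketch below uses only standard libraries and a synthetic random-walk signal (an assumption, not the data used by `gpu_acceleration.jl`); `s2` is a hypothetical one-lag helper. It compares the memory footprint and the relative difference of a second-order structure function in the two precisions.

```julia
using Random, Statistics

Random.seed!(42)
n = 1_000_000
u64 = cumsum(randn(n)) ./ sqrt(n)   # synthetic random-walk signal in Float64
u32 = Float32.(u64)                  # the same signal in single precision

# Float32 stores each value in 4 bytes instead of 8.
bytes64, bytes32 = sizeof(u64), sizeof(u32)

# Second-order structure function at a single integer lag r.
s2(u, r) = mean(abs2, @view(u[1+r:end]) .- @view(u[1:end-r]))

rel_err = abs(s2(u32, 100) - s2(u64, 100)) / s2(u64, 100)
println("memory: ", bytes64 ÷ 10^6, " MB vs ", bytes32 ÷ 10^6, " MB")
println("relative difference in S2: ", rel_err)
```

The difference grows when increments are very small compared with the signal itself, so it is worth repeating this check at the smallest separation of interest.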

Learn next: docs/backends.md#gpubackend, docs/real_data.md#performance-tips

4. Distributed Computing (`distributed_parallel.jl`)

Best for: Massive datasets across clusters (>1B points, multi-node)

# Local cluster (spawn 4 processes)
julia examples/distributed_parallel.jl

# Custom process count
julia examples/distributed_parallel.jl 8

# Or via Distributed.jl
julia -p 8 examples/distributed_parallel.jl

What it does:

Starts multiple Julia processes
Keeps each process's share of the data local (no copies between processes)
Computes structure functions in parallel
Demonstrates SLURM submission script

HPC Usage:

# Save the SLURM template:
# sbatch submit_distributed.sbatch

# Scales to 100s of processes on HPC clusters

Scaling characteristics:

Near-linear speedup with N nodes (for large problems)
Suitable for >500M points
Minimal communication overhead if data is local
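
The general pattern of computing partial sums on workers and reducing them on the main process can be sketched with the Distributed standard library alone. This is an illustration under stated assumptions: `partial_sf2` and `distributed_sf2` are hypothetical names, not the DistributedBackend implementation, and for simplicity the full arrays are sent to every worker, which is not the no-copy layout described above.

```julia
using Distributed
addprocs(2)   # two local worker processes; a cluster manager would supply these on HPC

# Defined on every process: bin squared increments for the rows assigned to one worker.
@everywhere function partial_sf2(x, u, edges, rows)
    nb = length(edges) - 1
    sums, counts = zeros(nb), zeros(Int, nb)
    for i in rows, j in i+1:length(x)
        b = searchsortedlast(edges, abs(x[j] - x[i]))
        if 1 <= b <= nb
            sums[b] += (u[j] - u[i])^2
            counts[b] += 1
        end
    end
    return (sums = sums, counts = counts)
end

function distributed_sf2(x, u, edges)
    nw = nworkers()
    # Worker t handles rows t, t + nw, t + 2nw, ... so every pair is counted once.
    partials = pmap(t -> partial_sf2(x, u, edges, t:nw:length(x)), 1:nw)
    sums = sum(p -> p.sums, partials)
    counts = sum(p -> p.counts, partials)
    return sums ./ counts, counts
end

x = collect(range(0, 1; length = 300))
u = sin.(2π .* x)
sf2, counts = distributed_sf2(x, u, 0:0.1:1)
println("pairs binned: ", sum(counts))

rmprocs(workers())   # shut the workers down again
```

Only the per-bin sums and counts travel back to the main process, which is why communication stays small as long as the input data is already available on each worker.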

Learn next: docs/backends.md#distributedbackend

5. Real Climate Data (`real_data_climate.jl`)

Best for: Practical atmospheric/ocean applications

# Optional: install NetCDF support
julia -e 'using Pkg; Pkg.add("NetCDF")'

# Run with synthetic data (generates demo)
julia examples/real_data_climate.jl

What it does:

Simulates realistic atmospheric turbulence data
Handles missing data (NaNs) and outliers
Constructs 3D coordinate systems
Performs K41 scaling analysis
Multi-time-step statistics

Key workflows:

Data loading (NetCDF, HDF5, CSV)
Preprocessing (unit conversion, detrending)
Structure function calculation
K41 validation
Temporal statistics
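
The preprocessing steps can be sketched on a synthetic series with only the Statistics standard library. `detrend` and `is_outlier` are hypothetical helpers written for this illustration (they are not functions from StructureFunctions.jl or `real_data_climate.jl`), and the 5-MAD threshold is an arbitrary assumption.

```julia
using Statistics

# Remove a least-squares linear trend from a series u sampled at times t.
detrend(t, u) = u .- (mean(u) .+ (cov(t, u) / var(t)) .* (t .- mean(t)))

# Flag values more than k median absolute deviations away from the median.
function is_outlier(u; k = 5)
    m = median(u)
    mad = median(abs.(u .- m))
    return abs.(u .- m) .> k * mad
end

# Synthetic series: linear trend plus oscillation, one missing value, one spike.
t = collect(0.0:0.1:100.0)
u = 0.5 .* t .+ sin.(t)
u[100] = NaN
u[500] = 1.0e3

valid = .!isnan.(u)                    # 1. drop missing values
t1, u1 = t[valid], u[valid]
keep = .!is_outlier(u1)                # 2. drop outliers
t_clean, u_clean = t1[keep], u1[keep]
u_detr = detrend(t_clean, u_clean)     # 3. remove the linear trend

println("kept ", length(u_clean), " of ", length(u), " samples")
```

Coordinates and values are filtered with the same masks so that they stay aligned before being passed to a structure function calculation.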

Real data sources:

SOCRATES campaign: https://data.eol.ucar.edu/
ERA5 reanalysis: https://cds.climate.copernicus.eu/
MERRA-2: https://gmao.gsfc.nasa.gov/reanalysis/MERRA-2/
WRF simulations: Custom NetCDF output

Learn next: docs/real_data.md

Advanced Workflows

Combining Backends

Scenario: Process each file with GPU, aggregate across time

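using Distributed
@everywhere using StructureFunctions

# load_time_slice and construct_coordinates are user-supplied helpers that must
# be defined on every worker (@everywhere); n_time_steps and bins come from the caller.
results = map(1:n_time_steps) do t
    # Workers split time step t between them; (+) sums their partial results,
    # which assumes the result type supports addition
    @distributed (+) for i in workers()
        data_i = load_time_slice(t, i)
        x, u = construct_coordinates(data_i)
        calculate_structure_function(x, u, bins; backend=GPUBackend())
    end
end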

Batch Processing Out-of-Core

Scenario: Dataset too large for memory (terabytes)

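# Process in streaming chunks.
# read_chunks and extract_data are user-supplied helpers; n_bins is the number of bins.
# Pairs that span two chunks are never counted, so statistics at separations
# comparable to the chunk size are undersampled.
all_sums = zeros(n_bins)
all_counts = zeros(Int, n_bins)

for chunk in read_chunks("huge_data.zarr")
    x, u = extract_data(chunk)
    result = calculate_structure_function(x, u, bins; backend=AutoBackend())
    all_sums .+= result.sums
    all_counts .+= result.counts
end

# Normalize at end (bins that received no pairs come out as NaN)
final_sf = all_sums ./ all_counts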
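
One way to avoid losing pairs at chunk boundaries is to read each chunk together with a short "halo" taken from the start of the next one. The sketch below shows the idea for a 1D signal on a uniform grid with integer lags; `accumulate_chunk!` and `streaming_sf2` are hypothetical names, and the signal is held in memory here only so the result is easy to check.

```julia
# Add one chunk's contribution to the running sums/counts for lags 1:length(sums).
# Only pairs whose left point is among the first `n_own` samples are counted;
# the remaining samples are the halo borrowed from the next chunk.
function accumulate_chunk!(sums, counts, chunk, n_own)
    for r in eachindex(sums)
        for i in 1:min(n_own, length(chunk) - r)
            sums[r] += (chunk[i + r] - chunk[i])^2
            counts[r] += 1
        end
    end
    return nothing
end

function streaming_sf2(u, max_lag, chunk_len)
    sums, counts = zeros(max_lag), zeros(Int, max_lag)
    for start in 1:chunk_len:length(u)
        stop_own = min(start + chunk_len - 1, length(u))
        stop_halo = min(stop_own + max_lag, length(u))   # extend by the largest lag
        accumulate_chunk!(sums, counts, @view(u[start:stop_halo]), stop_own - start + 1)
    end
    return sums ./ counts, counts
end

u = sin.(0.01 .* (1:10_000))
sf2, counts = streaming_sf2(u, 50, 1_000)
println("pairs at lag 1: ", counts[1], ", at lag 50: ", counts[50])
```

Because every pair is owned by exactly one chunk, the accumulated counts match a single pass over the whole signal.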

Multi-Scale Nested Analysis

Scenario: Analyze turbulence at multiple length scales

# Coarse grid: long-range structure
bins_coarse = 100:500:100_000
result_coarse = calculate_structure_function(x, u, bins_coarse)

# Fine grid: local correlation
bins_fine = 1:0.5:100
result_fine = calculate_structure_function(x, u, bins_fine)

# Compare scaling exponents
alpha_coarse = estimate_exponent(result_coarse)
alpha_fine = estimate_exponent(result_fine)
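
`estimate_exponent` is not defined in this README. One plausible reading is a least-squares fit of $\log S_2$ against $\log r$, sketched below with a hypothetical `fit_power_law` helper and a synthetic model curve (an assumption chosen so that the two ranges have known exponents: 2 at small separations and 2/3 at large ones).

```julia
using LinearAlgebra

# Least-squares fit of log S = log C + α log r; returns the exponent α.
function fit_power_law(r, s)
    A = [ones(length(r)) log.(r)]
    coeffs = A \ log.(s)
    return coeffs[2]
end

# Synthetic structure function: ~ r^2 for r ≪ 1 and ~ r^(2/3) for r ≫ 1.
model_sf2(r) = r^2 / (1 + r^2)^(2 / 3)

r_fine = exp10.(range(-3, -1; length = 30))
r_coarse = exp10.(range(1, 3; length = 30))

alpha_fine = fit_power_law(r_fine, model_sf2.(r_fine))
alpha_coarse = fit_power_law(r_coarse, model_sf2.(r_coarse))

println("fine-range exponent:   ", round(alpha_fine; digits = 3))
println("coarse-range exponent: ", round(alpha_coarse; digits = 3))
```

Fitting each range separately makes a change of exponent between scales visible, which a single fit over all bins would average away.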

Running All Examples

# Prerequisites
julia --project -e 'using Pkg; Pkg.instantiate()'

# Run all in sequence (same project environment as the instantiate step)
for script in simple_2d threaded_calculation gpu_acceleration distributed_parallel real_data_climate; do
    echo "Running example: $script"
    julia --project "examples/${script}.jl"
done

Performance Expectations

| Example | Input Size | Time | Backend |
|---|---|---|---|
| simple_2d | 65K pts | ~1 sec | Serial |
| threaded_calculation | 50M pts | ~1 sec | Threaded (4 cores) |
| gpu_acceleration | 1B pts | ~20 sec | GPU (A100) |
| distributed_parallel | 1B pts | ~30 sec | Multi-process |
| real_data_climate | 50K pts | ~0.5 sec | Serial |

Actual times vary by hardware. Use @time to measure.
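
When timing in Julia, the first call to a function includes just-in-time compilation, so it helps to time a second call as well. A minimal sketch, where `work` is a stand-in workload and not a function from the package:

```julia
# Stand-in workload; replace with the call you want to time.
work(n) = sum(abs2, rand(n))

t_first = @elapsed work(10^6)    # first call: includes JIT compilation
t_second = @elapsed work(10^6)   # later calls: run time only

println("first call:  ", round(t_first; digits = 4), " s")
println("second call: ", round(t_second; digits = 4), " s")
```

`@time` prints the same information along with allocation counts; `@elapsed` is used here because it returns the number of seconds as a value.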

Choosing an Example to Start With

I want to...

Learn the basics → simple_2d.jl
Speed up my computation → threaded_calculation.jl
Use my GPU → gpu_acceleration.jl
Scale to a supercomputer → distributed_parallel.jl
Analyze real atmospheric data → real_data_climate.jl
Combine multiple approaches → Mix the examples!

Common Issues & Solutions

"ThreadedBackend not found"

julia> ThreadedBackend()
ERROR: UndefVarError: ThreadedBackend not defined

# Solution: Load OhMyThreads alongside StructureFunctions
using OhMyThreads  # Loading it activates the package extension that provides ThreadedBackend

"GPU out of memory"

# Solution: Reduce dataset size or use Float32
x = Float32.(x)  # Half the memory
u = Float32.(u)

result = calculate_structure_function(x, u, bins; backend=GPUBackend())

"Structure function is NaN"

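# Solution: Check for NaN inputs or remove them (assumes one point per row)
count(isnan, u)                            # Number of NaN entries
keep = .!vec(any(isnan, u; dims=2))        # Rows of u without any NaN
x_clean, u_clean = x[keep, :], u[keep, :]  # Filter x and u together so they stay aligned

# Or let calculate_structure_function handle it
result = calculate_structure_function(x, u, bins)  # Skips NaN pairs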

"Serial computation is too slow"

# Solution: Use ThreadedBackend
backend = ThreadedBackend()  # Requires OhMyThreads

# Or try GPU
backend = GPUBackend()  # Requires KernelAbstractions

Contributing Examples

To add a new example:

1. Create examples/new_example.jl with:
   - Clear docstring explaining what it does
   - Section headers for organization
   - Comments explaining key concepts
   - Print statements summarizing results
   - Links to relevant docs
2. Update this README with:
   - Brief description
   - Run instructions
   - Key concepts learned
   - Next steps
3. Test it runs without errors:
   ```
   julia examples/new_example.jl
   ```

Questions?

Check docstrings: ?calculate_structure_function
Search examples/ for similar workflows
Read the relevant doc file (theory / backends / real_data)
Open an issue on GitHub

Happy computing! 🚀

Last updated: v0.3.0
