Commit 0dd8b1a

Add summary for benchmarking for Zarr
1 parent 3c6c8a1 commit 0dd8b1a

1 file changed

Lines changed: 24 additions & 0 deletions

README.md

@@ -13,6 +13,30 @@ This textbook gives scientists:
- a guide to designing parallel processing algorithms to work efficiently with chunked datasets
- a guide to exporting chunked datasets to other 'traditional' datasets

## Benchmarking for Zarr

We benchmarked reading and writing 3D imaging data with Zarr under a range of configurations, as guidance on the options available.
The parameters varied were:

- Type of image
  - Heart: HiP-CT scan of a heart from the Human Organ Atlas
  - Dense: segmented neurons from electron microscopy
  - Sparse: a few selected segmented neurons from electron microscopy
- Software libraries
  - TensorStore (fastest for both reading and writing data)
  - zarr-python version 3
  - zarr-python version 2 (slowest for both reading and writing data)
- Compressor
  - blosc-zstd provides the best compression ratio for both image and segmentation data (the options were blosc-blosclz, blosc-lz4, blosc-lz4hc, blosc-zlib and blosc-zstd, as well as gzip and zstd)
- Compression level
  - Setting the compression level above ~3 gives slightly better compression but much longer write times. Compression level does not affect read time.
- Shuffle
  - Setting the shuffle option improves compression with no adverse effect on read or write times (the three options were shuffle, bitshuffle and noshuffle)
- Zarr format version
  - There was no noticeable difference between Zarr format 2 and Zarr format 3 data
- Chunk size
  - Setting a low chunk size (below around 90) has an adverse effect on read and write times.
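
The compression level and shuffle trade-offs in the list above can be explored on a small scale without any Zarr tooling. The sketch below is not the benchmark code behind this summary. It uses Python's standard-library `zlib` (DEFLATE, the algorithm underlying gzip) as a stand-in for the Blosc codecs, a hand-written byte shuffle in place of Blosc's shuffle filter, and synthetic 16-bit data rather than real scans. The names `byte_shuffle`, `byte_unshuffle` and `measure` are illustrative assumptions.

```python
import time
import zlib
from array import array


def byte_shuffle(raw: bytes, itemsize: int) -> bytes:
    """Regroup bytes so byte 0 of every element comes first, then byte 1, and so on."""
    return b"".join(raw[i::itemsize] for i in range(itemsize))


def byte_unshuffle(shuffled: bytes, itemsize: int) -> bytes:
    """Invert byte_shuffle (len(shuffled) must be a multiple of itemsize)."""
    n_items = len(shuffled) // itemsize
    out = bytearray(len(shuffled))
    for i in range(itemsize):
        out[i::itemsize] = shuffled[i * n_items:(i + 1) * n_items]
    return bytes(out)


def measure(raw: bytes, level: int, itemsize: int = 0):
    """Return (compression ratio, seconds) for one level; shuffle first if itemsize > 0."""
    start = time.perf_counter()
    payload = byte_shuffle(raw, itemsize) if itemsize > 0 else raw
    compressed = zlib.compress(payload, level)
    seconds = time.perf_counter() - start
    return len(raw) / len(compressed), seconds


if __name__ == "__main__":
    # Synthetic, slowly varying 16-bit values standing in for image data (an assumption).
    values = array("H", ((i // 7) % 60000 for i in range(200_000)))
    raw = values.tobytes()
    for level in (1, 3, 9):
        for itemsize in (0, values.itemsize):
            ratio, seconds = measure(raw, level, itemsize)
            label = "shuffle" if itemsize else "noshuffle"
            print(f"level={level} {label:9s} ratio={ratio:6.2f} time={seconds * 1e3:7.2f} ms")
```

Ratios and timings depend heavily on the data, so any numbers this prints are illustrative and do not reproduce the results listed above.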
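
One way to get a feel for the chunk size result is to count chunks. Without sharding, Zarr stores each chunk as a separate file or object, so the chunk count roughly tracks how many separate reads or writes are needed to cover an array. The sketch below assumes that "chunk size" means the edge length of a cubic chunk in voxels, which the summary does not state explicitly, and uses a hypothetical 1024 × 1024 × 1024 volume. `chunk_count` is an illustrative helper, not part of any Zarr library.

```python
import math


def chunk_count(shape, chunk_edge):
    """Number of cubic chunks with edge `chunk_edge` needed to cover `shape`."""
    # -(-n // c) is integer ceiling division: partial chunks at the edges still count.
    return math.prod(-(-n // chunk_edge) for n in shape)


if __name__ == "__main__":
    shape = (1024, 1024, 1024)  # hypothetical volume, not one of the benchmark images
    for edge in (32, 64, 90, 128, 256):
        print(f"chunk edge {edge:3d} -> {chunk_count(shape, edge):6d} chunks")
```

Halving the chunk edge multiplies the chunk count by eight in 3D, so any fixed per-chunk overhead grows quickly as chunks shrink.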
## Tools for working with chunked datasets

## Improvements to cloud visualisation
