Issue
Hierarchical Data Format is commonly used to store large amounts of data for scientific computing and analysis. This includes multidimensional arrays, raster images, and sensor data. Spark has no built-in data source format for HDF4 or HDF5 files.
Proposed Change
Create a custom data source format for HDF5 using the h5py library.
Issue
Hierarchical Data Format is commonly used to store large amounts of data for scientific computing and analysis. This includes multidimensional arrays, raster images, and sensor data. Spark has no built-in data source format for HDF4 or HDF5 files.
Proposed Change
Create a custom data source format for HDF5 using the h5py library.