Next: HDF5 Filters Up: NEMO on HECToR A Previous: NEMO output comparison scripts   Contents


Some notes on HDF5 datasets

HDF5 is a set of tools and libraries that allows extremely large and complicated data collections to be managed. The file format used by HDF5 is designed to be portable. Further information on HDF5 can be found at [8].

An HDF5 dataset is an object composed of a collection of data elements together with metadata describing them. In addition, the dataset may have optional attribute objects attached to it.
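As a rough illustration of a dataset carrying data elements, metadata and an optional attribute, the following is a minimal sketch using the h5py Python bindings. The choice of h5py is an assumption made for brevity (this document does not prescribe a language binding), and the file name, the dataset name "temperature" and the attribute "units" are hypothetical.

```python
import h5py
import numpy as np


def make_dataset_with_attribute():
    # The "core" driver with backing_store=False keeps the file in memory,
    # so this sketch writes nothing to disk.
    with h5py.File("example_attrs.h5", "w", driver="core",
                   backing_store=False) as f:
        data = np.arange(12, dtype="f8").reshape(3, 4)
        # The data elements plus shape and datatype metadata form the dataset.
        dset = f.create_dataset("temperature", data=data)
        # An optional attribute object attached to the dataset.
        dset.attrs["units"] = "degC"
        return dset.shape, dset.dtype == np.float64, dset.attrs["units"]


if __name__ == "__main__":
    print(make_dataset_with_attribute())
```

The shape and datatype reported here are part of the dataset's metadata, whereas the attribute is a separate, optional object.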

When an HDF5 dataset is created, a number of properties of the dataset are set:

Most of these dataset properties are permanent: they cannot be changed during the lifetime of the dataset. The key exception is the dataspace, which can be expanded up to its maximum dimensions.
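The behaviour of the dataspace can be sketched as follows, again assuming the h5py Python bindings rather than whichever API the application actually uses. The dataset name "sst" and the sizes are hypothetical. The maximum dimensions are fixed at creation through `maxshape`, after which the dataset can be grown up to that limit but not beyond it.

```python
import h5py


def grow_dataset():
    # In-memory file (assumption: nothing needs to be kept on disk).
    with h5py.File("example_resize.h5", "w", driver="core",
                   backing_store=False) as f:
        # Current shape (2, 4); maximum dimensions (10, 4) are set at
        # creation. Resizable datasets must use chunked storage.
        dset = f.create_dataset("sst", shape=(2, 4), maxshape=(10, 4),
                                dtype="f4", chunks=(1, 4))
        before = dset.shape
        dset.resize((5, 4))           # allowed: within the maximum dimensions
        after = dset.shape
        try:
            dset.resize((11, 4))      # exceeds the maximum first dimension
            rejected = False
        except Exception:
            rejected = True
        return before, after, dset.maxshape, rejected


if __name__ == "__main__":
    print(grow_dataset())
```

The datatype and chunk shape chosen at creation stay as they were throughout; only the current extent of the dataspace changes.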

Data transfer - i.e. how does the data get from the application to a physical file? Essentially, the HDF5 library implements data transfers through a pipeline which includes:

Storage allocation in the file - early, incremental or late - may need consideration for parallel I/O.
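The three allocation strategies correspond to a setting on the dataset creation property list. The following is a minimal sketch using h5py's low-level interface. The use of h5py, the chunked layout, the dataset names and the sizes are all assumptions made for illustration. It sets each allocation time in turn and reads it back from the created dataset.

```python
import h5py
from h5py import h5d, h5p, h5s, h5t

# The three allocation strategies named in the text.
ALLOC_TIMES = {
    "early": h5d.ALLOC_TIME_EARLY,
    "incremental": h5d.ALLOC_TIME_INCR,
    "late": h5d.ALLOC_TIME_LATE,
}


def create_with_alloc_time(f, name, alloc_time):
    # The allocation time is set on the dataset creation property list.
    dcpl = h5p.create(h5p.DATASET_CREATE)
    dcpl.set_chunk((16, 16))          # chunked layout (assumption)
    dcpl.set_alloc_time(alloc_time)
    space = h5s.create_simple((64, 64))
    dsid = h5d.create(f.id, name.encode(), h5t.NATIVE_DOUBLE, space,
                      dcpl=dcpl)
    # Read the setting back from the dataset's own creation property list.
    return dsid.get_create_plist().get_alloc_time()


def demo_alloc_times():
    # In-memory file, so nothing is written to disk.
    with h5py.File("example_alloc.h5", "w", driver="core",
                   backing_store=False) as f:
        return {label: create_with_alloc_time(f, label, value) == value
                for label, value in ALLOC_TIMES.items()}


if __name__ == "__main__":
    print(demo_alloc_times())
```

This only shows where the choice is made. Which strategy suits a parallel run depends on the application and the HDF5 build, and is not settled by this sketch.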


