Group implementations in HDF5:
The original HDF5 group implementation provided
a single indexed structure for link storage.
A new group implementation, in HDF5 Release 1.8.0,
enables more efficient compact storage for very small groups,
improved link indexing for large groups,
and other advanced features.
H5Pset_char_encoding
)
and is required for all link types other than hard or soft links,
e.g., external and user-defined links
(see the H5L APIs).
The original group structure and the newer structures are not directly interoperable. By default, a group will be created in the original indexed format. An existing group can be changed to a compact-or-indexed format if the need arises; there is no capability to change back. As stated above, once in the compact-or-indexed format, a group can switch between compact and indexed as needed.
Groups will be initially created in the compact-or-indexed format only when one or more of the following conditions is met:
H5Pset_libver_bounds
).
Currently, that would require an H5Pset_libver_bounds
call with the low parameter set to
H5F_LIBVER_LATEST
.
When this property is set for an HDF5 file, all objects in the file will be created using the latest available format; no effort will be made to create a file that can be read by older libraries.
H5P_CRT_ORDER_TRACKED
, has been set
in the group creation property list
(see
H5Pset_link_creation_order
).
An existing group, currently in the original indexed format, will be converted to the compact-or-indexed format upon the occurrence of any of the following events:
The compact-or-indexed format offers performance improvements
that will be most notable at the extremes,
i.e., in groups with zero members
and in groups with tens of thousands of members.
But measurable differences may sometimes appear
at a threshold as low as eight group members.
Since these performance thresholds and criteria differ from
application to application, tunable settings are provided to
govern the switch between the compact and indexed formats
(see
H5Pset_link_phase_change
).
Optimal thresholds will depend on the application and the
operating environment.
Future versions of HDF5 will retain the ability to create, read, write, and manipulate all groups stored in either the original indexed format or the compact-or-indexed format.