How does ph5, the PASSCAL implementation of the HDF5 data model, stack up against SEG-Y as a data archiving format?
Comparison Category
ph5
SEG-Y
Time series length
Essentially unlimited
32766 samples
Data set archive-able with incomplete meta-data?
Yes*
No
Customizable gather parameters set during archiver data request?
Yes
No
Meta-data correctable after archiving?
Yes
No
Need for reprocessing or re-archiving?
Rare
Whenever meta-data change, for example shot time corrections
Raw data storage?
Yes
No, unless the experiment is considered an Earthscope FlexArray project
State of health data storage?
Yes
No
Tracking of archived reports?
Yes
No
Potential for meta-data queries?
Yes
Limited to IRIS DMC forms completed by PI
Ability to store instrument and datalogger responses?
Yes, in EVALRESP format
No
All ph5 data sets are available from the IRIS DMC as requester-configurable SEG-Y gathers or PASSCAL SEG-Y files. The SEG-Y files are built on-demand as needed, hence the flexibility in gather parameters (start time, length (within the limits of SEG-Y), and selected arrays for those data sets with multiple arrays.
In what format would you choose to archive your active source data set?
*Note: archiving raw data without meta-data or with incomplete meta-data does not satisfy the PASSCAL Data Delivery Policy.