Data Delivery Policy
October 9, 2015
This policy outlines the guidelines and archiving requirements for data collected with IRIS PASSCAL equipment. IRIS’s policy is that delivery of data to the IRIS Data Management Center (DMC) is an obligation of the Principal Investigator (PI). It is important to IRIS that the PI acknowledges this obligation and meets it within the required time frame. Failure to complete this requirement not only deprives the community of a valuable data resource, but also may jeopardize future requests to borrow IRIS equipment. IRIS expects data delivery while the experiment is in the field (for long term deployments), or immediately at the conclusion of the field experiment. At the request of the PI, the data will remain confidential for a period of 2 years after the end of the fieldwork, but will then become part of the open data archive to benefit the entire seismological community. The Data report and all QC metrics will be immediately open and available to ensure data sharing after the moratorium period is facilitated.
The equipment in the PASSCAL facility represents a significant community resource. In addition, the data collected by the PI community using the PASSCAL equipment has tremendous value past the specific scientific goals of the original experiment. Abiding by this data policy ensures the continued contribution of the IRIS PASSCAL facility to seismological studies.
Data Evaluation Report and Data Object Identifiers (DOI)
IRIS has worked with the editors of the Seismological Research Letters to allow publishing data reports on experimental data sets to introduce the dataset and experiment to the community, and to provide a citable reference for others who will eventually use your data. Guidelines for these reports can be found at www.iris.edu/TO_BE_DETERMINED
IRIS Data Services has worked with the FDSN to develop a procedure to create DOI's for data from temporary networks. DOIs promote the citation of a PI's work as well as the facility support for each experiment. More information on DOI's can be found at: http://www.fdsn.org/services/doi/ and temporary experiments can request a DOI from the same page where they request a temporary network code. Look for the section titled Network Citation.
The Data Evaluation Report (http://www.passcal.nmt.edu/forms/data_eval) is due immediately after the completion of the experiment.
The actual format of the data and the amount of data depend upon the type of experiment. Most PASSCAL experiments are classified as passive source, controlled source, or more recently some combination of controlled and passive recording (mixed-mode).
Passive Source Experiment Data
PI’s conducting a passive source experiment will provide all of the data and appropriate metadata from their experiment to the IRIS DMC for archive in SEED or stationXML format. It is expected that the PI will coordinate with the PASSCAL Instrument Center to arrange delivery of the data to the DMC on a continuing basis during the experiment (after service runs, as appropriate, or in real time if available). The final data shall be delivered to the DMC immediately after the experiment is over. The DMC will make the data available only to the PI or his designated representative for a period of up to two years after the completion of the experiment. This exclusive access can be waived by the PI if open data can be immediately made available. After the proscribed moratorium (no more than 2 years after the conclusion of the experiment), the data will be made freely and openly available to the public.
Controlled Source Experiment Data
Controlled source experiments are more frequently recording continuous data encompassing the controlled source time windows. To best preserve the entire data record the accepted data format for continuously recorded data is PH5, SEED, or stationXML. For conventional, shot windowed data , data should be delivered in SEG-Y format. In addition it may be delivered in PH5 format as well.The data should include all of the necessary information on the geometry of the experiment (metadata) and they should be corrected for all known timing problems.
Mixed-Mode Experiment Data
More recently, experiments are being proposed and deployed that involve both controlled source and passive source experiment components. In these cases, we are working towards a single format (PH5) to allow ingestion, reformatting and distribution of the particular components of the data in the appropriate formats as required by the data users and this may be used in its proto-form. However, at the time of this report, the operational release of PH5 is not complete, so it may be used intermittently. Until this becomes a standard, or a new standard is found, it is acceptable to provide continuous data in SEED and shot data in SEG-Y, as appropriate.
There will always be some experiments that do not fit directly into one of the above categories. In those cases the exact form of the data delivery will be negotiated between the PI, Data Services and PASSCAL.
Data of all types should be delivered to the DMC, in the appropriate format with complete metadata, as soon as possible and well before the general release of the data. The DMC will only allow access to the waveforms to the PI and others designated by the PI during the moratorium (not more than 2 years from the completion of the field experiment). Access will be by password that will be provided by the DMC to the PI. The PI can share the password with anyone he/she wishes. The PI will be notified when anyone registers for access to a proprietary dataset.
Information about the experiment contained in the metadata and data quality characteristics will be made publicly available during the experiment, only waveform data will be limited in distribution during the proprietary period.
All passive experiments with five or more stations will designate at least one station as an “open station”. The data from the “open station/s” will be made available to the public immediately upon being archived.
Support Available from IRIS
Every PI and their field staff need to attend training at the IRIS PASSCAL Instrument Center (PIC) prior to their experiment where they will receive training on instrumentation and installation techniques, and the handling of the data and development of the station metadata. Teams will be taught how to archive their data using PASSCAL field computers and software that is provided as a part of the experiment equipment set. Every field computer has the software necessary to accomplish the data delivery task and the PASSCAL Instrument Center has personnel who can provide assistance to the PI during and after the experiment. We encourage PI's to interact with PIC staff before and after service runs and immediately after the experiment conclusion to allow refreshers on data handling and ensure immediate ingestion of the data to the DMC. This will not only get help PI's with data handling training that may have occurred months to years ago, but also assure that the data delivery requirement to the DMC is met in a timely manner. Please note that many new tools for assessing data quality are now available for experiment datasets that can only be accessed once the data are uploaded.
The PI is encouraged to utilize these resources at all stages of the work. In all cases, however, the ultimate responsibility for delivery of the data rests with the Principal Investigator. The PI must ensure that adequate resources are budgeted to accomplish this task.
Data Quality Assessment
All data submitted to the IRIS DMC will be run through the data quality assessment tools (MUSTANG) and the data quality metrics will be made open without delay for data quality evaluation - ie quality metrics are not subject to the data moratorium described above. MUSTANG data may be accessed through standard web services tools or through the use of the MUSTANG client interfaces (like MUSTANGDataBrowser or LASSO).
Experiment Completion and Delinquent Datasets
A PASSCAL data submission is not considered complete until both the PASSCAL Program Manager and the Director of Data Services certify that the information contained in the report is sufficient to allow other members of the community to utilize the data. IRIS will not certify that it has received data from any PI until the data submission is deemed useable.
PIC staff will remind PI's that data are due beginning 6 months prior to the data becoming delinquent (ie, more than 90 days after completion of the experiment) and will offer guidance on how to ensure your data are properly uploaded to the DMC. If your dataset becomes delinquent, the Program Manager and PASSCAL Standing committee chair will work with you to determine how best to make your data available to the community to ensure the program resources are being adequately allocated. We certainly hope that this escalation will not be required, but it is in the best interest of the entire IRIS community to ensure that when IRIS PASSCAL equipment is used, that the data are archived in a useable format as required.
Exemptions for Strictly Educational Use Datasets
If you are using equipment from the PASSCAL facility for strictly educational training on the instrumentation and are not using the equipment for observing a science target, PASSCAL management may grant an exemption from the data distribution policy if there is no useful information to be shared among the seismological community. This is at the discretion of the PASSCAL program manager. Of course, if you are willing to share the data, all data sets are welcome. All exemptions must be request of the PASSCAL Program Manager and are to be considered due to IRIS unless the Program Manager grants and exemption.
There may be other needs for exemptions, including complete failure of the experiment to capture data, experiment cancellation, or no useable data were captured. However, all of these need to be approved by management with no exceptions.
This policy is effective as of Sept 29, 2015 and is subject to change and revision as needs dictate. For updated versions of the policy and additional information on data delivery see the PASSCAL and DS pages on the IRIS web site. There is also an IRIS document detailing the policies regarding release of restricted data.
- General Information
- Borrowing Equipment
- Equipment Inventory
- Shipping Information
- PASSCAL Standing Committee
- Visiting PASSCAL
- Acknowledgment, Citation, and Logos
- FAQ Bank
- 1-Channel (Texan) Dataloggers
- 3-Channel Dataloggers
- Restoring a Clie
- Quanterra Q330 Datalogger
- RefTek RT 130 Datalogger
- Multi-Channel Dataloggers
- Offload Data
- Grounding when Static is a Problem
- Power Systems
- Sensor Certification
- Sensor Comparison Chart
- Broadband Sensors
- High Frequency Sensors
- Intermediate Period Sensors
- Short Period Sensors
- Field Procedures
- Huddle Testing
- Station Siting
- Seismic Vaults
- Station Installation
- Working with Responses to Get Units of Displacement
- Controlled Sources
- Data Archiving
- Design Drawings
- Polar Presentations
- Sample Deployments
- Useful Web Links
- Expt. Schedule
- Important Hardware/Software Notes