A Summary of Tools and Capabilities of the ARM Data Quality Office
Authors
Alyssa Jordan Sockol — University of Oklahoma
Austin King — NOAA
Corey Godine — University of Oklahoma
Kenneth Kehoe — University of Oklahoma
Randy A. Peppler — University of Oklahoma
Category
ARM infrastructure
Description
The ARM Data Quality Office (DQO) is an ARM user facility program located at the Cooperative Institute for Mesoscale Meteorological Studies at the University of Oklahoma tasked to characterize the quality of ARM data to provide the best-possible data to end users. The DQO serves as the first line of defense for ARM in discovering data quality issues. This poster will provide a comprehensive summary of the DQO and the tools, capabilities, and processes used to assess data quality.
The discovery of potential data quality issues begins with a daily to weekly review of available data. This is accomplished through the inspection of baseline ARM data by DQO staff and student analysts. Student analysts learn to identify problems using a training program utilizing an internal Wiki page that documents ARM instrument behavior, and a suite of web-based tools to visualize instrument data and quality control (QC) information. Students then submit weekly Data Quality Assessment (DQA) reports to communicate overall data quality observations to Instrument Mentors and Site Operations.
If a potential data quality issue is found, a Data Quality Problem Report (DQPR) is submitted to alert appropriate ARM personnel of potentially unacceptable data via a common interface. DQPRs enable on-line discussion and suggested action from all involved in the quality assurance process, and make it possible to track a problem from detection through resolution to documentation.
The final step in the process involves Data Quality Reports (DQRs). DQRs are written statements of data quality intended for data users for particular ARM datastream(s) over specified time range(s), accompanied by a quality category and a description of the issue, which may also describe how to fix the issue. DQRs are displayed to the data user via Data Discovery, and are included with their data order. DQRs are the primary method of communicating to users all data quality issues not described in instrument handbooks or flagged by embedded QC.
The DQO is recognized as an independent arbiter of data quality for the ARM program, and it is these tools, capabilities, and processes that help to minimize the time required to address instrument problems. This, combined with collaboration with ARM’s instrument mentors, site operators, and data system staff, allows the DQO to achieve its mission and provide the best-possible measurements for scientific research.
Lead PI
Randy A. Peppler — University of Oklahoma