ARM Data Center: Data Discovery Improvements and Requirements Gathering

 

Authors

Ranjeet Devarakonda — Oak Ridge National Laboratory
Maggie Davis — Oak Ridge National Laboratory
Richard T. Cederwall — Oak Ridge National Laboratory
Harold Shanafield — Oak Ridge National Laboratory
Alka Singh — Oak Ridge National Laboratory
Kyle K Dumas — Oak Ridge National Laboratory
Kavya Guntupally — Oak Ridge National Laboratory
Giri Prakash — Oak Ridge National Laboratory

Category

ARM infrastructure

Description

The ARM Data Center (ADC: https://www.arm.gov/data) is responsible for providing end-to-end capabilities for ARM’s multi-dimensional climate data, including storing, managing, and distributing data. In this poster, we will discuss some recent improvements made to two critical pieces to ARM data discovery, i.e. Metadata and the Search interface itself. Metadata improvements: The ADC will streamline the metadata workflow from OME submission through database population to ensure that precise metadata supports timely and complete data delivery. Our goal is to make submission of datasets comprehensive and efficient while also making pertinent information and data easily accessible for users. Specific efforts to streamline the metadata workflow are extensive and include: • Changes to the Metadata Management Tool (MMT) and underlying database structure are ongoing. These changes will enhance the metadata assignment process for increased utility and speed from data submission to discovery. We will also leverage automation tools to reduce time necessary for the flow of data between OME submission and IOP placement, further bolstering efficiency of the metadata process. • Improved pathways for users to obtain the data they want and have more access to information about the data will be available in 2018. Following the Triennial Review feedback to “identify a process for reviewing and updating recommended datastreams”, a major update will occur to include the development of tools to facilitate more frequent and consistent updates to the list of ARM core measurements and associated recommended datastreams. • DOIs are also being extended to data levels to assist the user in navigating through ARM Data Discovery to specific data streams. Applicable instrument handbooks that are referenced on ARM instrument web pages will also be linked more prominently in the data delivery notification. • Metadata is being created, distributed, and published in various formats (i.e. XML, JSON) on various data portals (i.e. data.gov). This effort will promote the data visibility of ARM data. Data Discovery tool improvements: The ADC has made several improvements to the Data Discovery tool including, the addition of LASSO model data, DQR ‘missing’ data representation, new citation generator tool, and finally the new data retrieval system for faster access to the data. In addition, we will also discuss the recent triennial review feedback related to the Data Discovery tool and ADC’s implementation plan. Highlights from the review include refining the way to pick “the right data, and the range of products in the database can be a bit overwhelming”. Improvements are also planned for 2018 to display more detailed information about recommended data sources to users in the Data Discovery Tool. This poster will demonstrate upcoming improvements for the metadata activities and the data discovery tool and also engage the ADC development team with stakeholders for gathering immediate feedback.

Supporting URL

https://www.arm.gov/data