A new and user centric data discovery tool to access the ARM data

 

Authors

Giri Prakash — Oak Ridge National Laboratory
Kyle K Dumas — Oak Ridge National Laboratory
Ranjeet Devarakonda — Oak Ridge National Laboratory

Category

General topics

Description

ARM Data Center (ADC) currently archives over 11,000 data products with a total holding of over 1.7 petabytes of data that dates back to 1992, these include data from instruments, value added products, model outputs, field campaign and PI contributed data. The ARM data discovery tool (https://adc.arm.gov/discovery/) helps scientist to find and access these datasets. This tool is currently undergoing a major design revision with a goal to improve the user experience. The author will explain the end-to-end development process and best practices that include: gathering stakeholder recommendations, usability testing, design development using continuous integration methodology. The presentation will include the recently enabled data access and delivery options such as THREDDS/OpenDAP, GlobusOnline, and near real-time data access API, automated data access via web services, data citation generator, advanced visualizations and big data analysis platform for identifying data of interest. The author will demonstrate how users can request the data using data discovery tool and perform their data analysis using the ARM high performing computing clusters. There will be a discussion to collect feedback from the scientists about the new features to further improve the data discovery tool.