Authors

Kyle K Dumas (Quicklooks) — Oak Ridge National Laboratory
Giri Prakash — Oak Ridge National Laboratory
Kyle K Dumas — Oak Ridge National Laboratory
Alka Singh — Oak Ridge National Laboratory

Category

ARM infrastructure

Description

The big data architecture for Atmospheric Radiation Measurement (ARM) data has been in experiment for couple of years now. Few tools have been built based on this architecture and has been made available to the scientist, such as LASSO bundle browser [1] and ARMBE histogram tool [2]. We are currently exploring use cases that can potentially use this analytical platform. The proof of concept tool called ARM Data Studio, which will let the users to apply ranges in multiple variables from multiple datastreams. Many such sample queries had been obtained from different scientists and the results have been demonstrated at different platforms using cassandra database and Spark processing framework. Typically scientists spend considerable time in post download processing before they could use the data in their research, hence the Data Studio aims to help reduce the time needed to extract the data of interest. Currently a generalized framework is been created to select any datastream that is in this analytical platform to narrow the results based on the input query and to visualize the variables using time-series or scatter plots. The proof of concept can be explored at the data booth as well as at the poster session. References: [1] http://adc.arm.gov/lassobrowser [2] http://ac.arm.gov/armbe