IN23C-1738
Shifting from Stewardship to Analytics of Massive Science Data

Tuesday, 15 December 2015
Poster Hall (Moscone South)
Daniel J Crichton1, Richard Doyle1, Emily Law2, Steve Hughes2, Thomas Huang2 and Ashish Mahabal3, (1)Jet Propulsion Laboratory, Pasadena, CA, United States, (2)NASA Jet Propulsion Laboratory, Pasadena, CA, United States, (3)California Institute of Technology, Pasadena, CA, United States
Abstract:
Currently, the analysis of large data collections is executed through traditional computational and data analysis approaches, which require users to bring data to their desktops and perform local data analysis. Data collection, archiving and analysis from future remote sensing missions, be it from earth science satellites, planetary robotic missions, or massive radio observatories may not scale as more capable instruments stress existing architectural approaches and systems due to more continuous data streams, data from multiple observational platforms, and measurements and models from different agencies. A new paradigm is needed in order to increase the productivity and effectiveness of scientific data analysis. This paradigm must recognize that architectural choices, data processing, management, analysis, etc are interrelated, and must be carefully coordinated in any system that aims to allow efficient, interactive scientific exploration and discovery to exploit massive data collections. Future observational systems, including satellite and airborne experiments, and research in climate modeling will significantly increase the size of the data requiring new methodological approaches towards data analytics where users can more effectively interact with the data and apply automated mechanisms for data reduction, reduction and fusion across these massive data repositories.

This presentation will discuss architecture, use cases, and approaches for developing a big data analytics strategy across multiple science disciplines.