EUBrazilCC as a case study for the Ophidia project at the ESGF F2F Workshop, December 2014

The Ophidia project is a research effort on big data analytics that addresses scientific data analysis challenges in the climate change domain. It provides a parallel data analysis service (PDAS), an internal storage model, and a hierarchical data organisation to manage large amounts of multidimensional scientific data.

The PDAS provides several data operators to manipulate multidimensional data sets. Some relevant examples include: 1) data sub-setting (slicing and dicing), 2) data aggregation, and 3) data analysis.
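To make the three operator families concrete, here is a minimal sketch using NumPy as a stand-in. It is not the Ophidia API: the array name `cube`, its (time, lat, lon) layout, and the synthetic values are all assumptions made for illustration.

```python
import numpy as np

# Synthetic data cube with assumed dimensions (time, lat, lon).
# The values are made up; a real cube would hold a climate variable.
cube = np.arange(4 * 3 * 5, dtype=float).reshape(4, 3, 5)

# Slicing: fix one index along a dimension, which drops that dimension.
first_step = cube[0]             # shape (3, 5)

# Dicing: keep a sub-range along several dimensions at once.
sub_cube = cube[1:3, 0:2, 2:5]   # shape (2, 2, 3)

# Aggregation: collapse the time dimension with a mean.
time_mean = cube.mean(axis=0)    # shape (3, 5)
```

A service such as the PDAS would presumably run the equivalent operations in parallel over data partitions rather than on one in-memory array.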

Additionally, the PDAS provides about 100 primitives to perform time series analysis, sub-setting, and data aggregation on large arrays of scientific data. Multiple primitives can also be nested to implement a single, more complex task (e.g., aggregating by sum a subset of the entire array). The entire Ophidia software stack has been deployed at CMCC on 24 nodes (16 cores per node) of the Athena HPC cluster. A comprehensive benchmark and a set of test cases are being defined with climate scientists to extensively test all of the features provided by the system. Preliminary experimental results are already available and have been published in scientific research papers.
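The nesting of primitives mentioned above (aggregating by sum a subset of an array) can be sketched as ordinary function composition. The functions `subarray` and `reduce_sum` below are hypothetical stand-ins written for this example, not Ophidia primitives, and the sample values are invented.

```python
import numpy as np

def subarray(series, start, stop):
    """Hypothetical primitive: keep elements start..stop-1 of a 1-D array."""
    return series[start:stop]

def reduce_sum(series):
    """Hypothetical primitive: collapse a 1-D array to its sum."""
    return series.sum()

# One time series per grid point: rows are grid points, columns are time steps.
data = np.array([[1.0, 2.0, 3.0, 4.0],
                 [10.0, 20.0, 30.0, 40.0]])

# Nest the two primitives: sum only time steps 1 and 2 of every series.
result = np.array([reduce_sum(subarray(row, 1, 3)) for row in data])
```

Because the inner call returns an array and the outer call consumes one, the two steps read as a single task applied independently to each series.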

The PDAS is also included in the EUBrazilCC architecture, which was presented at the meeting in the context of the series of applications that the EUBrazilCC team is developing to analyse the impacts of climate change on biodiversity.

More about the event:

The presentation is available at this link: