A user-centered design for a spatial data warehouse for data exploration in environmental research

Abstract The integration of data from diverse fields of ecological research is paramount in the discovery of new ecological patterns and processes. The spatial exploration of an integrated dataset that spans multiple studies and disciplines can allow researchers to gain unforeseen insight into their data, spawn new research questions and hypotheses and identify data gaps. A user-centered approach was taken to design a spatial data warehouse and online analytical processing (OLAP) tools for data exploration in ecological research. The users in this study had diverse needs and current methods of data management do not easily allow for integration and exploration of data in multidimensional space. A generalizable data warehouse design methodology was created based on the results of a user study. This methodology was then demonstrated in the design of a data warehouse for data exploration in stream ecology resulting in a multidimensional data model with a fact table representing biological stream survey measurements and dimension tables representing spatial and categorical site and landscape variables. A generalizable extraction transformation and loading (ETL) workflow was created to integrate data across spatial dimensions before it was loaded into the data warehouse. A prototype data warehouse was implemented using biological stream survey, hydrologic, and vegetation data to observe spatial patterns in biological community distributions. Based on the exploration requirements identified in the user study, prototype OLAP queries were designed to facilitate spatial data cube exploration. Finally, a web-based interface was implemented to allow for multidimensional spatial visualization of biological stream survey data. The data warehouse and interface will allow researchers to explore biological assessment data at multiple spatial scales across many dimensions.

[1]  Simon Scheider,et al.  Interactive visual exploration of multidimensional data: requirements for CommonGIS with OLAP , 2004 .

[2]  John W. Tukey,et al.  Exploratory Data Analysis. , 1979 .

[3]  Jiawei Han,et al.  GeoMiner: a system prototype for spatial data mining , 1997, SIGMOD '97.

[4]  Esteban Zimányi,et al.  Representing spatiality in a conceptual multidimensional model , 2004, GIS '04.

[5]  Robert H. Deng,et al.  Evolution of Fair Non-repudiation with TTP , 1999, ACISP.

[6]  Yvan Bédard,et al.  SOLAP technology: Merging business intelligence with geospatial technology for interactive spatio-temporal exploration and analysis of data , 2005 .

[7]  Dong Liu,et al.  An Approach to Enabling Spatial OLAP by Aggregating on Spatial Hierarchy , 2003, DaWaK.

[8]  Yvan Bédard,et al.  Toward better support for spatial decision making: Defining the characteristics of spatial on-line analytical processing (SOLAP) , 2001 .

[9]  Jiawei Han,et al.  Efficient Polygon Amalgamation Methods for Spatial OLAP and Spatial Data Mining , 1999, SSD.

[10]  Long Zhang,et al.  Spatial hierarchy and OLAP-favored search in spatial data warehouse , 2003, DOLAP '03.

[11]  Xiaofang Zhou,et al.  Multiresolution Amalgamation: Dynamic Spatial Data Cube Generation , 2004, ADC.

[12]  Shashi Shekhar,et al.  Data Mining and Visualization of Twin-Cities Traffic Data , 2001 .

[13]  W. H. Inmon,et al.  Building the data warehouse , 1992 .

[14]  D. Maidment Arc hydro : GIS for water resources , 2002 .

[15]  Bambang Parmanto,et al.  SOVAT: Spatial OLAP Visualization and Analysis Tool , 2005, Proceedings of the 38th Annual Hawaii International Conference on System Sciences.

[16]  M.M. da Silva,et al.  Spatial OLAP modeling: an overview base on spatial objects changing over time , 2005, IEEE 3rd International Conference on Computational Cybernetics, 2005. ICCC 2005..

[17]  Torben Bach Pedersen,et al.  Capturing complex multidimensional data in location-based data warehouses , 2004, GIS '04.

[18]  Sandro Bimonte,et al.  Towards a spatial multidimensional model , 2005, DOLAP '05.

[19]  Ranga Raju Vatsavai,et al.  Map cube: A visualization tool for spatial data warehouses , 2001 .

[20]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[21]  Shashi Shekhar,et al.  CubeView: a system for traffic data visualization , 2002, Proceedings. The IEEE 5th International Conference on Intelligent Transportation Systems.

[22]  S. Weisberg,et al.  Maryland Biological Stream Survey: Development of a Fish Index of Biotic Integrity , 1998 .

[23]  Yvan Bédard,et al.  Integrating GIS components with knowledge discovery technology for environmental health decision support , 2003, Int. J. Medical Informatics.

[24]  Sandro Bimonte,et al.  GeoCube, a Multidimensional Model and Navigation Operators Handling Complex Measures: Application in Spatial OLAP , 2006, ADVIS.

[25]  Nebojsa Stefanovic Design and implementation of on-line analytical processing (OLAP) of spatial data , 1997 .