When Spatial Analytics Meets Cyberinfrastructure: an Interoperable and Replicable Platform for Online Spatial-Statistical-Visual Analytics

Developing spatial analytical methods as open source libraries is an important step toward open and replicable science. However, although large geospatial datasets and geospatial cyberinfrastructure (GeoCI) resources are becoming available, many libraries and toolkits are still designed only for analytics in a desktop environment. Coupling spatial analytical functionality with big data and high-performance computing offers immediate benefits for multidisciplinary research: it helps address challenging socioeconomic and environmental problems, supports remote collaboration among physically distributed research groups, and assists informed decision-making. In this article, we present the design and implementation of a general workflow for integrating state-of-the-art open source libraries with GeoCI resources. We also resolve various interoperability and replicability issues that arise during implementation. The popular open source Python Spatial Analysis Library (PySAL) was selected to build an interoperable Web service, WebPySAL, which was then successfully integrated into GeoCI. With this integration of spatial analytics and cyberinfrastructure, the new GeoCI platform provides easy-to-use, efficient, and interactive exploratory spatial analysis functions to public users. The platform's capability is demonstrated through two regional economic case studies: (1) evaluating global spatial autocorrelation and identifying local clusters in the spatial pattern of median household incomes for US counties (with global and local Moran’s I statistics), and (2) modeling the space-time dynamics of per capita incomes at the state level (with spatial Markov statistics).
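As a rough illustration of the statistics named in the first case study, the sketch below computes global and local Moran's I from their textbook definitions in plain Python. It is not the WebPySAL or PySAL implementation; the function names, the four-region toy data, and the binary contiguity matrix are all assumptions made for the example, and the permutation-based inference that usually accompanies these statistics is omitted.

```python
def _deviations(values):
    """Return each value's deviation from the sample mean."""
    mean = sum(values) / len(values)
    return [v - mean for v in values]


def global_morans_i(values, weights):
    """Global Moran's I: (n / S0) * sum_ij(w_ij * z_i * z_j) / sum_i(z_i^2)."""
    n = len(values)
    z = _deviations(values)
    s0 = sum(sum(row) for row in weights)  # sum of all spatial weights
    cross = sum(weights[i][j] * z[i] * z[j]
                for i in range(n) for j in range(n))
    return (n / s0) * cross / sum(d * d for d in z)


def local_morans_i(values, weights):
    """Local Moran's I_i: (z_i / m2) * sum_j(w_ij * z_j), with m2 = sum(z^2) / n."""
    n = len(values)
    z = _deviations(values)
    m2 = sum(d * d for d in z) / n
    return [z[i] / m2 * sum(weights[i][j] * z[j] for j in range(n))
            for i in range(n)]


# Hypothetical toy data: four regions arranged in a line, each a neighbor
# of the regions directly beside it (binary, unstandardized weights).
incomes = [1.0, 2.0, 3.0, 4.0]
contiguity = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]

if __name__ == "__main__":
    print("global I:", global_morans_i(incomes, contiguity))
    print("local I:", local_morans_i(incomes, contiguity))
```

With unstandardized weights such as these, the local values sum to the global statistic multiplied by the total weight S0, which is a convenient sanity check on any implementation.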
