A theoretical approach to the use of cyberinfrastructure in geographical analysis

This paper presents a theoretical approach that has been developed to capture the computational intensity and computing resource requirements of geographical data and analysis methods. These requirements are then transformed into a common framework, a grid‐based representation of a spatial computational domain, which supports the efficient use of emerging cyberinfrastructure environments. Two key types of transformational functions (data‐centric and operation‐centric) are identified and their relationships are explained. The application of the approach is illustrated using two geographical analysis methods: inverse distance weighted interpolation and the spatial statistic. We describe the underpinnings of these two methods, present their conventional sequential algorithms, and then address their latent parallelism based on a spatial computational domain representation. Through the application of this theoretical approach, the development of domain decomposition methods is decoupled from specific high‐performance computer architectures and task scheduling implementations, which makes the design of generic parallel processing solutions feasible for geographical analyses.

[1]  David A. Bennett Fundamentals of geographic information systems (GIS) , 2010 .

[2]  Shaowen Wang,et al.  Grid computing of spatial statistics: using the TeraGrid for G i * (d) analysis , 2008 .

[3]  Michael J. Flynn,et al.  Very high-speed computing systems , 1966 .

[4]  Jan Karel Lenstra,et al.  Perspectives on parallel computing , 1989 .

[5]  Shaowen Wang,et al.  Grid computing of spatial statistics: using the TeraGrid for G  i* (d) analysis , 2008, Concurr. Comput. Pract. Exp..

[6]  Shashi Shekhar,et al.  Parallelizing a GIS on a Shared Address Space Architecture , 1996, Computer.

[7]  David Sun,et al.  UNIFORM GRIDS: A TECHNIQUE FOR INTERSECTION DETECTION ON SERIAL AND PARALLEL MACHINES , 2008 .

[8]  Waldo R. Tobler,et al.  A Transformational View of Cartography , 1979 .

[9]  James E. Mower Developing Parallel Procedures for Line Simplification , 1996, Int. J. Geogr. Inf. Sci..

[10]  Hao Chen,et al.  Parallel Simulated Annealing and Genetic Algorithms: a Space of Hybrid Methods , 1994, PPSN.

[11]  Leila De Floriani,et al.  Parallelizing Visibility Computations on Triangulated Terrains , 1994, Int. J. Geogr. Inf. Sci..

[12]  Mark Birkin,et al.  The Use of Parallel Computers to Solve Nonlinear Spatial Optimisation Problems: An Application to Network Planning , 1995 .

[13]  Shaowen Wang,et al.  Using a Computational Grid for Geographic Information Analysis: A Reconnaissance , 2005 .

[14]  Bruce Gittings,et al.  Towards a framework for High Performance Geocomputation: Handling Vector Topology within a Distributed Service Environment , 2000 .

[15]  Marc P. Armstrong,et al.  Massively parallel strategies for local spatial interpolation , 1997 .

[16]  Henri Casanova,et al.  A decoupled scheduling approach for Grid application development environments , 2003, J. Parallel Distributed Comput..

[17]  Arthur Getis,et al.  Cliff, A.D. and Ord, J.K. 1973: Spatial autocorrelation. London: Pion , 1995 .

[18]  Ed Oates,et al.  Massively parallel processing , 1993 .

[19]  A. Fotheringham Spatial Interaction Models , 2001 .

[20]  N. Lam Spatial Interpolation Methods: A Review , 1983 .

[21]  Marc P. Armstrong,et al.  Using Linda to compute spatial autocorrelation in parallel , 1996 .

[22]  C. J. Ancker,et al.  A SIMPLE RENEWAL MODEL OF THROUGHPUT AT AN OVERSATURATED SIGNALIZED INTERSECTION. , 1967 .

[23]  Jörg-Rüdiger Sack,et al.  Parallel implementation of geometric shortest path algorithms , 2003, Parallel Comput..

[24]  T. Hassard,et al.  Applied Linear Regression , 2005 .

[25]  G. G. Wilkinson Parallel processing approaches in remotely sensed image analysis , 1997 .

[26]  Marc P. Armstrong,et al.  An Evaluation of Domain Decomposition Strategies for Parallel Spatial Interpolation of Surfaces , 1999 .

[27]  Shaowen Wang,et al.  A quadtree approach to domain decomposition for spatial interpolation in Grid computing environments , 2003, Parallel Comput..

[28]  Clark F. Olson Maximum-Likelihood Image Matching , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  Richard Healey,et al.  Parallel Processing Algorithms for GIS , 1997 .

[30]  D. Griffith Spatial Autocorrelation , 2020, Spatial Analysis Methods and Practice.

[31]  Michael W. Berry,et al.  Parallelization in a spatially explicit individual-based ecological model—1: spatial data interpolation , 1997 .

[32]  Demin Xiong,et al.  Strategies for Real-Time Spatial Analysis Using Massively Parallel SIMD Cpmputers: An Application to Urban Traffic Flow Analysis , 1996, Int. J. Geogr. Inf. Sci..

[33]  Marc P. Armstrong,et al.  Parallel processing of spatial statistics , 1994 .

[34]  A. Getis The Analysis of Spatial Association by Use of Distance Statistics , 2010 .

[35]  S. Weisberg Applied Linear Regression , 1981 .

[36]  Antonio J. Plaza,et al.  Impact of platform heterogeneity on the design of parallel algorithms for morphological processing of high-dimensional image data , 2007, The Journal of Supercomputing.

[37]  Michael N. DeMers,et al.  Fundamentals of Geographic Information Systems , 1996 .

[38]  J. Ord,et al.  Local Spatial Autocorrelation Statistics: Distributional Issues and an Application , 2010 .

[39]  Hanan Samet,et al.  Applications of spatial data structures - computer graphics, image processing, and GIS , 1990 .

[40]  L. Anselin What is Special About Spatial Data? Alternative Perspectives on Spatial Data Analysis (89-4) , 1989 .

[41]  M. Jacobs,et al.  Comparison of Methods for Interpolating Soil Properties Using Limited Data , 2001 .

[42]  R. G. Healey,et al.  Implementation case study I: generalization of raster data , 1998 .

[43]  Hanan Samet,et al.  Hierarchical Spatial Data Structures , 1989, SSD.

[44]  Shaowen Wang,et al.  Parallelizing MCMC for Bayesian spatiotemporal geostatistical models , 2007, Stat. Comput..

[45]  Yuemin Ding,et al.  Spatial Strategies for Parallel Spatial Modelling , 1996, Int. J. Geogr. Inf. Sci..

[46]  Ian Foster,et al.  The Grid 2 - Blueprint for a New Computing Infrastructure, Second Edition , 1998, The Grid 2, 2nd Edition.

[47]  Stan Openshaw,et al.  A parallel Kohonen algorithm for the classification of large spatial datasets , 1996 .

[48]  Gang-Len Chang,et al.  Exploring real-time traffic simulation with massively parallel computing architecture , 1993 .

[49]  Marc P. Armstrong,et al.  Massively Parallel Processing of Spatial Statistics , 1995, Int. J. Geogr. Inf. Sci..

[50]  Marc P. Armstrong,et al.  DOMAIN DECOMPOSITION FOR PARALLEL PROCESSING OF SPATIAL PROBLEMS , 1992 .

[51]  Nabil H. Mustafa,et al.  Dynamic simplification and visualization of large maps , 2006, Int. J. Geogr. Inf. Sci..

[52]  Marc P. Armstrong,et al.  Experiments in the Identification of Terrain Features Using a PC-Based Parallel Computer , 1998 .

[53]  Jack Dongarra,et al.  Sourcebook of parallel computing , 2003 .

[54]  Marc P. Armstrong,et al.  PARALLEL PROCESSING FOR NETWORK ANALYSIS - DECOMPOSING SHORTEST-PATH ALGORITHMS FOR MIMD COMPUTERS , 1992 .

[55]  Daniel Atkins,et al.  Revolutionizing Science and Engineering Through Cyberinfrastructure: Report of the National Science Foundation Blue-Ribbon Advisory Panel on Cyberinfrastructure , 2003 .

[56]  Nicholas Chrisman A transformational approach to GIS operations , 1999, Int. J. Geogr. Inf. Sci..

[57]  Hanan Samet,et al.  Applications of spatial data structures , 1989 .

[58]  H. Sagan Space-filling curves , 1994 .

[59]  Oliver Günther,et al.  Multidimensional access methods , 1998, CSUR.

[60]  Hanan Samet,et al.  Data-parallel polygonization , 2003, Parallel Comput..

[61]  Hanan Samet,et al.  Foundations of multidimensional and metric data structures , 2006, Morgan Kaufmann series in data management systems.

[62]  Michael J. Mineter A software framework to create vector-topology in parallel GIS operations , 2003, Int. J. Geogr. Inf. Sci..

[63]  Luc Anselin,et al.  WHAT IS SPECIAL ABOUT SPATIAL DATA , 2006 .

[64]  Ami Marowka,et al.  The GRID: Blueprint for a New Computing Infrastructure , 2000, Scalable Comput. Pract. Exp..

[65]  W. Daniel Hillis,et al.  Data parallel algorithms , 1986, CACM.

[66]  Keith Devlin Sets, functions and logic : an introduction to abstract mathematics / Keith Devlin , 1992 .

[67]  Walid G. Aref,et al.  Analysis of Multi-Dimensional Space-Filling Curves , 2003, GeoInformatica.