Spatiotemporal correlation in WebGIS group-user intensive access patterns

ABSTRACT Group-user intensive access to WebGIS exhibits spatiotemporal behaviour patterns with aggregation features and regularity distributions when geospatial data are accessed repeatedly over time and aggregated in certain spatial areas. We argue that these observable group-user access patterns provide a foundation for improved optimization of WebGIS so that it can respond to volume intensive requests with a higher quality of service and improve performance. Subsequently, a measure of access popularity distribution must precisely reflect the access aggregation and regularity features found in group-user intensive access. In our research, we considered both the temporal distribution characteristics and spatial correlation in the access popularity of tiled geospatial data (tiles). Based on the observation that group-user access follows a Zipf-like law, we built a tile-access popularity distribution based on time-sequence, to express the access aggregation of group-users with heavy-tailed characteristics. Considering the spatial locality of user-browsed tiles, we built a quantitative expression for the correlation between tile-access popularities and the distances to hotspot tiles, reflecting the attenuation of tile-access popularity to distance. Moreover, given the geographical spatial dependency and scale attribute of tiles, and the time-sequence of tile-access popularity, we built a Poisson regression model to express the degree of correlation among the accesses to adjacent tiles at different scales, reflecting the spatiotemporal correlation in tile access patterns. Experiments verify the accuracy of our Poisson regression model, which we then applied to a cluster-based cache-prefetching scenario. The results show that our model successfully reflects the spatiotemporal aggregation features of group-user intensive access and group-user behaviour patterns in WebGIS. The refined mathematical method in our model represents a time-sequence distribution of intensive access to tiles and the spatial aggregation and correlation in access to tiles at different scales, quantitatively expressing group-user spatiotemporal behaviour patterns with aggregation features and a regular distribution. Our proposed model provides a precise and empirical basis for performance-optimization strategies in WebGIS services, such as planning computing resource allocation and utilization, distributed storage of geospatial data, and providing distributed services so as to respond rapidly to geospatial data requests, thus addressing the challenges of volume-intensive user access.

[1]  Qunying Huang,et al.  Using adaptively coupled models and high-performance computing for enabling the computability of dust storm forecasting , 2013, Int. J. Geogr. Inf. Sci..

[2]  L. Sedda,et al.  Spatio-temporal analysis of tree height in a young cork oak plantation , 2011, Int. J. Geogr. Inf. Sci..

[3]  Harvey J. Miller,et al.  User‐centred time geography for location‐based services , 2004 .

[4]  László Böszörményi,et al.  A survey of Web cache replacement strategies , 2003, CSUR.

[5]  H. Miller Tobler's First Law and Spatial Analysis , 2004 .

[6]  Qunying Huang,et al.  Spatial Cloud Computing: A Practical Approach , 2013 .

[7]  Giorgos Mountrakis,et al.  Multi‐scale spatiotemporal analyses of moose–vehicle collisions: a case study in northern Vermont , 2009, Int. J. Geogr. Inf. Sci..

[8]  Hyoung-Joo Kim,et al.  Prefetch policies for large objects in a Web-enabled GIS application , 2001, Data Knowl. Eng..

[9]  Qunying Huang,et al.  Using spatial principles to optimize distributed computing for enabling the physical science discoveries , 2011, Proceedings of the National Academy of Sciences.

[10]  Xing Yong Modeling User Navigation Sequences Based on Multi-Markov Chains , 2003 .

[11]  Elias Ioup,et al.  Tile-Based Geospatial Information Systems: Principles and Practices , 2010 .

[12]  Steffen Rothkugel,et al.  World Wide Web caching: the application-level view of the Internet , 1997, IEEE Commun. Mag..

[13]  Zhao Ying Recent Development in Time Geography , 2009 .

[14]  Mei-Po Kwan,et al.  VISUALISATION OF SOCIO‐SPATIAL ISOLATION BASED ON HUMAN ACTIVITY PATTERNS AND SOCIAL NETWORKS IN SPACE‐TIME , 2011 .

[15]  M. Dijst,et al.  Bringing emotions to time geography: the case of mobilities of poverty , 2012 .

[16]  Rui Li,et al.  A prefetching model based on access popularity for geospatial data in a cluster-based caching system , 2012, Int. J. Geogr. Inf. Sci..

[17]  Elena Verdú,et al.  An OLS regression model for context-aware tile prefetching in a web map cache , 2013, Int. J. Geogr. Inf. Sci..

[18]  Kai Liu,et al.  Optimizing an index with spatiotemporal patterns to support GEOSS Clearinghouse , 2014, Int. J. Geogr. Inf. Sci..

[19]  M. Kwan Gis methods in time‐geographic research: geocomputation and geovisualization of human activity patterns , 2004 .

[20]  Claire Cardie,et al.  Proceedings of the Eighteenth International Conference on Machine Learning, 2001, p. 577–584. Constrained K-means Clustering with Background Knowledge , 2022 .

[21]  Veysi Isler,et al.  Retrospective adaptive prefetching for interactive Web GIS applications , 2011, GeoInformatica.

[22]  Yoo-Sung Kim,et al.  Probability-Based Tile Pre-fetching and Cache Replacement Algorithms for Web Geographical Information Systems , 2001, ADBIS.

[23]  Representing Data Distributions with a Nonparametric Kernel Density: The Way to Estimate the Optimal Oil Contents of Palm Mesocarp at Various Periods , 2013 .

[24]  Johannes Schöning,et al.  Improving interaction with virtual globes through spatial thinking: helping users ask "why?" , 2008, IUI '08.

[25]  Xin Shen,et al.  Geospatial information service based on digital measurable image , 2010, Geo spatial Inf. Sci..

[26]  Danyel Fisher,et al.  Hotmap: Looking at Geographic Attention , 2007, IEEE Transactions on Visualization and Computer Graphics.

[27]  Elias Ioup,et al.  Tile-Based Geospatial Information Systems , 2010 .

[28]  Wolfgang Karl,et al.  Analysis of the Spatial and Temporal Locality in Data Accesses , 2006, International Conference on Computational Science.

[29]  Rui Li,et al.  Simulation and Analysis of Cluster-Based Caching Replacement Based on Temporal and Spatial Locality of Tiles Access , 2013 .

[30]  W. Tobler A Computer Movie Simulating Urban Growth in the Detroit Region , 1970 .

[31]  Bernhard Seeger,et al.  Sort-based query-adaptive loading of R-trees , 2012, CIKM.

[32]  Li Rui Zipf-like Distribution and Its Application to Image Data Tile Request in Digital Earth , 2010 .