Individual Movements and Geographical Data Mining. Clustering Algorithms for Highlighting Hotspots in Personal Navigation Routes

The rapid developments in the availability and access to spatially referenced information in a variety of areas, has induced the need for better analysis techniques to understand the various phenomena. In particular our analysis represents a first insight into a wealth of geographical data collected by individuals as activity dairy data. The attention is drawn on point datasets corresponding to GPS traces driven along a same route in different days. Our aim here is to explore the presence of clusters along the route, trying to understand the origins and motivations behind that in order to better understand the road network structure in terms of 'dense' spaces along the network. In this paper the attention is therefore focused on methods to highlight such clusters and see their impact on the network structure. Spatial clustering algorithms are examined (DBSCAN) and a comparison with other non-parametric density based algorithm (Kernel Density Estimation) is performed. A test is performed over the urban area of Trieste (Italy).

[1]  David Taniar,et al.  Computational Science and Its Applications - ICCSA 2005, International Conference, Singapore, May 9-12, 2005, Proceedings, Part I , 2005, ICCSA.

[2]  Neil Stuart,et al.  When is a hotspot a hotspot? A procedure for creating statistically robust hotspot maps of crime , 2002 .

[3]  Jean-Claude Thill,et al.  Comparison of planar and network K-functions in traffic accident analysis , 2004 .

[4]  David Taniar,et al.  Computational Science and Its Applications - ICCSA 2008, International Conference, Perugia, Italy, June 30 - July 3, 2008, Proceedings, Part I , 2008, ICCSA.

[5]  Ping Li,et al.  Using Greedy algorithm: DBSCAN revisited II , 2004, Journal of Zhejiang University. Science.

[6]  B. Silverman Density estimation for statistics and data analysis , 1986 .

[7]  Hans-Peter Kriegel,et al.  Density-Based Clustering in Spatial Databases: The Algorithm GDBSCAN and Its Applications , 1998, Data Mining and Knowledge Discovery.

[8]  Mike Rees,et al.  5. Statistics for Spatial Data , 1993 .

[9]  P. Diggle,et al.  Spatial point pattern analysis and its application in geographical epidemiology , 1996 .

[10]  Giuseppe Borruso,et al.  Network Density Estimation: A GIS Approach for Analysing Point Patterns in a Network Space , 2008, Trans. GIS.

[11]  Beniamino Murgante,et al.  Kernel Density Estimation Methods for a Geostatistical Approach in Seismic Risk Analysis: The Case Study of Potenza Hilltop Town (Southern Italy) , 2008, ICCSA.

[12]  Giuseppe Borruso,et al.  Network Density Estimation: Analysis of Point Patterns over a Network , 2005, ICCSA.

[13]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[14]  Gabriella Schoier,et al.  A Clustering Method for Large Spatial Databases , 2004, ICCSA.

[15]  Atsuyuki Okabe,et al.  Uniform network transformation for points pattern analysis on a non-uniform network , 2006, J. Geogr. Syst..

[16]  David O'Sullivan,et al.  Geographic Information Analysis , 2002 .

[17]  Mohamed A. Ismail,et al.  An efficient density based clustering algorithm for large databases , 2004, 16th IEEE International Conference on Tools with Artificial Intelligence.

[18]  Trevor C. Bailey,et al.  Interactive Spatial Data Analysis , 1995 .

[19]  Noel A Cressie,et al.  Statistics for Spatial Data. , 1992 .

[20]  Stan Openshaw,et al.  Census users' handbook , 1995 .

[21]  Antonio Laganà,et al.  Computational Science and Its Applications – ICCSA 2004 , 2004, Lecture Notes in Computer Science.

[22]  V. A. Epanechnikov Non-Parametric Estimation of a Multivariate Probability Density , 1969 .

[23]  Deyi Zhou,et al.  A new clustering algorithm based on distance and density , 2005, Proceedings of ICSSSM '05. 2005 International Conference on Services Systems and Services Management, 2005..