Clustering and Hot Spot Detection in Socio-economic Spatio-temporal Data

Distribution of socio-economic features in urban space is an important source of information for land and transportation planning. The metropolization phenomenon has changed the distribution of types of professions in space and has given birth to different spatial patterns that the urban planner must know in order to plan a sustainable city. Such distributions can be discovered by statistical and learning algorithms through different methods. In this paper, an unsupervised classification method and a cluster detection method are discussed and applied to analyze the socio-economic structure of Switzerland. The unsupervised classification method, based on Ward's classification and self-organized maps, is used to classify the municipalities of the country and allows to reduce a highly-dimensional input information to interpret the socio-economic landscape. The cluster detection method, the spatial scan statistics, is used in a more specific manner in order to detect hot spots of certain types of service activities. The method is applied to the distribution services in the agglomeration of Lausanne. Results show the emergence of new centralities and can be analyzed in both transportation and social terms.

[1]  Martin Charlton,et al.  A Mark 1 Geographical Analysis Machine for the automated analysis of point data sets , 1987, Int. J. Geogr. Inf. Sci..

[2]  B. Turnbull,et al.  Monitoring for clusters of disease: application to leukemia incidence in upstate New York. , 1990, American journal of epidemiology.

[3]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[4]  Daniel Baier,et al.  Innovations in Classification, Data Science, and Information Systems , 2005 .

[5]  L. Anselin Local Indicators of Spatial Association—LISA , 2010 .

[6]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[7]  M. Kulldorff A spatial scan statistic , 1997 .

[8]  Maged N Kamel Boulos,et al.  The use of interactive graphical maps for browsing medical/health Internet information resources , 2003, International journal of health geographics.

[9]  Alfred Ultsch,et al.  Pareto Density Estimation: A Density Estimation for Knowledge Discovery , 2005 .

[10]  Gareth O. Roberts,et al.  Robust Markov chain Monte Carlo Methods for Spatial Generalized Linear Mixed Models , 2006 .

[11]  Kurt H Riitters,et al.  Geographic Analysis of Forest Health Indicators Using Spatial Scan Statistics , 2003, Environmental management.

[12]  Martin Kulldorff,et al.  Cancer map patterns: are they random or not? , 2006, American journal of preventive medicine.

[13]  Alfred Ultsch,et al.  Pareto Density Estimation: Probability Density Estimation for Knowledge Discovery , 2003 .

[14]  Michael Batty,et al.  Cities and Complexity: Understanding Cities with Cellular Automata, Agent-Based Models, and Fractals , 2007 .

[15]  Martin Kulldorff,et al.  Power evaluation of disease clustering tests , 2003, International journal of health geographics.

[16]  Robert Haining,et al.  Crime in Border Regions: The Scandinavian Case of Öresund, 1998–2001 , 2004, Annals of the Association of American Geographers.

[17]  A. Ultsch Maps for the Visualization of high-dimensional Data Spaces , 2003 .

[18]  E. Lesaffre,et al.  Disease mapping and risk assessment for public health. , 1999 .

[19]  F. Benjamin Zhan,et al.  A Comparison of Three Exploratory Methods for Cluster Detection in Spatial Point Patterns , 2010 .

[20]  Dirk Helbing,et al.  Scaling laws in urban supply networks , 2006 .

[21]  Sylvie Thiria,et al.  Detecting decadal changes in ENSO using neural networks , 2006 .

[22]  W. F. Athas,et al.  Evaluating cluster alarms: a space-time scan statistic and brain cancer in Los Alamos, New Mexico. , 1998, American journal of public health.

[23]  M. Kulldorff,et al.  Evaluation of Spatial Scan Statistics for Irregularly Shaped Clusters , 2006 .

[24]  Mark Gahegan,et al.  A Genetic Approach to Detecting Clusters in Point Data Sets , 2005 .

[25]  Peter J. Park,et al.  Power comparisons for disease clustering tests , 2003, Comput. Stat. Data Anal..

[26]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[27]  Alfred Ultsch,et al.  The architecture of emergent self-organizing maps to reduce projection errors , 2005, ESANN.