A Comparison of Simulated Annealing, Elliptic and Genetic Algorithms for Finding Irregularly Shaped Spatial Clusters

Methods for the detection and evaluation of the statistical significance of spatial clusters are important geographic tools in epidemiology, disease surveillance and crime analysis. Their fundamental role in the elucidation of the etiology of diseases (Lawson, 1999; Heffernan et al., 2004; Andrade et al., 2004), the availability of reliable alarms for the detection of intentional and non-intentional infectious diseases outbreaks (Duczmal and Buckeridge, 2005, 2006a; Kulldorff et al., 2005, 2006) and the analysis of spatial patterns of criminal activities (Ceccato, 2005) are current topics of intense research. The spatial scan statistic (Kulldorff, 1997) and the program SatScan (Kulldorff, 1999) are now widely used by health services to detect disease clusters with circular geometric shape. Contrasting to the naive statistic of the relative count of cases, the scan statistic is less prone to the random variations of cases in small populations. Although the circular scan approach sweeps completely the configuration space of circularly shaped clusters, in many situations we would like to recognize spatial clusters in a much more general geometric setting. Kulldorff et al. (2006) extended the SatScan approach to detect elliptic shaped clusters. It is important to note that for both circular and elliptic scans there is a need to impose size limits for the clusters; this requisite is even more demanding for the other irregularly shaped cluster detectors. Other methods, also using the scan statistic, were proposed recently to detect connected clusters of irregular shape (Duczmal et al., 2004, 2006b, 2007, Iyengar, 2004, Tango & Takahashi, 2005, Assuncao et al., 2006, Neill et al., 2005). Patil & Tallie (2004) used the relative incidence cases count for the objective function. Conley et al. (2005) proposed a genetic algorithm to explore a configuration space of multiple agglomerations of ellipses; Sahajpal et al. (2004) also used a genetic algorithm to find clusters shaped as intersections of circles of different sizes and centers. Two kinds of maps could be employed. The point data set approach assigns one point in the map for each case and for each non-case individual. This approach is interested in finding, among all the allowed geometric shape candidates defined within a specific strategy, the one that encloses the highest ratio of cases vs. non-cases, thus defining the most likely cluster. The second approach assumes that a map is divided into M regions, with total population N and C total cases. Defining the zone z as any set of connected regions, the

[1]  Renato Assunção,et al.  A Simulated Annealing Strategy for the Detection of Arbitrarily Shaped Spatial Clusters , 2022 .

[2]  Vijay S Iyengar,et al.  Space-time clusters with flexible shapes. , 2005, MMWR supplements.

[3]  M. Kulldorff,et al.  An elliptic spatial scan statistic , 2006, Statistics in medicine.

[4]  Andrew W. Moore,et al.  A Bayesian Spatial Scan Statistic , 2005, NIPS.

[5]  Ricardo H. C. Takahashi,et al.  A genetic algorithm for irregularly shaped spatial scan statistics , 2007, Comput. Stat. Data Anal..

[6]  Peter J. Park,et al.  Power comparisons for disease clustering tests , 2003, Comput. Stat. Data Anal..

[7]  M. Kulldorff Spatial Scan Statistics: Models, Calculations, and Applications , 1999 .

[8]  M. Kulldorff,et al.  Multivariate scan statistics for disease surveillance , 2007, Statistics in medicine.

[9]  M. Kulldorff,et al.  Syndromic surveillance in public health practice, New York City. , 2004, Emerging infectious diseases.

[10]  M. Kulldorff,et al.  Evaluation of Spatial Scan Statistics for Irregularly Shaped Clusters , 2006 .

[11]  David L Buckeridge,et al.  A workflow spatial scan statistic. , 2006, Statistics in medicine.

[12]  Simonne Almeida e Silva,et al.  Population-based surveillance of pediatric pneumonia: use of spatial analysis in an urban area of Central Brazil. , 2004, Cadernos de saude publica.

[13]  M. Kulldorff A spatial scan statistic , 1997 .

[14]  T. Tango,et al.  International Journal of Health Geographics a Flexibly Shaped Spatial Scan Statistic for Detecting Clusters , 2005 .

[15]  R. Assunção,et al.  Fast detection of arbitrarily shaped disease clusters , 2006, Statistics in medicine.

[16]  E. Lesaffre,et al.  Disease mapping and risk assessment for public health. , 1999 .

[17]  V. Ceccato,et al.  Homicide in São Paulo, Brazil: Assessing spatial-temporal and weather variations , 2005 .

[18]  Ricardo H. C. Takahashi,et al.  A multiobjective methodology for evaluating genetic operators , 2003 .

[19]  Mark Gahegan,et al.  A Genetic Approach to Detecting Clusters in Point Data Sets , 2005 .

[20]  M Kulldorff,et al.  Spatial disease clusters: detection and inference. , 1995, Statistics in medicine.

[21]  R. Sahajpal,et al.  Applying niching genetic algorithms for multiple cluster discovery in spatial analysis , 2004, International Conference on Intelligent Sensing and Information Processing, 2004. Proceedings of.

[22]  G. P. Patil,et al.  Upper level set scan statistic for detecting arbitrarily shaped hotspots , 2004, Environmental and Ecological Statistics.

[23]  Eduardo G. Carrano,et al.  Electric distribution network multiobjective design using a problem-specific genetic algorithm , 2006, IEEE Transactions on Power Delivery.