PULSE , Progressive Upper Level Set Scan Statistic for Geospatial Hotspot Detection

This paper presents a scan statistic, progressive upper level set (PULSE) scan statistic, for geospatial hotspot detection and its software implementation. Like ULS, the PULSE scan statistic is based on the arbitrarily shaped scan window and can be adapted for a network setting. PULSE is a refinement of the upper level set (ULS) scan statistic. Like some other likelihood based scanning devices, the ULS scan statistic identifies maximum likelihood estimate (MLE) zones that tend to be ‘stringy’ and sprawling. Its search path increases possibility of inclusion of extraneous cells in its MLE zones and, to a smaller extent, of exclusion of cells that belong to a true hotspot from its MLE zone. The PULSE scan statistic achieves improvement over the ULS scan statistic in two ways. First, it begins its search for a most likely zone with a large population of candidate zones obtained by modifying the ULS tree structure and continues its search using a genetic algorithm. Secondly, to reduce chances of generating an MLE that is excessively stringy and that includes extraneous cells in the MLE zone, PULSE uses cardinality and compactness of zones along with their likelihoods as the fitness function in the genetic algorithm and uses several pertinent criteria including evenness of intra-zone cellular response ratios to determine the MLE zone. To reduce computation, Gumbel distribution of extreme values is used to determine the p-value of the MLE zone. Better results come at the cost of increased processing time. An evaluative performance study is presented.

[1]  M. Kulldorff,et al.  Dead Bird Clusters as an Early Warning System for West Nile Virus Activity , 2003, Emerging infectious diseases.

[2]  Renato Assunção,et al.  A Simulated Annealing Strategy for the Detection of Arbitrarily Shaped Spatial Clusters , 2022 .

[3]  M. Kulldorff,et al.  An elliptic spatial scan statistic , 2006, Statistics in medicine.

[4]  Ricardo H. C. Takahashi,et al.  A genetic algorithm for irregularly shaped spatial scan statistics , 2007, Comput. Stat. Data Anal..

[5]  Noel A Cressie,et al.  Statistics for Spatial Data. , 1992 .

[6]  G. P. Patil,et al.  Spatially constrained clustering and upper level set scan hotspot detection in surveillance geoinformatics , 2006, Environmental and Ecological Statistics.

[7]  M Kulldorff,et al.  Spatial disease clusters: detection and inference. , 1995, Statistics in medicine.

[8]  S. Nadarajah,et al.  Extreme Value Distributions: Theory and Applications , 2000 .

[9]  Ganapati P. Patil,et al.  ULS Scan Statistic for Hotspot Detection with Continuous Gamma Response , 2009 .

[10]  S. R. Jammalamadaka,et al.  Scan Statistics and Applications , 2000 .

[11]  Martin Kulldorff,et al.  Prospective time periodic geographical disease surveillance using a scan statistic , 2001 .

[12]  S. Wallenstein Joseph Naus: Father of the Scan Statistic , 2009 .

[13]  Mike Rees,et al.  5. Statistics for Spatial Data , 1993 .

[14]  Shashi Phoha,et al.  Digital governance, hotspot geoinformatics, and sensor networks for monitoring, etiology, early warning, and sustainable management , 2009 .

[15]  G. P. Patil,et al.  Upper level set scan statistic for detecting arbitrarily shaped hotspots , 2004, Environmental and Ecological Statistics.

[16]  Darrell Whitley,et al.  A genetic algorithm tutorial , 1994, Statistics and Computing.

[17]  M. Kulldorff,et al.  Evaluation of Spatial Scan Statistics for Irregularly Shaped Clusters , 2006 .

[18]  Allyson M. Abrams,et al.  Empirical/Asymptotic P-Values for Monte Carlo-Based Hypothesis Testing: an Application to Cluster Detection Using the Scan Statistic , 2006 .

[19]  T. Tango,et al.  International Journal of Health Geographics a Flexibly Shaped Spatial Scan Statistic for Detecting Clusters , 2005 .

[20]  G. P. Patil,et al.  Digital governance, hotspot geoinformatics, and sustainable development: A Preface , 2010, Environmental and Ecological Statistics.

[21]  Ganapati P. Patil,et al.  Dictionary and Classified Bibliography of Statistical Distributions in Scientific Work , 1987 .

[22]  Narayanaswamy Balakrishnan,et al.  Scan Statistics and Applications , 2012 .

[23]  G. P. Patil,et al.  Multiple indicators, partially ordered sets, and linear extensions: Multi-criterion ranking and prioritization , 2004, Environmental and Ecological Statistics.

[24]  Ganapati P. Patil,et al.  Geographic and Network Surveillance via Scan Statistics for Critical Area Detection , 2003 .

[25]  M. Kulldorff A spatial scan statistic , 1997 .

[26]  Vladimir Pozdnyakov,et al.  Scan Statistics: Methods and Applications , 2009 .