Power evaluation of disease clustering tests

BackgroundMany different test statistics have been proposed to test for spatial clustering. Some of these statistics have been widely used in various applications. In this paper, we use an existing collection of 1,220,000 simulated benchmark data, generated under 51 different clustering models, to compare the statistical power of several disease clustering tests. These tests are Besag-Newell's R, Cuzick-Edwards' k-Nearest Neighbors (k-NN), the spatial scan statistic, Tango's Maximized Excess Events Test (MEET), Swartz' entropy test, Whittemore's test, Moran's I and a modification of Moran's I.ResultsExcept for Moran's I and Whittemore's test, all other tests have good power for detecting some kind of clustering. The spatial scan statistic is good at detecting localized clusters. Tango's MEET is good at detecting global clustering. With appropriate choice of parameter, Besag-Newell's R and Cuzick-Edwards' k-NN also perform well.ConclusionThe power varies greatly for different test statistics and alternative clustering models. Consideration of the power is important before we decide which test statistic to use.

[1]  Werner Vach,et al.  Locally optimal tests on spatial clustering , 1994 .

[2]  Julian Besag,et al.  The Detection of Clusters in Rare Diseases , 1991 .

[3]  M Kulldorff,et al.  Spatial disease clusters: detection and inference. , 1995, Statistics in medicine.

[4]  P. Meirmans,et al.  Spatial ecological and genetic structure of a mixed population of sexual diploid and apomictic triploid dandelions , 2003, Journal of evolutionary biology.

[5]  Peter J. Park,et al.  Power comparisons for disease clustering tests , 2003, Comput. Stat. Data Anal..

[6]  J. Cuzick,et al.  Spatial clustering for inhomogeneous populations , 1990 .

[7]  A. Whittemore,et al.  A test to detect clusters of disease , 1987 .

[8]  N. Oden,et al.  Adjusting Moran's I for population density. , 1995, Statistics in medicine.

[9]  B K Szymanski,et al.  Lyme disease in New York State: spatial pattern at a regional scale. , 2001, The American journal of tropical medicine and hygiene.

[10]  H Becher,et al.  Clustering of childhood mortality in rural Burkina Faso. , 2001, International journal of epidemiology.

[11]  Michael P Ward,et al.  Use of spatial statistics and monitoring data to identify clustering of bovine tuberculosis in Argentina. , 2002, Preventive veterinary medicine.

[12]  T Tango,et al.  A class of tests for detecting 'general' and 'focused' clustering of rare diseases. , 1995, Statistics in medicine.

[13]  S. Merhar,et al.  Letter to the editor , 2005, IEEE Communications Magazine.

[14]  K. Sharples,et al.  An assessment of spatial clustering of leukaemias and lymphomas among young people in New Zealand. , 1999, Journal of epidemiology and community health.

[15]  T. Tango,et al.  A test for spatial disease clustering adjusted for multiple testing. , 2000, Statistics in medicine.

[16]  Rosemary J. Day,et al.  Disease Mapping and Risk Assessment for Public Health , 1999 .

[17]  Peter A. Rogerson,et al.  The Detection of Clusters Using a Spatial Version of the Chi‐Square Goodness‐of‐Fit Statistic , 1999 .

[18]  P. Moran Notes on continuous stochastic phenomena. , 1950, Biometrika.

[19]  J. Madigan,et al.  Association of Ixodes pacificus (Acari: ixodidae) with the spatial and temporal distribution of equine granulocytic ehrlichiosis in California. , 1999, Journal of medical entomology.

[20]  G. Elwinger,et al.  Spatiotemporal Genetic Structure within White Clover Populations in Grazed Swards , 2003 .

[21]  Robert Heimer,et al.  Spatial Analysis of Human Granulocytic Ehrlichiosis near Lyme, Connecticut , 2002, Emerging infectious diseases.

[22]  J. Viel,et al.  Soft-tissue sarcoma and non-Hodgkin's lymphoma clusters around a municipal solid waste incinerator with high dioxin emission levels. , 2000, American journal of epidemiology.

[23]  J B Swartz,et al.  An entropy-based algorithm for detecting clusters of cases and controls and its comparison with a method using nearest neighbours. , 1998, Health & place.

[24]  Jessica Gurevitch,et al.  Ecography 25: 553 -- 557, 2002 , 2022 .

[25]  H. Piégay,et al.  PRATIQUE DE L'ANALYSE DE L'AUTOCORRÉLATION SPATIALE EN GÉOMORPHOLOGIE : DÉFINITIONS OPÉRATOIRES ET TESTS , 2004 .

[26]  M Kulldorff Mathematical formula for Swartz' Entropy Test Statistic. , 1999, Health & place.

[27]  B. Richardson,et al.  Spatial analysis of genetic variation as a rapid assessment tool in the conservation management of narrow-range endemics , 2002 .

[28]  M. Dwass Modified Randomization Tests for Nonparametric Hypotheses , 1957 .

[29]  Chuan Yi Tang,et al.  A 2.|E|-Bit Distributed Algorithm for the Directed Euler Trail Problem , 1993, Inf. Process. Lett..

[30]  M. Miller,et al.  Coastal freshwater runoff is a risk factor for Toxoplasma gondii infection of southern sea otters (Enhydra lutris nereis). , 2002, International journal for parasitology.