Effect of relative risk and cluster configuration on the power of the one-dimensional scan statistic.

The scan test for clustering in time is based on the maximum number of events in an interval (window) of width w as the window moves across the entire time frame. Power estimates of the scan statistic are simulated for a variety of epidemiologically motivated situations. Two cluster configurations are used: a rectangular pulse, and a triangular pulse designed to emulate environmental contamination. For a rectangular pulse, the relative risk R of disease in the cluster region is R-fold as high as it is for the background region. The power is strongly influenced by the sample size, the relative risk, and the width or duration of the cluster region, whereas the effect of the cluster configuration is small. Using a 5 per cent significance level, a relative risk of 4, a standardized cluster duration of 0.10, a relative window width of 1.5, and a (non-random) sample size of 50, the simulated power is approximately 80 per cent, indicating that the minimum sample size in the cluster region for adequate power is in the 12-32 range for values of the parameters used in this study.

[1]  Nathan Mantel,et al.  A Statistical Problem in Space and Time: Do Leukemia Cases Come in Clusters? , 1964 .

[2]  Ronald L. Iman,et al.  On a method for detecting clusters of possible uranium deposits , 1979 .

[3]  S Wallenstein,et al.  Use of the scan statistic to detect time-space clustering. , 1989, American journal of epidemiology.

[4]  R C Grimson,et al.  Clustering in sparse data and an analysis of rhabdomyosarcoma incidence. , 1992, Statistics in medicine.

[5]  D. Wartenberg,et al.  Detecting disease clusters: the importance of statistical power. , 1990, American journal of epidemiology.

[6]  T. Aldrich,et al.  Cancer clusters: a myth or a method. , 1983, Medical hypotheses.

[7]  Joseph Glaz,et al.  Approximations and Bounds for the Distribution of the Scan Statistic , 1989 .

[8]  Roger Marshall,et al.  A Review of Methods for the Statistical Analysis of Spatial Patterns of Disease , 1991 .

[9]  S Wallenstein,et al.  Power of the scan statistic for detection of clustering. , 1993, Statistics in medicine.

[10]  T. Aldrich,et al.  The status of cancer cluster investigations undertaken by state health departments. , 1988, American journal of public health.

[11]  Giovanni Radaelli Using the Cuscore technique in the surveillance of rare health events , 1992 .

[12]  S Wallenstein,et al.  An approximation for the distribution of the scan statistic. , 1987, Statistics in medicine.

[13]  J. Naus The Distribution of the Size of the Maximum Cluster of Points on a Line , 1965 .

[14]  N. Mantel The detection of disease clustering and a generalized regression approach. , 1967, Cancer research.

[15]  S Wallenstein,et al.  A test for detection of clustering over time. , 1980, American journal of epidemiology.

[16]  D F Stroup,et al.  Detection of aberrations in the occurrence of notifiable diseases surveillance data. , 1989, Statistics in medicine.

[17]  S. Thacker,et al.  The public health importance of clusters: a note from the Centers for Disease Control. , 1990, American journal of epidemiology.

[18]  Joseph Naus,et al.  Power Comparison of Two Tests of Non-Random Clustering , 1966 .

[19]  J. Glaz Approximations for tail probabilities and moments of the scan statistic , 1992 .