Global, local and focused geographic clustering for case-control data with residential histories

Background: This paper introduces a new approach for evaluating clustering in case-control data that accounts for residential histories. Although many statistics have been proposed for assessing local, focused and global clustering in health outcomes, few, if any, exist for evaluating clusters when individuals are mobile.

Methods: Local, global and focused tests for residential histories are developed based on sets of matrices of nearest neighbor relationships that reflect the changing topology of cases and controls. Exposure traces are defined that account for the latency between exposure and disease manifestation, and that use exposure windows whose duration may vary. Several of the methods so derived are applied to evaluate clustering of residential histories in a case-control study of bladder cancer in southeastern Michigan. These data are still being collected, and the analysis is conducted for demonstration purposes only.

Results: Statistically significant clustering of residential histories of cases was found, but it is likely due to delayed reporting of cases by one of the hospitals participating in the study.

Conclusion: Data with residential histories are preferable when causative exposures and disease latencies occur over a time span long enough that human mobility matters. To analyze such data, methods are needed that take residential histories into account.
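To make the idea of time-varying nearest neighbor matrices concrete, the following is a minimal sketch, not the statistics defined in the paper. It assumes a hypothetical data layout in which each residential history is a list of `(move_in_time, x, y)` tuples, rebuilds a k-nearest-neighbor matrix at a chosen time, and counts how often a case has another case among its k nearest neighbors. The function names (`locations_at`, `knn_matrix`, `case_neighbor_count`, `summed_case_neighbor_count`) and the simple sum over time points are illustrative assumptions.

```python
import numpy as np

def locations_at(histories, t):
    """Return an (n, 2) array with each person's residence at time t.

    Assumed layout: histories[i] is a list of (move_in_time, x, y) tuples
    sorted by time, and every person has a residence on or before t.
    """
    coords = []
    for moves in histories:
        current = [m for m in moves if m[0] <= t][-1]  # latest move on or before t
        coords.append((current[1], current[2]))
    return np.array(coords, dtype=float)

def knn_matrix(coords, k):
    """Binary matrix whose entry (i, j) is 1 if j is among i's k nearest neighbors."""
    d = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=2)
    np.fill_diagonal(d, np.inf)  # a person is never their own neighbor
    idx = np.argsort(d, axis=1, kind="stable")[:, :k]
    nn = np.zeros(d.shape, dtype=int)
    nn[np.arange(len(coords))[:, None], idx] = 1
    return nn

def case_neighbor_count(histories, is_case, k, t):
    """Count (case, case) pairs in the k-nearest-neighbor matrix at time t."""
    c = np.asarray(is_case, dtype=int)
    nn = knn_matrix(locations_at(histories, t), k)
    return int(c @ nn @ c)

def summed_case_neighbor_count(histories, is_case, k, times):
    """Illustrative global summary: sum the per-time counts over the given times."""
    return sum(case_neighbor_count(histories, is_case, k, t) for t in times)

# Hypothetical toy data: two cases and two controls; the second case moves
# next to the first case at time 5.
histories = [
    [(0, 0.0, 0.0)],
    [(0, 10.0, 0.0), (5, 1.0, 0.0)],
    [(0, 3.0, 0.0)],
    [(0, 12.0, 0.0)],
]
is_case = [1, 1, 0, 0]
before_move = case_neighbor_count(histories, is_case, k=1, t=0)
after_move = case_neighbor_count(histories, is_case, k=1, t=5)
```

In this toy example the count rises once the second case moves next to the first, which is the kind of change in neighbor topology that an analysis based on a single residence per person would miss. In practice, significance would need to be judged against a null distribution, for example by randomly reassigning case labels, which this sketch omits.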
