Ecologic studies revisited.

Ecologic studies use data aggregated over groups rather than data on individuals. Such studies are popular because they use existing databases and can offer large exposure variation if the data arise from broad geographical areas. Unfortunately, the aggregation of data that define ecologic studies results in an information loss that can lead to ecologic bias. Specifically, ecologic bias arises from the inability of ecologic data to characterize within-area variability in exposures and confounders. We describe in detail particular forms of ecologic bias so that their potential impact on any particular study may be assessed. The only way to overcome such bias, while avoiding uncheckable assumptions concerning the missing information, is to supplement the ecologic with individual-level information, and we outline a number of proposals that may achieve this aim.

[1]  J E White,et al.  A two stage design for the study of the relationship between a rare exposure and a rare disease. , 1982, American journal of epidemiology.

[2]  A. M. Walker,et al.  Anamorphic analysis: sampling and estimation for covariate effects when both exposure and disease are known. , 1982, Biometrics.

[3]  D B Rubin,et al.  Difficulties with regression analyses of age-adjusted rates. , 1984, Biometrics.

[4]  D Hémon,et al.  Comparison of relative risks obtained in ecological and individual studies: some methodological considerations. , 1987, International journal of epidemiology.

[5]  Norman E. Breslow,et al.  Logistic regression for two-stage case-control data , 1988 .

[6]  S. Piantadosi,et al.  The ecological fallacy. , 1988, American journal of epidemiology.

[7]  S Greenland,et al.  Ecological bias, confounding, and effect modification. , 1989, International journal of epidemiology.

[8]  N. Cressie,et al.  Spatial Modeling of Regional Variables , 1993 .

[9]  J. Besag,et al.  Bayesian image restoration, with two applications in spatial statistics , 1991 .

[10]  S. Greenland,et al.  Effects of nondifferential exposure misclassification in ecologic studies. , 1992, American Journal of Epidemiology.

[11]  I Kleinschmidt,et al.  The Small Area Health Statistics Unit: a national facility for investigating health around point sources of environmental pollution in the United Kingdom. , 1992, Journal of epidemiology and community health.

[12]  D. Dockery,et al.  An association between air pollution and mortality in six U.S. cities. , 1993, The New England journal of medicine.

[13]  C Montomoli,et al.  Spatial correlation in ecological analysis. , 1993, International journal of epidemiology.

[14]  J. Robins,et al.  Invited commentary: ecologic studies--biases, misconceptions, and counterexamples. , 1994, American journal of epidemiology.

[15]  D Thomas,et al.  Design and analysis of multilevel analytic studies with applications to a study of air pollution. , 1994, Environmental health perspectives.

[16]  L. Sheppard,et al.  On the reliability and precision of within- and between- population estimates of relative rate parameters. , 1995, Biometrics.

[17]  H. Morgenstern,et al.  Ecologic studies in epidemiology: concepts, principles, and methods. , 1995, Annual review of public health.

[18]  B. Cohen,et al.  Divergent biases in ecologic and individual level studies. , 1995, Statistics in medicine.

[19]  Ross L. Prentice,et al.  Aggregate data studies of disease risk factors , 1995 .

[20]  I Kleinschmidt,et al.  Cancer incidence near municipal solid waste incinerators in Great Britain , 1996, British Journal of Cancer.

[21]  David G Steel,et al.  Analysing and Adjusting Aggregation Effects: The Ecological Fallacy Revisited , 1996 .

[22]  R L Prentice,et al.  Design considerations for estimation of exposure effects on disease risk, using aggregate data studies. , 1996, Statistics in medicine.

[23]  N. Künzli,et al.  The semi-individual study in air pollution epidemiology: a valid design as compared to ecologic studies. , 1997, Environmental health perspectives.

[24]  G. Shaddick,et al.  Small-area study of the incidence of neoplasms of the brain and central nervous system among adults in the West Midlands region, 1974-86. Small Area Health Statistics Unit. , 1997, British Journal of Cancer.

[25]  The semi-individual study in air pollution epidemiology: a valid design as compared to ecologic studies. , 1997 .

[26]  D. Freedman,et al.  A solution to the ecological inference problem , 1997 .

[27]  K. Judge,et al.  Income inequality and population health. , 1998, Social science & medicine.

[28]  John Fox,et al.  A Life Course Approach to Chronic Disease Epidemiology , 1998, BMJ.

[29]  T. C. Haas,et al.  Model-based geostatistics - Discussion , 1998 .

[30]  J Wakefield,et al.  Magnesium in drinking water supplies and mortality from acute myocardial infarction in north west England , 1999, Heart.

[31]  P Elliott,et al.  Issues in the statistical analysis of small area health data. , 1999, Statistics in medicine.

[32]  Nilanjan Chatterjee,et al.  Design and analysis of two‐phase studies with binary outcome applied to Wilms tumour prognosis , 1999 .

[33]  Bradley P. Carlin,et al.  Spatio-Temporal Hierarchical Models for Analyzing Atlanta Pediatric Asthma ER Visit Rates , 1999 .

[34]  N. G. Best,et al.  Spatial Poisson Regression for Health and Exposure Data Measured at Disparate Resolutions , 2000 .

[35]  Norman E. Breslow,et al.  Estimation of Disease Rates in Small Areas: A new Mixed Model for Spatial Dependence , 2000 .

[36]  E. Doorslaer,et al.  Income inequality and health: what does the literature tell us? , 2000, Annual review of public health.

[37]  G. Smith,et al.  Infant mortality, stomach cancer, stroke, and coronary heart disease: ecological analysis , 2000, BMJ : British Medical Journal.

[38]  C Guihenneuc-Jouyaux,et al.  Biases in ecological studies: utility of including within-area distribution of confounders. , 2000, Statistics in medicine.

[39]  J. Wakefield,et al.  Spatial epidemiology: methods and applications. , 2000 .

[40]  M. Lippmann,et al.  Toxicological bases for the setting of health-related air pollution standards. , 2000, Annual review of public health.

[41]  S. Richardson,et al.  Ecological correlation studies , 2001 .

[42]  Jon Wakefield,et al.  Ecological regression analysis of environmental benzene exposure and childhood leukaemia: sensitivity to data inaccuracies, geographical scale and ecological bias , 2001 .

[43]  P. Elliott,et al.  Bias and confounding in spatial epidemiology , 2001 .

[44]  Ruth Salway,et al.  A statistical framework for ecological and aggregate studies , 2001 .

[45]  S. Greenland Ecologic versus individual-level sources of bias in ecologic estimates of contextual health effects. , 2001, International journal of epidemiology.

[46]  J. Wakefield,et al.  Modeling Spatial Variation in Disease Risk , 2002 .

[47]  R. Waagepetersen,et al.  Bayesian Prediction of Spatial Count Data Using Generalized Linear Mixed Models , 2002, Biometrics.

[48]  Sander Greenland,et al.  A review of multilevel theory for ecologic analyses , 2002, Statistics in medicine.

[49]  Sara L McLafferty,et al.  GIS and health care. , 2003, Annual review of public health.

[50]  Allen Cheadle,et al.  Combining Aggregate and Individual Level Data to Estimate an Individual Level Correlation Coefficient , 2003 .

[51]  Gerard Rushton,et al.  Public health, GIS, and spatial analytic tools. , 2003, Annual review of public health.

[52]  C. M. Croner,et al.  Public Health GIS and the Internet , 2003, Annual review of public health.

[53]  Ellen K Cromley,et al.  GIS and disease. , 2003, Annual review of public health.

[54]  Jon Wakefield,et al.  Sensitivity Analyses for Ecological Regression , 2003, Biometrics.

[55]  Lianne Sheppard,et al.  Insights on bias and information in group-level studies. , 2003, Biostatistics.

[56]  T. Ricketts Geographic information systems and public health. , 2003, Annual review of public health.

[57]  S. Scobie Spatial epidemiology: methods and applications , 2003 .

[58]  Eric J. Beh,et al.  The Information in Aggregate Data , 2004 .

[59]  Gillian Bartlett,et al.  Bias due to aggregation of individual covariates in the Cox regression model. , 2003, American journal of epidemiology.

[60]  L. Waller,et al.  Applied Spatial Statistics for Public Health Data: Waller/Applied Spatial Statistics , 2004 .

[61]  R. Burnett,et al.  Imputing Unmeasured Explanatory Variables in Environmental Epidemiology With Application to Health Impact Analysis of Air Pollution , 1998, Environmental and Ecological Statistics.

[62]  L. Waller,et al.  Applied Spatial Statistics for Public Health Data , 2004 .

[63]  Duncan C. Thomas,et al.  Statistical Issues in Studies of the Long-Term Effects of Air Pollution: The Southern California Children’s Health Study , 2004 .

[64]  J. Forster Ecological inference for 2 × 2 tables - Discussion , 2004 .

[65]  Jon Wakefield,et al.  Ecological inference for 2 × 2 tables , 2004 .

[66]  Altaf Arain,et al.  A review and evaluation of intraurban air pollution exposure models , 2005, Journal of Exposure Analysis and Environmental Epidemiology.

[67]  Ruth Salway,et al.  Sources of bias in ecological studiesof non-rare events , 2005, Environmental and Ecological Statistics.

[68]  Sylvia Richardson,et al.  Improving ecological inference using individual‐level data , 2006, Statistics in medicine.

[69]  Jon Wakefield,et al.  Health-exposure modeling and the ecological fallacy. , 2005, Biostatistics.

[70]  Sebastien J-P A Haneuse,et al.  Hierarchical Models for Combining Ecological and Case–Control Data , 2007, Biometrics.

[71]  Jon Wakefield,et al.  Disease mapping and spatial regression with count data. , 2007, Biostatistics.

[72]  J. Wakefield,et al.  The interpretation of exposure effect estimates in chronic air pollution studies , 2007, Statistics in medicine.

[73]  S. Richardson,et al.  Hierarchical related regression for combining aggregate and individual data in studies of socio‐economic disease risk factors , 2007 .

[74]  Michael P Walsh,et al.  Ancillary benefits for climate change mitigation and air pollution control in the world's motor vehicle fleets. , 2008, Annual review of public health.

[75]  Jon Wakefield,et al.  A hybrid model for reducing ecological bias. , 2008, Biostatistics.

[76]  M. Stoto,et al.  Regionalization of local public health systems in the era of preparedness. , 2008, Annual review of public health.

[77]  Ronald J Ozminkowski,et al.  The health and cost benefits of work site health-promotion programs. , 2008, Annual review of public health.

[78]  Jon Wakefield,et al.  Alleviating linear ecological bias and optimal design with subsample data , 2007, Journal of the Royal Statistical Society. Series A,.

[79]  Sebastien J-P A Haneuse,et al.  The Combination of Ecological and Case-Control Data. , 2006, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[80]  E. Maibach,et al.  The effectiveness of mass communication to change public behavior. , 2008, Annual review of public health.

[81]  S Haneuse,et al.  Geographic‐based ecological correlation studies using supplemental case–control data , 2008, Statistics in medicine.

[82]  R. Kessler,et al.  The descriptive epidemiology of commonly occurring mental disorders in the United States. , 2008, Annual review of public health.

[83]  Stan Openshaw,et al.  Modifiable Areal Unit Problem , 2008, Encyclopedia of GIS.

[84]  R. Pasick,et al.  A critical review of theory in breast cancer screening promotion across cultures. , 2008, Annual review of public health.

[85]  Karen Glanz,et al.  Creating healthy food and eating environments: policy and environmental approaches. , 2008, Annual review of public health.

[86]  W. S. Robinson,et al.  Ecological correlations and the behavior of individuals. , 1950, International journal of epidemiology.