Sources of bias in ecological studiesof non-rare events

Ecological studies investigate relationships at the level of the group, rather than at the level of the individual. Although such studies are a common design in epidemiology, it is well-known that estimates may be subject to ecological bias. Most discussion of ecological bias has focused on rare disease events, where the tractability of the loglinear model allows some characterization of the nature of different biases. This paper concentrates on non-rare events, where the Poisson approximation to the binomial distribution is not appropriate. We limit the discussion to bias that arises from within-area variability in exposures and confounders. Our aims are to investigate the likely sizes and directions of bias and, where possible, to suggest methods for controlling the bias or for addressing the sensitivity of inference to assumptions on the nature of the bias. We illustrate that for non-rare events it is much more difficult to characterize the direction of bias than in the rare case. A series of simple numerical examples based on a chronic study of respiratory health illustrate the ideas of the paper.

[1]  Bradley P. Carlin,et al.  Hierarchical regression with misaligned spatial data: relating ambient ozone and pediatric asthma ER visits in Atlanta , 2003 .

[2]  L. A. Goodman Ecological Regressions and Behavior of Individuals , 1953 .

[3]  Ross L. Prentice,et al.  Aggregate data studies of disease risk factors , 1995 .

[4]  J. Pearl,et al.  Confounding and Collapsibility in Causal Inference , 1999 .

[5]  R L Prentice,et al.  Design considerations for estimation of exposure effects on disease risk, using aggregate data studies. , 1996, Statistics in medicine.

[6]  S. Greenland,et al.  Correcting for Non‐Differential Misclassification in Ecologic Analyses , 1993 .

[7]  J. Morris,et al.  Patterns of mortality in middle and early old age in the county boroughs of England and Wales. , 1969, British journal of preventive & social medicine.

[8]  S. Greenland,et al.  Effects of nondifferential exposure misclassification in ecologic studies. , 1992, American Journal of Epidemiology.

[9]  Lianne Sheppard,et al.  Overcoming biases and misconceptions in ecological studies , 2001 .

[10]  B. Cohen,et al.  Divergent biases in ecologic and individual level studies. , 1995, Statistics in medicine.

[11]  W. R. Buckland,et al.  Distributions in Statistics: Continuous Multivariate Distributions , 1973 .

[12]  Ruth Salway,et al.  A common framework for ecological inference in epidemiology, political science and sociology , 2004 .

[13]  J. Wakefield Ecological inference for 2 × 2 tables (with discussion) , 2004 .

[14]  David Clayton,et al.  Estimation of Population Exposure in Ecological Studies , 1996 .

[15]  Leo A. Goodman,et al.  Some Alternatives to Ecological Correlation , 1959, American Journal of Sociology.

[16]  S. Richardson,et al.  Statistical methods for geographical correlation studies , 1996 .

[17]  W. R. Buckland,et al.  Distributions in Statistics: Continuous Multivariate Distributions , 1973 .

[18]  Jonathan Wakefield,et al.  A critique of statistical aspects of ecological studies in spatial epidemiology , 2004, Environmental and Ecological Statistics.

[19]  J. Robins,et al.  Invited commentary: ecologic studies--biases, misconceptions, and counterexamples. , 1994, American journal of epidemiology.

[20]  Duncan C. Thomas,et al.  Statistical Issues in Studies of the Long-Term Effects of Air Pollution: The Southern California Children’s Health Study , 2004 .

[21]  Jon Wakefield,et al.  Ecological inference for 2 × 2 tables , 2004 .

[22]  S. Richardson,et al.  Ecological correlation studies , 2001 .

[23]  S. Piantadosi,et al.  The ecological fallacy. , 1988, American journal of epidemiology.

[24]  Ruth Salway,et al.  A statistical framework for ecological and aggregate studies , 2001 .

[25]  N. Jewell,et al.  A geometric approach to assess bias due to omitted covariates in generalized linear models , 1993 .

[26]  S Greenland,et al.  Ecological bias, confounding, and effect modification. , 1989, International journal of epidemiology.

[27]  V. Carstairs,et al.  Deprivation and health in Scotland. , 1990, Health bulletin.

[28]  D. Dockery,et al.  Particulate air pollution as a predictor of mortality in a prospective study of U.S. adults. , 1995, American journal of respiratory and critical care medicine.

[29]  D. Dockery,et al.  An association between air pollution and mortality in six U.S. cities. , 1993, The New England journal of medicine.

[30]  D. Freedman,et al.  A solution to the ecological inference problem , 1997 .

[31]  Samuel Kotz,et al.  Continuous univariate distributions : distributions in statistics , 1970 .

[32]  R. Carroll Surprising effects of measurement error on an aggregate data estimator , 1997 .

[33]  H. Morgenstern,et al.  Ecologic studies in epidemiology: concepts, principles, and methods. , 1995, Annual review of public health.

[34]  S. Chinn,et al.  The relation of mortality in England and Wales 1969-73 to measurements of air pollution , 2022 .

[35]  Jon Wakefield,et al.  Sensitivity Analyses for Ecological Regression , 2003, Biometrics.

[36]  D Hémon,et al.  Comparison of relative risks obtained in ecological and individual studies: some methodological considerations. , 1987, International journal of epidemiology.