Basic models for disease occurrence in epidemiology.

BACKGROUND One of the epidemiologist's most basic tasks is estimation of disease occurrence. To perform this task, the epidemiologist frequently models variability in disease occurrence using one of three distributions--the binomial, the Poisson or the exponential distribution. Although epidemiologists often use them and their properties appear in standard texts, we know of no text or review that compares and contrasts epidemiological application of these distributions. METHODS In this commentary, we discuss these three basic distributions. We note key assumptions as well as limitations, and compare results from analyses based on each distribution. RESULTS AND CONCLUSIONS We illustrate that the three distributions, although superficially different, often lead to similar results. We argue that epidemiologists should often obtain similar results regardless of which distribution they use. We also point out that application of all three distributions can be inappropriate if assumptions of independence or homogeneity of risks fail to hold. Finally, we briefly review how these basic distributions can be used to justify use of other distributions, such as the Gaussian distribution, for studying disease-exposure associations.

[1]  DAVID G. KENDALL,et al.  Introduction to Mathematical Statistics , 1947, Nature.

[2]  R. Doll,et al.  Lung Cancer and Other Causes of Death in Relation to Smoking , 1956, British medical journal.

[3]  Emanuel Parzen,et al.  Stochastic Processes , 1962 .

[4]  David R. Cox The analysis of binary data , 1970 .

[5]  J. Fleiss Statistical methods for rates and proportions , 1974 .

[6]  W. Grove Statistical Methods for Rates and Proportions, 2nd ed , 1981 .

[7]  J. Ware,et al.  Random-effects models for longitudinal data. , 1982, Biometrics.

[8]  K Y Liang,et al.  Longitudinal data analysis for discrete and continuous outcomes. , 1986, Biometrics.

[9]  N. Breslow,et al.  Statistical methods in cancer research. Volume II--The design and analysis of cohort studies. , 1987, IARC scientific publications.

[10]  G. Bonney,et al.  Logistic regression for dependent binary observations. , 1987, Biometrics.

[11]  I M Longini,et al.  Statistical inference for infectious diseases. Risk-specific household and community transmission parameters. , 1988, American journal of epidemiology.

[12]  P. McCullagh,et al.  Generalized Linear Models , 1992 .

[13]  S Greenland,et al.  Randomization, Statistics, and Causal Inference , 1990, Epidemiology.

[14]  Norman E. Breslow,et al.  Statistical Methods in Cancer Research, Vol. II: The Design and Analysis of Cohort Studies. , 1990 .

[15]  Sander Greenland,et al.  On the Logical Justification of Conditional Tests for Two-By-Two Contingency Tables , 1991 .