Study design and analytic methods for data collected from clusters of animals

This paper outlines a variety of study design and statistical methods that account for the clustering of animal health and production outcomes. We argue that the relative utility of study design versus statistical methods in accounting for cluster effects depends primarily on the objectives of the study and the amount of prior information available. The statistical methods range from simple post-hoc adjustments of test statistics to relatively complex mixture-distribution models. Methods for normally, binomially and Poisson-distributed data are presented. Each option is discussed with reference to its underlying assumptions and to how it has been, or might be, used in veterinary epidemiologic studies.
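
As an illustration of the simplest end of that range, the sketch below deflates a chi-square statistic by the widely used variance inflation factor, 1 + (m - 1)ρ, where m is the cluster (herd) size and ρ is the intracluster correlation coefficient. It is a minimal sketch, not necessarily the adjustment described in the paper: it assumes clusters of roughly equal size and an exposure that is constant within each cluster, and the function names and numeric values are hypothetical.

```python
def design_effect(cluster_size: float, icc: float) -> float:
    """Variance inflation factor 1 + (m - 1) * rho.

    cluster_size: (average) number of animals per cluster, m >= 1.
    icc: intracluster correlation coefficient rho, assumed in [0, 1].
    """
    if cluster_size < 1:
        raise ValueError("cluster_size must be at least 1")
    if not 0.0 <= icc <= 1.0:
        raise ValueError("icc must lie between 0 and 1")
    return 1.0 + (cluster_size - 1.0) * icc


def adjusted_chi_square(chi_square: float, cluster_size: float, icc: float) -> float:
    """Divide a naive chi-square statistic by the variance inflation factor."""
    if chi_square < 0:
        raise ValueError("chi_square must be non-negative")
    return chi_square / design_effect(cluster_size, icc)


if __name__ == "__main__":
    # Hypothetical values: herds of 20 animals, rho = 0.1, naive chi-square = 8.0.
    deff = design_effect(20, 0.1)
    print(f"design effect: {deff:.2f}")
    print(f"adjusted chi-square: {adjusted_chi_square(8.0, 20, 0.1):.2f}")
```

With these hypothetical inputs the inflation factor is 2.9, so a naive statistic of 8.0 shrinks to about 2.76, which no longer exceeds the 5% critical value of 3.84 for one degree of freedom. When ρ is 0 the factor is 1 and the statistic is unchanged.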
