Analysis of count data from before-after control-impact studies

Before-after control-impact (BACI) studies are common observational studies conducted to determine environmental impacts of accidents or potential disturbances. In this paper, we present a practical guide to analysis of BACI studies when response variables are counts. Two commonly used analyses and one less common, but more appropriate, analysis are covered. The two common analyses fundamentally compare differences of differences, one using original units, the other using log-transformed units. The third analysis, which is less common, consists of estimating interaction effects in a quasi-likelihood generalized linear model with correlated errors (i.e., a generalized linear mixed model). We conclude that the two common analyses are of marginal utility when analyzing count data due to questions regarding interpretation of parameter estimates and treatment of zeros. These questions do not arise under the quasi-likelihood generalized linear model method, and it is the recommended approach. We illustrate the three techniques by analyzing data similar to that collected by an observational study of seabird counts on oiled and unoiled sites before and after the Exxon Valdez oil spill. Example data and SAS(r) code to conduct the three analyses are given.

[1]  R. Green,et al.  Sampling Design and Statistical Methods for Environmental Biologists , 1979 .

[2]  S. Hurlbert Pseudoreplication and the Design of Ecological Field Experiments , 1984 .

[3]  P. McCullagh,et al.  Generalized Linear Models , 1972, Predictive Analytics.

[4]  Strickland,et al.  Survey Design, Statistical Analysis, and Basis for Statistical Inferences in Coastal Habitat Injury Assessment: Exxon Valdez Oil Spill , 1995 .

[5]  P. Diggle,et al.  Analysis of Longitudinal Data. , 1997 .

[6]  John A. Nelder,et al.  Generalized linear models. 2nd ed. , 1993 .

[7]  S. Geisser,et al.  On methods in the analysis of profile data , 1959 .

[8]  R. Day “Effects of the Exxon Valdez oil spill on habitat use by birds in Prince William Sound, Alaska” , 1997 .

[9]  Douglas S. Robson,et al.  Techniques for wildlife investigations : design and analysis of capture data , 1993 .

[10]  T. J. Breen,et al.  Biostatistical Analysis (2nd ed.). , 1986 .

[11]  A. Grant,et al.  Exxon Valdez Oil Spill: Fate and Effects in Alaskan Waters. , 1996 .

[12]  R. Littell SAS System for Mixed Models , 1996 .

[13]  Douglas H. Johnson The Insignificance of Statistical Significance Testing , 1999 .

[14]  Allan Stewart-Oaten,et al.  ENVIRONMENTAL IMPACT ASSESSMENT: "PSEUDOREPLICATION" IN TIME?' , 1986 .

[15]  H. Huynh,et al.  Conditions under Which Mean Square Ratios in Repeated Measurements Designs Have Exact F-Distributions , 1970 .

[16]  A. Stewart-Oaten,et al.  SEQUENTIAL ESTIMATION OF LOG(ABUNDANCE) , 1996 .

[17]  Eric R. Ziegel,et al.  Generalized Linear Models , 2002, Technometrics.

[18]  H. Huynh,et al.  Estimation of the Box Correction for Degrees of Freedom from Sample Data in Randomized Block and Split-Plot Designs , 1976 .

[19]  J. Skalski Regression of abundance estimates from mark recapture surveys against environmental covariates , 1996 .

[20]  J. Mauchly Significance Test for Sphericity of a Normal $n$-Variate Distribution , 1940 .

[21]  M. Kendall,et al.  The advanced theory of statistics , 1945 .