Improving data analysis in herpetology: Using Akaike's information criterion (AIC) to assess the strength of biological hypotheses

In ecology, researchers frequently use observational studies to explain a given pattern, such as the number of individuals in a habitat patch, with a large number of explanatory (i.e., independent) variables. To elucidate such relationships, ecologists have long relied on hypothesis testing to include or exclude variables in regression models, although the conclusions often depend on the approach used (e.g., forward, backward, stepwise selection). Though better tools have surfaced in the mid 1970's, they are still underutilized in certain fields, particularly in herpetology. This is the case of the Akaike information criterion (AIC) which is remarkably superior in model selection (i.e., variable selection) than hypothesis- based approaches. It is simple to compute and easy to understand, but more importantly, for a given data set, it provides a measure of the strength of evidence for each model that represents a plausible biological hypothesis relative to the entire set of models considered. Using this approach, one can then compute a weighted average of the estimate and standard error for any given variable of interest across all the models considered. This procedure, termed model-averaging or multimodel inference, yields precise and robust estimates. In this paper, I illustrate the use of the AIC in model selection and inference, as well as the interpretation of results analysed in this framework with two real herpetological data sets. The AIC and measures derived from it is should be routinely adopted by herpetologists.

[1]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[2]  Rodney X. Sturdivant,et al.  Applied Logistic Regression: Hosmer/Applied Logistic Regression , 2005 .

[3]  David R. Anderson,et al.  Multimodel Inference , 2004 .

[4]  J. Buckley,et al.  Monitoring the conservation status of an endangered amphibian: the natterjack toad Bufo calamita in Britain , 2004 .

[5]  K. Pollock,et al.  ESTIMATING SITE OCCUPANCY AND SPECIES DETECTION PROBABILITY PARAMETERS FOR TERRESTRIAL SALAMANDERS , 2004 .

[6]  M. Mogie,et al.  In support of null hypothesis significance testing , 2004, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[7]  T. Grubb,et al.  Patch and landscape characteristics associated with the distribution of woodland amphibians in an agricultural fragmented landscape: an information-theoretic approach , 2004 .

[8]  Jerald B. Johnson,et al.  Model selection in ecology and evolution. , 2004, Trends in ecology & evolution.

[9]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .

[10]  David R. Anderson,et al.  Model Selection and Multimodel Inference , 2003 .

[11]  B. Strickland,et al.  POTENTIAL INCONSISTENCIES WHEN COMPUTING AKAIKE's INFORMATION CRITERION , 2003 .

[12]  David R. Anderson,et al.  Avoiding pitfalls when using information-theoretic methods , 2002 .

[13]  J. Buskirk,et al.  DOSAGE RESPONSE OF AN INDUCED DEFENSE: HOW SENSITIVE ARE TADPOLES TO PREDATION RISK? , 2002 .

[14]  M. Conroy,et al.  Analysis and Management of Animal Populations , 2002 .

[15]  Howard Wainer,et al.  On the Past and Future of Null Hypothesis Significance Testing , 2002 .

[16]  Douglas H. Johnson THE ROLE OF HYPOTHESIS TESTING IN WILDLIFE SCIENCE , 2002 .

[17]  Markus J. Peterson,et al.  The fall of the null hypothesis: Liabilities and opportunities , 2001 .

[18]  David R. Anderson,et al.  Suggestions for presenting the results of data analyses , 2001 .

[19]  David R. Anderson,et al.  Kullback-Leibler information as a basis for strong inference in ecological studies , 2001 .

[20]  David R. Anderson,et al.  CLIMATE, HABITAT QUALITY, AND FITNESS IN NORTHERN SPOTTED OWL POPULATIONS IN NORTHWESTERN CALIFORNIA , 2000 .

[21]  P. Wade Bayesian Methods in Conservation Biology , 2000 .

[22]  David R. Anderson,et al.  Null Hypothesis Testing: Problems, Prevalence, and an Alternative , 2000 .

[23]  Carmen Batanero,et al.  Controversies Around the Role of Statistical Tests in Experimental Research , 2000 .

[24]  Douglas H. Johnson The Insignificance of Statistical Significance Testing , 1999 .

[25]  S. Goodman Toward Evidence-Based Medical Statistics. 1: The P Value Fallacy , 1999, Annals of Internal Medicine.

[26]  William S. Verplanck,et al.  Statistical inference: Why wheels spin , 1998, Behavioral and Brain Sciences.

[27]  J. Hunter Needed: A Ban on the Significance Test , 1997 .

[28]  Richard J. Harris Significance Tests Have Their Place , 1997 .

[29]  Robert P. Abelson,et al.  On the Surprising Longevity of Flogged Horses: Why There Is a Case for the Significance Test , 1997 .

[30]  Patrick E. Shrout,et al.  Should Significance Tests be Banned? Introduction to a Special Section Exploring the Pros and Cons , 1997 .

[31]  G. White,et al.  Analysis of Frequency Count Data Using the Negative Binomial Distribution , 1996 .

[32]  Aaron M. Ellison,et al.  AN INTRODUCTION TO BAYESIAN INFERENCE FOR ECOLOGICAL RESEARCH AND ENVIRONMENTAL , 1996 .

[33]  C. Chatfield Model uncertainty, data mining and statistical inference , 1995 .

[34]  David R. Anderson,et al.  AIC MODEL SELECTION IN OVERDISPERSED CAPTURE-RECAPTURE DATA' , 1994 .

[35]  David R. Anderson,et al.  Modeling Survival and Testing Biological Hypotheses Using Marked Animals: A Unified Approach with Case Studies , 1992 .

[36]  V. Clark,et al.  Computer-aided multivariate analysis , 1991 .

[37]  Chris Chatfield,et al.  Avoiding Statistical Pitfalls , 1991 .

[38]  Nigel G. Yoccoz,et al.  Use, Overuse, and Misuse of Significance Tests in Evolutionary Biology and Ecology , 1991, The Bulletin of the Ecological Society of America.

[39]  P. McCullagh,et al.  Generalized Linear Models, 2nd Edn. , 1990 .

[40]  E. Lehmann Model Specification: The Views of Fisher and Neyman, and Later Developments , 1990 .

[41]  T. J. Breen,et al.  Biostatistical Analysis (2nd ed.). , 1986 .

[42]  D. Freedman A Note on Screening Regression Equations , 1983 .

[43]  T. C. Chamberlin The Method of Multiple Working Hypotheses: With this method the dangers of parental affection for a favorite theory can be circumvented. , 1965, Science.

[44]  W. D. Schmid Some Aspects of the Water Economies of Nine Species of Amphibians , 1965 .

[45]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[46]  T. C. Chamberlin The Method of Multiple Working Hypotheses , 1931, The Journal of Geology.

[47]  Marc J. Mazerolle Mouvements et reproduction des amphibiens en tourbières perturbées , 2004 .

[48]  M. Schaub,et al.  Why you should use capture-recapture methods when estimating survival and breeding probabilities: on bias, temporary emigration, overdispersion, and common toads , 2002 .

[49]  David R. Anderson,et al.  Kullback-Leibler information in resolving natural resource conflicts when definitive data exist , 2001 .

[50]  Benedikt R. Schmidt,et al.  Analysis of survival probabilities of female common toads, Bufo bufo , 1999 .

[51]  Steve Cherry,et al.  STATISTICAL TESTS IN PUBLICATIONS OF THE WILDLIFE SOCIETY , 1998 .

[52]  D. Kleinbaum,et al.  Applied regression analysis and other multivariable methods, 3rd ed. , 1998 .

[53]  M. J. Adams,et al.  Surveying and monitoring amphibians using aquatic funnel traps , 1997 .

[54]  A. Agresti An introduction to categorical data analysis , 1997 .

[55]  N. Sugiura Further analysts of the data by akaike' s information criterion and the finite corrections , 1978 .

[56]  H. Akaike INFORMATION THEORY AS AN EXTENSION OF THE MAXIMUM LIKELIHOOD , 1973 .

[57]  T. B. Thorson The Relationship of Water Economy to Terrestrialism in Amphibians , 1955 .