Multimodel inference in ecology and evolution: challenges and solutions

Information theoretic approaches and model averaging are increasing in popularity, but this approach can be difficult to apply to the realistic, complex models that typify many ecological and evolutionary analyses. This is especially true for those researchers without a formal background in information theory. Here, we highlight a number of practical obstacles to model averaging complex models. Although not meant to be an exhaustive review, we identify several important issues with tentative solutions where they exist (e.g. dealing with collinearity amongst predictors; how to compute model‐averaged parameters) and highlight areas for future research where solutions are not clear (e.g. when to use random intercepts or slopes; which information criteria to use when random factors are involved). We also provide a worked example of a mixed model analysis of inbreeding depression in a wild population. By providing an overview of these issues, we hope that this approach will become more accessible to those investigating any process where multiple variables impact an evolutionary or ecological response.

[1]  I. Jamieson,et al.  Is lack of evidence of inbreeding depression in a threatened New Zealand robin indicative of reduced genetic load? , 2011 .

[2]  Shinichi Nakagawa,et al.  Inbreeding Depression Accumulation across Life‐History Stages of the Endangered Takahe , 2010, Conservation biology : the journal of the Society for Conservation Biology.

[3]  Shinichi Nakagawa,et al.  Limited inbreeding depression in a bottlenecked population is age but not environment dependent , 2010 .

[4]  Shinichi Nakagawa,et al.  Repeatability for Gaussian and non‐Gaussian data: a practical guide for biologists , 2010, Biological reviews of the Cambridge Philosophical Society.

[5]  Alain F. Zuur,et al.  A protocol for data exploration to avoid common statistical problems , 2010 .

[6]  H. Schielzeth Simple means to improve the interpretability of regression coefficients , 2010 .

[7]  R. Freckleton,et al.  Model averaging, missing data and multiple imputation: a case study for behavioural ecology , 2010, Behavioral Ecology and Sociobiology.

[8]  L. Garamszegi,et al.  Information-theoretic approaches to statistical analysis in behavioural ecology: an introduction , 2010, Behavioral Ecology and Sociobiology.

[9]  M. Symonds,et al.  A brief guide to model selection, multimodel inference and model averaging in behavioural ecology using Akaike’s information criterion , 2010, Behavioral Ecology and Sociobiology.

[10]  N. Dochtermann,et al.  Developing multiple hypotheses in behavioral ecology , 2010, Behavioral Ecology and Sociobiology.

[11]  Philip A. Stephens,et al.  Model selection and model averaging in behavioural ecology: the utility of the IT-AIC framework , 2010, Behavioral Ecology and Sociobiology.

[12]  Robert P. Freckleton,et al.  Dealing with collinearity in behavioural and ecological data: model averaging and the problems of measurement error , 2010, Behavioral Ecology and Sociobiology.

[13]  David R. Anderson,et al.  Model selection bias and Freedman’s paradox , 2010 .

[14]  A. Jansen Bayesian Methods for Ecology , 2009 .

[15]  Paul A Murtaugh,et al.  Performance of several variable-selection methods applied to real ecological data. , 2009, Ecology letters.

[16]  B. Carstens,et al.  An information‐theoretical approach to phylogeography , 2009, Molecular ecology.

[17]  Ben Bolker Learning hierarchical models: advice for the rest of us. , 2009, Ecological applications : a publication of the Ecological Society of America.

[18]  Mollie E. Brooks,et al.  Generalized linear mixed models: a practical guide for ecology and evolution. , 2009, Trends in ecology & evolution.

[19]  Martijn van de Pol,et al.  A simple method for distinguishing within- versus between-subject effects using mixed models , 2009, Animal Behaviour.

[20]  H. Schielzeth,et al.  Conclusions beyond support: overconfident estimates in mixed models , 2008, Behavioral ecology : official journal of the International Society for Behavioral Ecology.

[21]  Hua Liang,et al.  A Note on Conditional AIC for Linear Mixed-Effects Models. , 2008, Biometrika.

[22]  A. Gelman Scaling regression inputs by dividing by two standard deviations , 2008, Statistics in medicine.

[23]  Christopher N. Johnson,et al.  Species Richness and Evenness in Australian Birds , 2008, The American Naturalist.

[24]  L. Keller,et al.  Testing evolutionary models of senescence in a natural population: age and inbreeding effects on fitness components in song sparrows , 2008, Proceedings of the Royal Society B: Biological Sciences.

[25]  I. Jamieson,et al.  Quantifying and managing the loss of genetic variation in a free-ranging population of takahe through the use of pedigrees , 2008, Conservation Genetics.

[26]  PREDICTION‐FOCUSED MODEL SELECTION FOR AUTOREGRESSIVE MODELS , 2007 .

[27]  Shane A. Richards,et al.  Dealing with overdispersed count data in applied ecology , 2007 .

[28]  G. Kitagawa,et al.  Information Criteria and Statistical Modeling , 2007 .

[29]  David R. Anderson,et al.  Concerns regarding a call for pluralism of information theory and hypothesis testing , 2007 .

[30]  Philip A. Stephens,et al.  A call for statistical pluralism answered , 2007 .

[31]  William A Link,et al.  Model weights and the foundations of multimodel inference. , 2006, Ecology.

[32]  Robert P Freckleton,et al.  Why do we still use stepwise modelling in ecology and behaviour? , 2006, The Journal of animal ecology.

[33]  Peter Congdon,et al.  Bayesian model choice based on Monte Carlo estimates of posterior model probabilities , 2006, Comput. Stat. Data Anal..

[34]  Andrew Gelman,et al.  Data Analysis Using Regression and Multilevel/Hierarchical Models , 2006 .

[35]  Shane A. Richards,et al.  TESTING ECOLOGICAL THEORY USING THE INFORMATION‐THEORETIC APPROACH: EXAMPLES AND CAUTIONARY RESULTS , 2005 .

[36]  Timothy J. Robinson,et al.  Linear Models With R , 2005, Technometrics.

[37]  H. Goldstein,et al.  Variance partitioning in multilevel logistic models that exhibit overdispersion , 2005 .

[38]  F. Vaida,et al.  Conditional Akaike information for mixed-effects models , 2005 .

[39]  Philip A. Stephens,et al.  Information theory and hypothesis testing: a call for pluralism , 2005 .

[40]  Joe Hereford,et al.  COMPARING STRENGTHS OF DIRECTIONAL SELECTION: HOW STRONG IS STRONG? , 2004, Evolution; international journal of organic evolution.

[41]  Jerald B. Johnson,et al.  Model selection in ecology and evolution. , 2004, Trends in ecology & evolution.

[42]  Yuhong Yang Can the Strengths of AIC and BIC Be Shared , 2005 .

[43]  N. Hjort,et al.  The Focused Information Criterion , 2003 .

[44]  S. Paterson,et al.  Mixed models: getting the best use of parasitological data. , 2003 .

[45]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .

[46]  I. Jamieson,et al.  Sex‐Specific Consequences of Recent Inbreeding in an Ancestrally Inbred Population of New Zealand Takahe , 2003 .

[47]  David R. Anderson,et al.  Model Selection and Multimodel Inference , 2003 .

[48]  Sunil J Rao,et al.  Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis , 2003 .

[49]  Bradley P. Carlin,et al.  Bayesian measures of model complexity and fit , 2002 .

[50]  Stephen J. Ganocy,et al.  Bayesian Statistical Modelling , 2002, Technometrics.

[51]  David R. Anderson,et al.  Avoiding pitfalls when using information-theoretic methods , 2002 .

[52]  L. Keller,et al.  Inbreeding effects in wild populations. , 2002 .

[53]  Frank E. Harrell,et al.  Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis , 2001 .

[54]  David R. Anderson,et al.  Null Hypothesis Testing: Problems, Prevalence, and an Alternative , 2000 .

[55]  Francis Tuerlinckx,et al.  Type S error rates for classical and Bayesian single and multiple comparison procedures , 2000, Comput. Stat..

[56]  G. Seber,et al.  Estimating Animal Abundance: Review III , 1999 .

[57]  T. Pärt,et al.  Age and reproduction in birds - hypotheses and tests. , 1995, Trends in ecology & evolution.

[58]  J. Bring How to Standardize Regression Coefficients , 1994 .

[59]  David R. Anderson,et al.  Modeling Survival and Testing Biological Hypotheses Using Marked Animals: A Unified Approach with Case Studies , 1992 .

[60]  Clifford M. Hurvich,et al.  Regression and time series model selection in small samples , 1989 .

[61]  M. Braga,et al.  Exploratory Data Analysis , 2018, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[62]  Gary King,et al.  How Not to Lie with Statistics: Avoiding Common Mistakes in Quantitative Political Science , 1986 .

[63]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[64]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[65]  N. Morton,et al.  AN ESTIMATE OF THE MUTATIONAL DAMAGE IN MAN FROM DATA ON CONSANGUINEOUS MARRIAGES. , 1956, Proceedings of the National Academy of Sciences of the United States of America.

[66]  R. Fisher The theory of inbreeding , 1949 .

[67]  L. M. M.-T. Theory of Probability , 1929, Nature.

[68]  Mandy J. Haldane,et al.  A Mathematical Theory of Natural and Artificial Selection, Part V: Selection and Mutation , 1927, Mathematical Proceedings of the Cambridge Philosophical Society.

[69]  J. B. S. Haldane,et al.  A mathematical theory of natural and artificial selection—I , 1927, Mathematical Proceedings of the Cambridge Philosophical Society.

[70]  J B Haldane,et al.  A MATHEMATICAL THEORY OF NATURAL AND ARTIFICIAL SELECTION. PART II THE INFLUENCE OF PARTIAL SELF‐FERTILISATION, INBREEDING, ASSORTATIVE MATING, AND SELECTIVE FERTILISATION ON THE COMPOSITION OF MENDELIAN POPULATIONS, AND ON NATURAL SELECTION. , 1924, Mathematical Proceedings of the Cambridge Philosophical Society.

[71]  Sewall Wright,et al.  The effects of inbreeding and crossbreeding on guinea pigs : III. crosses between highly inbred families / , 1922 .