Indirect gradient analysis by Markov-chain Monte Carlo

Classical gradient analysis continues to be used to explore and test theories and models in community ecology. Yet the foundations of classical gradient analysis were developed at a time when computational power was limited, relative to current computational power. I argue that this history has left a lasting legacy on the field. Consequently, many gradient analyses do not to take advantage of current computer technology. Here I show how to use computationally intensive Markov-chain Monte Carlo methods to improve gradient analyses of presence–absence community data. The methods that I use were developed by quantitative social scientists in the early 1990s, and therefore tested and efficient software already exists for practical data analysis. As an example, I analyze the classic dune meadow vegetation data. A main advantage of the Bayesian approach to indirect gradient analysis is that, unlike essentially all classical indirect methods, it is able to make empirically testable probabilistic predictions of observed species occurrence patterns. The Bayesian approach also poses challenges for statistical ecology. In particular, the development of Markov-chain Monte Carlo methods for a wider class of Bayesian indirect gradient analysis models would permit more flexible approaches to generating probabilistic predictions.

[1]  Hadley Wickham,et al.  Reshaping Data with the reshape Package , 2007 .

[2]  J. P. Grime,et al.  Biodiversity and Ecosystem Functioning: Current Knowledge and Future Challenges , 2001, Science.

[3]  M. R. Novick,et al.  Statistical Theories of Mental Test Scores. , 1971 .

[4]  Peter R. Minchin,et al.  An evaluation of the relative robustness of techniques for ecological ordination , 1987 .

[5]  Cedric E. Ginestet ggplot2: Elegant Graphics for Data Analysis , 2011 .

[6]  David J. Harris Generating realistic assemblages with a joint species distribution model , 2015 .

[7]  F. Lord A theory of test scores. , 1952 .

[8]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[9]  M. Austin,et al.  Models for the analysis of species' response to environmental gradients , 2004, Vegetatio.

[10]  L. Prior,et al.  Ecological Models and Data in R , 2011 .

[11]  P. Legendre,et al.  vegan : Community Ecology Package. R package version 1.8-5 , 2007 .

[12]  Alfian Futuhul Hadi,et al.  Row–column interaction models, with an R implementation , 2014, Comput. Stat..

[13]  James S. Clark,et al.  Models for Ecological Data: An Introduction , 2007 .

[14]  Sara Taskinen,et al.  Model‐based approaches to unconstrained ordination , 2015 .

[15]  F. Baker,et al.  Item response theory : parameter estimation techniques , 1993 .

[16]  Karl Cottenie,et al.  Integrating environmental and spatial processes in ecological community dynamics. , 2005, Ecology letters.

[17]  William K. Morris,et al.  The role of functional traits in species distributions revealed through a hierarchical model , 2012 .

[18]  P. McCullagh,et al.  Generalized Linear Models , 1984 .

[19]  John Sibert,et al.  AD Model Builder: using automatic differentiation for statistical inference of highly parameterized complex nonlinear models , 2012, Optim. Methods Softw..

[20]  James S. Clark,et al.  Why environmental scientists are becoming Bayesians , 2004 .

[21]  Kai Zhu,et al.  More than the sum of the parts: forest climate response from joint species distribution models. , 2014, Ecological applications : a publication of the Ecological Society of America.

[22]  Campbell O. Webb,et al.  Phylogenies and Community Ecology , 2002 .

[23]  Donald A. Jackson,et al.  Random-effects ordination: describing and predicting multivariate correlations and co-occurrences , 2011 .

[24]  Christopher J. Fariss,et al.  Dynamic Patterns of Human Rights Practices , 2014 .

[25]  J. T. Curtis,et al.  An Ordination of the Upland Forest Communities of Southern Wisconsin , 1957 .

[26]  Donald A. Jackson,et al.  Qualitative and quantitative sampling of lake fish communities , 1997 .

[27]  Trevor Hastie,et al.  Generalized linear and generalized additive models in studies of species distributions: setting the scene , 2002 .

[28]  Andrew Thomas,et al.  WinBUGS - A Bayesian modelling framework: Concepts, structure, and extensibility , 2000, Stat. Comput..

[29]  S. Hubbell,et al.  The unified neutral theory of biodiversity and biogeography at age ten. , 2011, Trends in ecology & evolution.

[30]  Andrew D. Martin,et al.  MCMCpack: Markov chain Monte Carlo in R , 2011 .

[31]  Noah Kaplan,et al.  Practical Issues in Implementing and Understanding Bayesian Ideal Point Estimation , 2005, Political Analysis.

[32]  Laura J. Pollock,et al.  Understanding co‐occurrence by modelling species simultaneously with a Joint Species Distribution Model (JSDM) , 2014 .

[33]  J. Albert Bayesian Estimation of Normal Ogive Item Response Curves Using Gibbs Sampling , 1992 .

[34]  Jonathan M. Chase,et al.  The metacommunity concept: a framework for multi-scale community ecology , 2004 .

[35]  David J. Harris Building realistic assemblages with a Joint Species Distribution Model , 2014, bioRxiv.

[36]  David M. Richardson,et al.  Biodiversity and Ecosystem Functioning , 2014 .

[37]  Hugh G. Gauch,et al.  Ordination of Vegetation Samples by Gaussian Species Distributions , 1974 .

[38]  Donald A. Jackson,et al.  Putting Things in Order: The Ups and Downs of Detrended Correspondence Analysis , 1991, The American Naturalist.

[39]  P. Schönemann,et al.  A generalized solution of the orthogonal procrustes problem , 1966 .

[40]  Hadley Wickham,et al.  The Split-Apply-Combine Strategy for Data Analysis , 2011 .

[41]  Jakub Stoklosa,et al.  Model-based thinking for community ecology , 2014, Plant Ecology.

[42]  A. Albert,et al.  On the existence of maximum likelihood estimates in logistic regression models , 1984 .

[43]  Martyn Plummer,et al.  JAGS: A program for analysis of Bayesian graphical models using Gibbs sampling , 2003 .

[44]  Christina Gloeckner,et al.  Modern Applied Statistics With S , 2003 .

[45]  James H. Brown,et al.  Toward a metabolic theory of ecology , 2004 .

[46]  Xiao-Li Meng,et al.  POSTERIOR PREDICTIVE ASSESSMENT OF MODEL FITNESS VIA REALIZED DISCREPANCIES , 1996 .

[47]  C.J.F. ter Braak,et al.  A Theory of Gradient Analysis , 2004 .

[48]  D. Tilman,et al.  Biodiversity and Ecosystem Functioning , 2014 .

[49]  B. Enquist,et al.  Rebuilding community ecology from functional traits. , 2006, Trends in ecology & evolution.

[50]  D. Bates,et al.  Output Analysis and Diagnostics for MCMC , 2015 .

[51]  R. Whittaker,et al.  GRADIENT ANALYSIS OF VEGETATION* , 1967, Biological reviews of the Cambridge Philosophical Society.

[52]  Hugh G. Gauch,et al.  Multivariate analysis in community ecology , 1984 .

[53]  Otso Ovaskainen,et al.  Making more out of sparse data: hierarchical modeling of species communities. , 2011, Ecology.

[54]  Richard Arnold,et al.  Multivariate methods using mixtures: Correspondence analysis, scaling and pattern-detection , 2014, Comput. Stat. Data Anal..

[55]  David I Warton,et al.  Regularized Sandwich Estimators for Analysis of High‐Dimensional Data Using Generalized Estimating Equations , 2011, Biometrics.

[56]  C. Braak Correspondence Analysis of Incidence and Abundance Data:Properties in Terms of a Unimodal Response Model , 1985 .

[57]  Otso Ovaskainen,et al.  Modeling species co-occurrence by multivariate logistic regression generates new hypotheses on fungal interactions. , 2010, Ecology.

[58]  J. M. A. Swan,et al.  An Examination of Some Ordination Problems By Use of Simulated Vegetational Data , 1970 .

[59]  Thomas W. Yee,et al.  A NEW TECHNIQUE FOR MAXIMUM‐LIKELIHOOD CANONICAL GAUSSIAN ORDINATION , 2004 .

[60]  T. Yee The VGAM Package for Categorical Data Analysis , 2010 .

[61]  J. Andrew Royle,et al.  Estimating Size and Composition of Biological Communities by Modeling the Occurrence of Species , 2005 .

[62]  Anthony R. Ives,et al.  Generalized linear mixed models for phylogenetic analyses of community structure , 2011 .