Covariate-adjusted species response curves derived from long-term macroinvertebrate monitoring data using classical and contemporary model-based ordination methods

Abstract Species response curves describe the change in species abundance or probability of occurrence along an environmental gradient and can be useful tools for understanding the consequences of environmental change. I compare species (taxa) response curves to electrical conductivity derived by two multivariate methods—one based on a principal coordinates analysis (the “classical” method) and the other on latent variable models (the “contemporary” method)—from long-term (33-years) of aquatic macroinvertebrate and water quality data collected at six sites along 2200 km of the Murray River, Australia. In both cases the models included other environmental variables, and the response curves to electrical conductivity were adjusted for these covariates. The response curves were very similar by the two methods, but the contemporary method has many advantages. Unlike the classical method, the contemporary method explicitly accounts for the statistical properties of the data, and in particular the mean-variance relationship, thus enabling valid inference, and it provides interval estimates for the parameters of the model, allowing inferences to be drawn regarding the statistical significance of an individual taxon's relationship with electrical conductivity. For the data set used here with 104 taxa and 269 samples, the classical method, with 999 permutations to compute percentile confidence bands, took 12 min to execute on a laptop running a 2.8 GHz i7-7600U dual core CPU with 16 GB RAM, whereas the contemporary method (using the gllvm R package) took 6 min.

[1]  C. Braak Correspondence Analysis of Incidence and Abundance Data:Properties in Terms of a Unimodal Response Model , 1985 .

[2]  Marti J. Anderson,et al.  Causal modeling with multivariate species data , 2013 .

[3]  Anna Norberg,et al.  How to make more out of community data? A conceptual framework and its implementation as models and software. , 2017, Ecology letters.

[4]  Yi Wang,et al.  mvabund– an R package for model‐based analysis of multivariate abundance data , 2012 .

[5]  Pierre Legendre,et al.  An empirical comparison of permutation methods for tests of partial regression coefficients in a linear model , 1999 .

[6]  J. Tweedley,et al.  Simple shade plots aid better long-term choices of data pre-treatment in multivariate assemblage studies , 2013, Journal of the Marine Biological Association of the United Kingdom.

[7]  C. Townsend,et al.  Can biological traits of stream invertebrates help disentangle the effects of multiple stressors in an agricultural catchment , 2014 .

[8]  Sara Taskinen,et al.  Generalized Linear Latent Variable Models for Multivariate Count and Biomass Data in Ecology , 2017 .

[9]  D. Roberts Comparison of distance-based and model-based ordinations. , 2019, Ecology.

[10]  R. Whittaker,et al.  GRADIENT ANALYSIS OF VEGETATION* , 1967, Biological reviews of the Cambridge Philosophical Society.

[11]  Donald A. Jackson STOPPING RULES IN PRINCIPAL COMPONENTS ANALYSIS: A COMPARISON OF HEURISTICAL AND STATISTICAL APPROACHES' , 1993 .

[12]  S. Frontier Étude de la décroissance des valeurs propres dans une analyse en composantes principales: Comparaison avec le modd́le du bâton brisé , 1976 .

[13]  D. Warton Why you cannot transform your way out of trouble for small counts , 2018, Biometrics.

[14]  Francis K. C. Hui,et al.  boral – Bayesian Ordination and Regression Analysis of Multivariate Abundance Data in r , 2016 .

[15]  H. John B. Birks,et al.  From Classical to Canonical Ordination , 2012 .

[16]  Francis K. C. Hui,et al.  gllvm: Fast analysis of multivariate abundance data with generalized linear latent variable models in r , 2019, Methods in Ecology and Evolution.

[17]  Edward Hooper,et al.  The River , 2018, Jew Boy.

[18]  Pierre Legendre,et al.  DISTANCE‐BASED REDUNDANCY ANALYSIS: TESTING MULTISPECIES RESPONSES IN MULTIFACTORIAL ECOLOGICAL EXPERIMENTS , 1999 .

[19]  B. Downes Back to the future: little‐used tools and principles of scientific inference can help disentangle effects of multiple stressors on freshwater ecosystems , 2010 .

[20]  Laura J. Pollock,et al.  Understanding co‐occurrence by modelling species simultaneously with a Joint Species Distribution Model (JSDM) , 2014 .

[21]  C. Braak,et al.  Weighted averaging, logistic regression and the Gaussian response model , 2004, Vegetatio.

[22]  Ter Braak,et al.  Canoco reference manual and CanoDraw for Windows user''s guide: software for canonical community ord , 2002 .

[23]  T. Hillman,et al.  Macroinvertebrates of the River Murray (Survey and Monitoring: 1980-1985) , 1989 .

[24]  David I. Warton,et al.  The central role of mean‐variance relationships in the analysis of multivariate abundance data: a response to Roberts (2017) , 2017 .

[25]  Francis K. C. Hui,et al.  So Many Variables: Joint Modeling in Community Ecology. , 2015, Trends in ecology & evolution.

[26]  J. Oksanen,et al.  Continuum theory revisited: what shape are species responses along ecological gradients? , 2002 .

[27]  K. R. Clarke,et al.  Inverse analysis in non-parametric multivariate analyses: distinguishing groups of associated species which covary coherently across samples , 2013 .

[28]  Jan Lepš,et al.  Multivariate Analysis of Ecological Data using CANOCO , 2003 .

[29]  D. Goodall,et al.  Objective methods for the classification of vegetation. III. An essay in the use of factor analysis , 1954 .

[30]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[31]  Mike P. Austin,et al.  Continuum Concept, Ordination Methods, and Niche Theory , 1985 .

[32]  D. Roberts Distance, dissimilarity, and mean–variance ratios in ordination , 2017 .

[33]  S. Wood Generalized Additive Models: An Introduction with R , 2006 .

[34]  D. Warton,et al.  Distance‐based multivariate analyses confound location and dispersion effects , 2012 .

[35]  Robert H. Whittaker,et al.  Vegetation of the Great Smoky Mountains , 1956 .

[36]  K. R. Clarke,et al.  Change in marine communities : an approach to statistical analysis and interpretation , 2001 .

[37]  J. Koehn,et al.  Report of the River Murray scientific panel on environmental flows : River Murray – Dartmouth to Wellington and the Lower Darling River. , 2000 .

[38]  Sara Taskinen,et al.  Model‐based approaches to unconstrained ordination , 2015 .

[39]  C.J.F. ter Braak,et al.  A Theory of Gradient Analysis , 2004 .

[40]  Warren L Paul,et al.  Causal modelling applied to the risk assessment of a wastewater discharge , 2016, Environmental Monitoring and Assessment.

[41]  Warren L. Paul,et al.  Long‐Term Monitoring of Macroinvertebrate Communities Over 2,300 km of the Murray River Reveals Ecological Signs of Salinity Mitigation Against a Backdrop of Climate Variability , 2018, Water Resources Research.

[42]  C. Braak Canonical Correspondence Analysis: A New Eigenvector Technique for Multivariate Direct Gradient Analysis , 1986 .