Too good to be true: pitfalls of using mean Ellenberg indicator values in vegetation analyses

Question: Mean Ellenberg indicator values (EIVs) inherit information about compositional similarity, because during their calculation species abundances (or presence–absences) are used as weights. Can this similarity issue actually be demonstrated, does it bias results of vegetation analyses correlating mean EIVs with other aspects of species composition and how often are biased studies published? Methods: In order to separate information on compositional similarity possibly present in mean EIVs, a new variable was introduced, calculated as a weighted average of randomized species EIVs. The performance of these mean randomized EIVs was compared with that of the mean real EIVs on the one hand and random values (randomized mean EIVs) on the other. To demonstrate the similarity issue, differences between samples were correlated with dissimilarity matrices based on various indices. Next, the three mean EIV variables were tested in canonical correspondence analysis (CCA), detrended correspondence analysis (DCA), analysis of variance (ANOVA) between vegetation clusters, and in regression on species richness. Subsequently, a modified permutation test of significance was proposed, taking the similarity issue into account. In addition, an inventory was made of studies published in the Journal of Vegetation Science and Applied Vegetation Science between 2000 and 2010 likely reporting biased results due to the similarity issue. Results: Using mean randomized EIVs, it is shown that compositional similarity is inherited into mean EIVs and most resembles the inter-sample distances in correspondence analysis, which itself is based on iterative weighted averaging. The use of mean EIVs produced biased results in all four analysis types examined: unrealistic (too high) explained variances in CCA, too many significant correlations with ordination axes in DCA, too many significant differences between cluster analysis groups and too high coefficients of determination in regressions on species richness. Modified permutation tests provided ecologically better interpretable results. From 95 studies using Ellenberg indicator values, 36 reported potentially biased results. Conclusions: No statistical inferences should bemade in analyses relatingmean EIVs with other variables derived from the species composition as this can produce highly biased results, leading to misinterpretation. Alternatively, a modified permutation test using mean randomized EIVs can sometimes be used.

[1]  S. Hubbell,et al.  The unified neutral theory of biodiversity and biogeography at age ten. , 2011, Trends in ecology & evolution.

[2]  S. Zerbe,et al.  Ecological preferences of alien plant species in North-Eastern Germany , 2011, Biological Invasions.

[3]  M. Hill,et al.  Empirical realised niche models for British higher and lower plants – development and preliminary testing , 2010 .

[4]  J. Schellberg,et al.  Changes in vegetation types and Ellenberg indicator values after 65 years of fertilizer application in the Rengen Grassland Experiment, Germany. , 2009 .

[5]  W. Willner,et al.  Picea abies andAbies alba forests of the austrian alps: Numerical classification and ordination , 2002, Folia Geobotanica.

[6]  P. Legendre,et al.  vegan : Community Ecology Package. R package version 1.8-5 , 2007 .

[7]  D. Zelený,et al.  Environmental control of the vegetation pattern in deep rivervalleys of the Bohemian Massif , 2007 .

[8]  P. Pyšek,et al.  Regional species pools of vascular plants in habitats of the Czech Republic , 2007 .

[9]  S. Godefroid,et al.  ORIGINAL ARTICLE: Can Ellenberg's indicator values for Mediterranean plants be used outside their region of definition? , 2006 .

[10]  E. Maarel,et al.  The Braun-Blanquet approach in perspective , 1975, Vegetatio.

[11]  F. Berendse,et al.  Plant species as predictors of soil pH: Replacing expert judgement with measurements , 2005 .

[12]  Milan Chytrý,et al.  Stratified resampling of phytosociological databases: some strategies for obtaining more representative data sets for classification studies , 2005 .

[13]  J. Witte,et al.  Cover-weighted averaging of indicator values in vegetation analyses , 2004 .

[14]  M. Hill,et al.  Detrended correspondence analysis: An improved ordination technique , 2004, Vegetatio.

[15]  Gégout Jean-Claude,et al.  Comparison of indicator values of forest understory plant species in Western Carpathians (Slovakia) and Vosges Mountains (France) , 2003 .

[16]  A. Fosaa,et al.  Calibration of Ellenberg indicator values for the Faroe Islands , 2003 .

[17]  M. Diekmann Species indicator values as an important tool in applied plant ecology – a review , 2003 .

[18]  Milan Chytrý,et al.  Czech National Phytosociological Database: basic statistics of the available vegetation-plot data Česká národní fytocenologická databáze: základní statistika dostupných snímkových dat , 2003 .

[19]  J. Ewald The sensitivity of Ellenberg indicator values to the completeness of vegetation relevés , 2003 .

[20]  G. De’ath,et al.  CLASSIFICATION AND REGRESSION TREES: A POWERFUL YET SIMPLE TECHNIQUE FOR ECOLOGICAL DATA ANALYSIS , 2000 .

[21]  A. P. Schaffers,et al.  Reliability of Ellenberg indicator values for moisture, nitrogen and soil reaction: a comparison with field measurements , 2000 .

[22]  Glenn De'ath,et al.  Classification and regression trees: a powerful yet simple technique for the analysis of complex ecological data , 2000 .

[23]  N. Altman,et al.  Canonical correspondence analysis as an approximation to Gaussian ordination , 1999 .

[24]  M. Diekmann,et al.  Prediction of occurrence of vascular plants in deciduous forests of South Sweden by means of Ellenberg indicator values , 1998 .

[25]  M. Zobel,et al.  The relative of species pools in determining plant species richness: an alternative explanation of species coexistence? , 1997, Trends in ecology & evolution.

[26]  Christopher M. Bishop,et al.  Classification and regression , 1997 .

[27]  Ruprecht Düll,et al.  Zeigerwerte von Pflanzen in Mitteleuropa , 1992 .

[28]  H. Möller Wege zur Ansprache der aktuellen Bodenazidität auf der Basis der Reaktionszahlen von Ellenberg ohne arithmetisches Mitteln dieser Werte , 1987 .

[29]  C. Braak Canonical Correspondence Analysis: A New Eigenvector Technique for Multivariate Direct Gradient Analysis , 1986 .

[30]  M. O. Hill,et al.  TWINSPAN: a FORTRAN program of arranging multivariate data in an ordered two way table by classification of individual and attributes , 1979 .

[31]  M. Hill Correspondence Analysis: A Neglected Multivariate Method , 1974 .

[32]  M. Hill,et al.  Reciprocal Averaging : an eigenvector method of ordination , 1973 .

[33]  R. Whittaker Vegetation of the Siskiyou Mountains, Oregon and California , 1960 .