Understanding the statistical properties of the percent model affinity index can improve biomonitoring related decision making

The percent model affinity (PMA) index is used to measure the similarity of two probability profiles representing, for example, an ideal profile (i.e. reference condition) and a monitored profile (i.e. possibly impacted condition). The goal of this work is to study the effects of sample size, evenness, true value of the index and number of classes on the statistical properties of the estimator of the PMA index. We derive and extend previous formulas of the expectation and variance of the estimator for estimated monitored profile and fixed reference profile. Using the obtained extension, we find that the estimator is asymptotically unbiased, converging faster when the profiles differ. When both profiles are estimated, we calculate the expectation using transformation rules for expectation and in addition derive the formula for the estimator’s variance. Since the computation of the probabilities in the variance formula is slow, we study the behavior of the variance with simulation experiments and assess whether it could be approximated with the variance for the fixed reference profile. Finally, we provide a set of recommendations for the users of the PMA index to avoid the most common caveats of the index.

[1]  John L Stoddard,et al.  Setting expectations for the ecological condition of streams: the concept of reference condition. , 2005, Ecological applications : a publication of the Ecological Society of America.

[2]  Robert K. Colwell,et al.  Unveiling the species-rank abundance distribution by generalizing the Good-Turing sample coverage theory. , 2015, Ecology.

[3]  Marko Järvinen,et al.  Defining the ecological status of small forest lakes using multiple biological quality elements and palaeolimnological analysis. , 2009 .

[4]  E. C. Pielou The measurement of diversity in different types of biological collections , 1966 .

[5]  G. Seber,et al.  The estimation of animal abundance and related parameters , 1974 .

[6]  O. D. Duncan,et al.  A METHODOLOGICAL ANALYSIS OF SEGREGATION INDEXES , 1955 .

[7]  Yong Cao,et al.  Quantifying the responses of macroinvertebrate assemblages to simulated stress: are more accurate similarity indices less useful? , 2010 .

[8]  C. Townsend,et al.  Is resolution the solution?: the effect of taxonomic resolution on the calculated properties of three stream food webs , 2000 .

[9]  Dan Romik,et al.  Stirling's Approximation for n!: the Ultimate Short Proof? , 2000, Am. Math. Mon..

[10]  H. Wolda,et al.  Similarity indices, sample size and diversity , 1981, Oecologia.

[11]  L. Gordon,et al.  Tutorial on large deviations for the binomial distribution , 1989 .

[12]  J. Jahn,et al.  The Measurement of Ecological Segregation , 1947 .

[13]  Matthew Goldstein,et al.  On the Problem of Bias in Multinomial Classification , 1977 .

[14]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[15]  E. Smith,et al.  Bias in Estimating Niche Overlap , 1982 .

[16]  K. Meissner,et al.  Comparing long term sediment records to current biological quality element data – Implications for bioassessment and management of a eutrophic lake , 2012 .

[17]  Charles P. Hawkins,et al.  Simulating biological impairment to evaluate the accuracy of ecological indicators , 2005 .

[18]  Y. Matsinos,et al.  Post-fire succession indices performance in a Mediterranean ecosystem , 2013, Stochastic Environmental Research and Risk Assessment.

[19]  R. Ricklefs,et al.  Bias and Dispersion of Overlap Indices: Results of Some Monte Carlo Simulations , 1980 .

[20]  B. Sinha,et al.  Some Aspects of the Sampling Distribution of the Apportionment Index and Related Inference , 2007 .

[21]  E. Smith Niche Breadth, Resource Availability, and Inference , 1982 .

[22]  Sa Bloom,et al.  Similarity Indices in Community Studies: Potential Pitfalls , 1981 .

[23]  A. Chao,et al.  Nonparametric estimation of Shannon’s index of diversity when there are unseen species in sample , 2004, Environmental and Ecological Statistics.

[24]  G. Jona Lasinio,et al.  Bayesian analysis of three indices for lagoons ecological status evaluation , 2015, Stochastic Environmental Research and Risk Assessment.

[25]  R. W. Bode,et al.  Percent Model Affinity: A New Measure of Macroinvertebrate Community Composition , 1992, Journal of the North American Benthological Society.

[26]  Michael R. Ransom,et al.  Sampling Distributions of Segregation Indexes , 2000 .

[27]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[28]  L. Gordon,et al.  Tutorial on large deviations for the binomial distribution. , 1989, Bulletin of mathematical biology.

[29]  E. Venrick PERCENT SIMILARITY: THE PREDICTION OF BIAS , 1983 .

[30]  D. Hannah,et al.  How does macroinvertebrate taxonomic resolution influence ecohydrological relationships in riverine ecosystems , 2012 .

[31]  C. Baraloto,et al.  The decomposition of Shannon's entropy and a confidence interval for beta diversity , 2012 .