COMPOSITIONAL DATA IN COMMUNITY ECOLOGY: THE PARADIGM OR PERIL OF PROPORTIONS?

Ecologists are often restricted to using or choose to use proportional- or percentage-type data with the view that it helps standardize for differences in variable totals among sampling units or individuals. This standardization to compositional data leads to constraints in the covariance and correlation structure that profoundly affect subsequent analysis and interpretation. This is another form of the problem related to the use of ratios in statistical analyses. Using simulated and zooplankton data I demonstrate the effect of using compositional data vs. the original data in correlation, ordination, and cluster analysis, which are common analytical methods in community ecology. Interpretations about the relatedness of various taxa or sites may reverse when using compositions relative to the unstandardized data. In addition, the selection of subcompositions (i.e., one or more variables are excluded when calculating the composition) may have profound and unpredictable consequences for the results. I examine some approaches proposed to deal with such data, e.g., centered log-ratio analysis, and recommend the use of correspondence analysis in multivariate studies to avoid the problems associated with differing solutions.

[1]  J. Aitchison A new approach to null correlations of proportions , 1981 .

[2]  R. Reyment Compositional data analysis , 1989 .

[3]  Felix Chayes,et al.  An Approximate Statistical Test for Correlations between Proportions , 1966, The Journal of Geology.

[4]  Raymond B. Cattell,et al.  Psychological measurement: normative, ipsative, interactive. , 1944 .

[5]  M. Hill,et al.  Data analysis in community and landscape ecology , 1987 .

[6]  J. Mosimann,et al.  NEW STATISTICAL METHODS FOR ALLOMETRY WITH APPLICATION TO FLORIDA RED‐WINGED BLACKBIRDS , 1979, Evolution; international journal of organic evolution.

[7]  C. McCulloch,et al.  Data Analysis and the Design of Experiments in Ornithology , 1985 .

[8]  Donald A. Jackson STOPPING RULES IN PRINCIPAL COMPONENTS ANALYSIS: A COMPARISON OF HEURISTICAL AND STATISTICAL APPROACHES' , 1993 .

[9]  M. C. Jones,et al.  The Statistical Analysis of Compositional Data , 1986 .

[10]  J. C. Butler Effect of various transformations on the analysis of percentage data , 1981 .

[11]  J. C. Butler Effects of closure on the measures of similarity between samples , 1979 .

[12]  K. Jöreskog,et al.  Applied Factor Analysis in the Natural Sciences. , 1997 .

[13]  F. Rohlf,et al.  NTSYS-pc Numerical Taxonomy and Multivariate Analysis System, version 2.1: Owner manual , 1992 .

[14]  C.J.F. ter Braak,et al.  CANOCO - a FORTRAN program for canonical community ordination by [partial] [etrended] [canonical] correspondence analysis, principal components analysis and redundancy analysis (version 2.1) , 1988 .

[15]  J. Bacon-Shone Ranking methods for compositional data , 1992 .

[16]  J. Gentle,et al.  Randomization and Monte Carlo Methods in Biology. , 1990 .

[17]  M. Gates,et al.  Physicochemical Typologies of South-Central Ontario Lakes , 1983 .

[18]  J. Rising,et al.  The measurement of overall body size in birds , 1989 .

[19]  John C. Butler,et al.  Principal components analysis using the hypothetical closed array , 1976 .

[20]  W. Atchley,et al.  Statistical Properties of Ratios. I. Empirical Results , 1976 .

[21]  J. Paloheimo,et al.  Factors Influencing Phosphorus–Phytoplankton Relationships , 1983 .

[22]  K. Pearson Mathematical contributions to the theory of evolution.—On a form of spurious correlation which may arise when indices are used in the measurement of organs , 1897, Proceedings of the Royal Society of London.

[23]  P. G. N. Digby,et al.  Multivariate Analysis of Ecological Communities , 1987 .

[24]  Donald A. Jackson,et al.  Ratios in aquatic sciences : statistical shortcomings with mean depth and the morphoedaphic index , 1990 .

[25]  Duane F. Alwin,et al.  The Factor Analysis of Ipsative Measures , 1980 .

[26]  A. Gibson Multivariate Analysis of Lizard Thermal Behavior , 1984 .

[27]  J. C. Butler Occurrence of negative open variances in ternary systems , 1975 .

[28]  R. W. Bode,et al.  Percent Model Affinity: A New Measure of Macroinvertebrate Community Composition , 1992, Journal of the North American Benthological Society.

[29]  F. Chayes On correlation between variables of constant sum , 1960 .

[30]  William P. Dunlap,et al.  On the questionable soundness of factoring ipsative data: A response to Saville & Willson (1991) , 1994 .

[31]  John C. Butler,et al.  Visual bias in R-mode dendrograms due to the effect of closure , 1978 .

[32]  Felix Chayes,et al.  Detecting nonrandom associations between proportions by tests of remaining-space variables , 1983 .

[33]  W. Dunlap,et al.  Factor Analysis of Ipsative Measures. , 1994, Multivariate behavioral research.

[34]  John C. Butler,et al.  Trends in ternary petrologic variation diagrams; fact or fantasy? , 1979 .