Correlation Coefficient of Compositional Data Based on Isometric Logratio Transformation

Compositional data is a relatively independent field in statistical analysis. Aitchison used to introduce the additive-logratio transformation (alr) and centered logratio transformation (clr) in 1986, which are effective tools to solve the problem of compositional data. But those approaches are not isometry ways, and the interpretation of transformed data might be different from the expected properties. Then Egozcue put forward the isometric logratio transformation, which can transform the simplex space into the real Euclidean space. This paper aims at introducing a new approach to calculate the correlation of two sets of compositional data. This approach based on the isometric logratio transformation can preserve all metric properties and solve the problem of calculation of correlation coefficient on compositional data.

[1]  Gordon M Miskelly,et al.  Compositional data analysis for elemental data in forensic science. , 2009, Forensic science international.

[2]  Qiang Liu,et al.  A hyperspherical transformation forecasting model for compositional data , 2007, Eur. J. Oper. Res..

[3]  John Connolly,et al.  Compositional data analysis in the study of carcass composition of beef cattle , 2001 .

[4]  J. Aitchison,et al.  Kernel Density Estimation for Compositional Data , 1985 .

[5]  Thomas Kätterer,et al.  Compositional analysis for an unbiased measure of soil aggregation , 2012 .

[6]  W. E. Sharp,et al.  The graph median–A stable alternative measure of central tendency for compositional data sets , 2006 .

[7]  K. Pearson Mathematical contributions to the theory of evolution. VIII. On the correlation of characters not quantitatively measurable , 2022, Proceedings of the Royal Society of London.

[8]  José E. Chacón,et al.  Gaussian kernels for density estimation with compositional data , 2011, Comput. Geosci..

[9]  Antonella Buccianti,et al.  Insights into Late Quaternary calcareous nannoplankton assemblages under the theory of statistical analysis for compositional data , 2004 .

[10]  P. Filzmoser,et al.  The bivariate statistical analysis of environmental (compositional) data. , 2010, The Science of the total environment.

[11]  V. Pawlowsky-Glahn,et al.  Exploring Compositional Data with the CoDa-Dendrogram , 2011 .

[12]  R. Zuo,et al.  Compositional data analysis in the study of integrated geochemical anomalies associated with mineralization , 2013 .

[13]  Lindsay Pennington,et al.  Analysis of compositional data in communication disorders research. , 2009, Journal of communication disorders.

[14]  John Aitchison,et al.  The Statistical Analysis of Compositional Data , 1986 .

[15]  Charles E. Heckler,et al.  Applied Multivariate Statistical Analysis , 2005, Technometrics.

[16]  Clemens Reimann,et al.  Interpretation of multivariate outliers for compositional data , 2012, Comput. Geosci..

[17]  Karl Pearson,et al.  Mathematical contributions to the theory of evolution, On the law of ancestral heredity , 1898, Proceedings of the Royal Society of London.

[18]  Peter Filzmoser,et al.  Model-based replacement of rounded zeros in compositional data: Classical and robust approaches , 2012, Comput. Stat. Data Anal..

[19]  G. Mateu-Figueras,et al.  Isometric Logratio Transformations for Compositional Data Analysis , 2003 .