Principal Component Analysis of symmetric fuzzy data

Principal Component Analysis (PCA) is a well-known tool often used for the exploratory analysis of a numerical data set. Here an extension of classical PCA is proposed, which deals with fuzzy data (in short PCAF), where the elementary datum cannot be recognized exactly by a specific number but by a center, two spread measures and a membership function. Specifically, two different PCAF methods, associated with different hypotheses of interrelation between parts of the solution, are proposed. In the first method, called Centers-related Spread PCAF (CS-PCAF), the size of the spread measures depends on the size of the centers. In the second method, called Loadings-related Spread PCAF (LS-PCAF), the spreads are not related directly to the sizes of the centers, but indirectly, via the component loadings. To analyze how well PCAF works a simulation study was carried out. On the whole, the PCAF method performed better than or equally well as PCA, except in a few particular conditions. Finally, the application of PCAF to an empirical fuzzy data set is described.

[1]  H. Kiers,et al.  Three-mode principal components analysis: choosing the numbers of components and sensitivity to local optima. , 2000, The British journal of mathematical and statistical psychology.

[2]  H. Kiers Some procedures for displaying results from three‐way methods , 2000 .

[3]  Henk A. L. Kiers,et al.  Hierarchical relations between methods for simultaneous component analysis and a technique for rotation to a simple simultaneous structure , 1994 .

[4]  Phil Diamond,et al.  Fuzzy least squares , 1988, Inf. Sci..

[5]  Hans-Hermann Bock,et al.  Analysis of Symbolic Data: Exploratory Methods for Extracting Statistical Information from Complex Data , 2000 .

[6]  J. Watada,et al.  Fuzzy principal component analysis for fuzzy data , 1997, Proceedings of 6th International Fuzzy Systems Conference.

[7]  E. Diday,et al.  Extension de l'analyse en composantes principales à des données de type intervalle , 1997 .

[8]  Didier Dubois,et al.  Fuzzy sets and systems ' . Theory and applications , 2007 .

[9]  Pierpaolo D'Urso,et al.  A least-squares approach to fuzzy linear regression analysis , 2000 .

[10]  Pierpaolo D'Urso,et al.  Linear regression analysis for fuzzy = crisp input and fuzzy = crisp output data , 2015 .

[11]  B. Everitt,et al.  Three-Mode Principal Component Analysis. , 1986 .

[12]  M. Ichino General Metrics For Mixed Features The Cartesian Space Theory For Pattern Recognition , 1988, Proceedings of the 1988 IEEE International Conference on Systems, Man, and Cybernetics.

[13]  T. Berge Least squares optimization in multivariate analysis , 2005 .

[14]  ScienceDirect Computational statistics & data analysis , 1983 .

[15]  Miin-Shen Yang,et al.  On a class of fuzzy c-numbers clustering procedures for fuzzy data , 1996, Fuzzy Sets Syst..

[16]  F. Palumbo,et al.  A PCA for interval-valued data based on midpoints and radii , 2003 .