Three-way analysis of imprecise data

Data are often affected by uncertainty. Uncertainty is usually referred to as randomness. Nonetheless, other sources of uncertainty may occur. In particular, the empirical information may also be affected by imprecision. Also in these cases it can be fruitful to analyze the underlying structure of the data. In this paper we address the problem of summarizing a sample of three-way imprecise data. In order to manage the different sources of uncertainty a twofold strategy is adopted. On the one hand, imprecise data are transformed into fuzzy sets by means of the so-called fuzzification process. The so-obtained fuzzy data are then analyzed by suitable generalizations of the Tucker3 and CANDECOMP/PARAFAC models, which are the two most popular three-way extensions of Principal Component Analysis. On the other hand, the statistical validity of the obtained underlying structure is evaluated by (nonparametric) bootstrapping. A simulation experiment is performed for assessing whether the use of fuzzy data is helpful in order to summarize three-way uncertain data. Finally, to show how our models work in practice, an application to real data is discussed.

[1]  L. Tucker,et al.  Some mathematical notes on three-mode factor analysis , 1966, Psychometrika.

[2]  P. Giordani,et al.  Component Models for Fuzzy Data , 2006 .

[3]  Paolo Giordani,et al.  Two- and three-way component models for LR fuzzy data in a possibilistic framework , 2006, Fuzzy Sets Syst..

[4]  N. Singpurwalla,et al.  Membership Functions and Probability Measures of Fuzzy Sets , 2004 .

[5]  L. Tucker A METHOD FOR SYNTHESIS OF FACTOR ANALYSIS STUDIES , 1951 .

[6]  Paolo Giordani,et al.  A comparison of three methods for principal component analysis of fuzzy interval data , 2006, Comput. Stat. Data Anal..

[7]  Peter M. Bentler,et al.  Statistical aspects of a three-mode factor analysis model , 1978 .

[8]  Marieke E. Timmerman,et al.  Three-way component analysis with smoothness constraints , 2002 .

[9]  H. Kiers Towards a standardized notation and terminology in multiway analysis , 2000 .

[10]  H. Kiers Some procedures for displaying results from three‐way methods , 2000 .

[11]  J. Chang,et al.  Analysis of individual differences in multidimensional scaling via an n-way generalization of “Eckart-Young” decomposition , 1970 .

[12]  Alex M. Andrew,et al.  Uncertainty and Information: Foundations of Generalized Information Theory , 2006 .

[13]  R. Bro,et al.  Centering and scaling in component analysis , 2003 .

[14]  A. Agresti,et al.  Multiway Data Analysis , 1989 .

[15]  Volker Krätschmer,et al.  Limit distributions of least squares estimators in linear regression models with vague concepts , 2006 .

[16]  Thierry Denoeux,et al.  Principal component analysis of fuzzy data using autoassociative neural networks , 2004, IEEE Transactions on Fuzzy Systems.

[17]  Hans-Jürgen Zimmermann,et al.  Fuzzy Set Theory - and Its Applications , 1985 .

[18]  Junzo Watada,et al.  Fuzzy Principal Component Analysis and Its Application , 1997 .

[19]  Richard A. Harshman,et al.  Foundations of the PARAFAC procedure: Models and conditions for an "explanatory" multi-model factor analysis , 1970 .

[20]  Pierpaolo D'Urso,et al.  A possibilistic approach to latent component analysis for symmetric fuzzy data , 2005, Fuzzy Sets Syst..

[21]  J. Kruskal Three-way arrays: rank and uniqueness of trilinear decompositions, with application to arithmetic complexity and statistics , 1977 .

[22]  W. Woodall,et al.  A probabilistic and statistical view of fuzzy methods , 1995 .

[23]  H. Kiers,et al.  Bootstrap confidence intervals for three‐way methods , 2004 .

[24]  Paolo Giordani,et al.  Principal Component Analysis of symmetric fuzzy data , 2004, Comput. Stat. Data Anal..

[25]  Volker Krätschmer,et al.  Strong consistency of least-squares estimation in linear regression models with vague concepts , 2006 .