A new efficient method for determining the number of components in PARAFAC models

A new diagnostic called the core consistency diagnostic (CORCONDIA) is suggested for determining the proper number of components for multiway models. It applies especially to the parallel factor analysis (PARAFAC) model, but also to other models that can be considered as restricted Tucker3 models. It is based on scrutinizing the ‘appropriateness’ of the structural model based on the data and the estimated parameters of gradually augmented models. A PARAFAC model (employing dimension‐wise combinations of components for all modes) is called appropriate if adding other combinations of the same components does not improve the fit considerably. It is proposed to choose the largest model that is still sufficiently appropriate. Using examples from a range of different types of data, it is shown that the core consistency diagnostic is an effective tool for determining the appropriate number of components in e.g. PARAFAC models. However, it is also shown, using simulated data, that the theoretical understanding of CORCONDIA is not yet complete. Copyright © 2003 John Wiley & Sons, Ltd.

[1]  G. Ewing Instrumental methods of chemical analysis , 1954 .

[2]  P. Ladefoged,et al.  Factor analysis of tongue shapes. , 1971, Journal of the Acoustical Society of America.

[3]  R. P. McDonald,et al.  A simple comprehensive model for the analysis of covariance structures: Some remarks on applications , 1980 .

[4]  Frans Spaepen,et al.  Metallic Glasses and Metastable Crystalline Phases Produced by Picosecond Pulsed Laser Quenching , 1983 .

[5]  Peter R. Pamment,et al.  Three-mode {parafac} factor analysis in applied research. , 1983 .

[6]  J. Kruskal,et al.  A two-stage procedure incorporating good features of both trilinear and quadrilinear models , 1989 .

[7]  Ben C. Mitchell,et al.  An empirical comparison of resolution methods for three-way arrays , 1993 .

[8]  B. Kowalski,et al.  Theory of medium‐rank second‐order calibration with restricted‐Tucker models , 1994 .

[9]  Age K. Smilde,et al.  Multicomponent Determination of Chlorinated Hydrocarbons Using a Reaction-Based Chemical Sensor. 3. Medium-Rank Second-Order Calibration with Restricted Tucker Models , 1994 .

[10]  Age K. Smilde,et al.  Some theoretical results on second‐order calibration methods for data with and without rank overlap , 1995 .

[11]  Lars Nørgaard Spectral resolution and prediction of slit widths in fluorescence spectroscopy by two‐ and three‐way methods , 1996 .

[12]  R. Bro PARAFAC. Tutorial and applications , 1997 .

[13]  Claus A. Andersson,et al.  PARAFAC2—Part II. Modeling chromatographic data with retention time shifts , 1999 .

[14]  Rasmus Bro,et al.  Calibration methods for complex second-order data , 1999 .

[15]  R. Bro Exploratory study of sugar production using fluorescence spectroscopy and multi-way analysis , 1999 .

[16]  Rasmus Bro,et al.  The N-way Toolbox for MATLAB , 2000 .

[17]  L. Moberg,et al.  Spectrofluorimetric determination of chlorophylls and pheopigments using parallel factor analysis. , 2001, Talanta.

[18]  Didier G. Leibovici,et al.  Multi-way modelling of high-dimensionality electroencephalographic data , 2001 .

[19]  F. Fernández,et al.  Multicomponent kinetic determination of Cu, Zn, Co, Ni and Fe at trace levels by first and second order multivariate calibration , 2001 .