Simple component analysis

With a large number of variables measuring different aspects of a same theme, we would like to summarize the information in a limited number of components, i.e. linear combinations of the original variables. Among linear dimension reduction techniques, principal component analysis is optimal in at least two ways: principal components extract the maximum of the variability of the original variables, and they are uncorrelated. Unfortunately, they are often difficult to interpret. Moreover, in most applications, only the first principal component is a 'block component', the remaining components being 'difference components' which are also more difficult to interpret. The goal of simple component analysis is to replace (or to supplement) principal components with suboptimal but better interpretable 'simple components'. We propose a fast algorithm which seeks the optimal system of components under constraints of simplicity. Thus, in contrast with other techniques like 'varimax', this approach always provides a simple solution. The optimal simple system is suboptimal compared with principal components: less variability is extracted and components are correlated. However, if the loss of extracted variability is small, and correlations between components are low, it might be advantageous for practical use. Moreover, our concept of simplicity allows the system to have more than one block component, which also facilitates interpretation. Simplicity is not a guarantee for interpretability. With the help of our algorithm, the user can partly modify an optimal simple system of components to enhance interpretability. In this respect, the ultimate goal of simple component analysis is not to propose a method that leads automatically to a unique solution, but rather to develop tools for assisting the user in his or her choice of an interpretable solution. Finally, we argue that simple components may also make the task of choosing the dimension easier. The methodology is illustrated with a test battery to study the development of neuromotor functions in children and adolescents. Copyright 2004 Royal Statistical Society.

[1]  H. Kaiser The varimax criterion for analytic rotation in factor analysis , 1958 .

[2]  Theo Gasser,et al.  Assessing intrarater, interrater and test–retest reliability of continuous measurements , 2002, Statistics in medicine.

[3]  C. R. Rao,et al.  Linear Statistical Inference and its Applications , 1968 .

[4]  J. Edward Jackson,et al.  A User's Guide to Principal Components: Jackson/User's Guide to Principal Components , 2004 .

[5]  L Molinari,et al.  Neuromotor development from 5 to 18 years. Part 1: timed performance. , 2001, Developmental medicine and child neurology.

[6]  J. E. Jackson A User's Guide to Principal Components , 1991 .

[7]  Daniel Gervini,et al.  Criteria for Evaluating Dimension-Reducing Components for Multivariate Data , 2004 .

[8]  Theo Gasser,et al.  Some Case Studies of Simple Component Analysis , 2003 .

[9]  Pin T. Ng,et al.  Summarizing the Effect of a Wide Array of Amenity Measures into Simple Components , 2005 .

[10]  I. Jolliffe Principal Component Analysis , 2002 .

[11]  Alexander Basilevsky,et al.  Statistical Factor Analysis and Related Methods , 1994 .

[12]  H. Hotelling Analysis of a complex of statistical variables into principal components. , 1933 .

[13]  Jorge Cadima Departamento de Matematica Loading and correlations in the interpretation of principle compenents , 1995 .

[14]  J. N. R. Jeffers,et al.  Two Case Studies in the Application of Principal Component Analysis , 1967 .

[15]  I. Jolliffe,et al.  A Modified Principal Component Technique Based on the LASSO , 2003 .

[16]  D. F. Morrison,et al.  Multivariate Statistical Methods , 1968 .

[17]  S. Vines Simple principal components , 2000 .

[18]  A. Leplège,et al.  Methodological issues in determining the dimensionality of composite health measures using principal component analysis: Case illustration and suggestions for practice , 2005, Quality of Life Research.

[19]  I. Jolliffe Rotation of principal components: choice of normalization constraints , 1995 .

[20]  R. Shanmugam Multivariate Analysis: Part 2: Classification, Covariance Structures and Repeated Measurements , 1998 .

[21]  Ian T. Jolliffe,et al.  The Simplified Component Technique: An Alternative to Rotated Principal Components , 2000 .

[22]  H. Chipman,et al.  Interpretable dimension reduction , 2005 .