The Centroid Decomposition: Relationships between Discrete Variational Decompositions and SVDs

The centroid decomposition, an approximation for the singular value decomposition (SVD), has a long history among the statistics/psychometrics community for factor analysis research. We revisit the centroid method in its original context of factor analysis and then adapt it to other than a covariance matrix. The centroid method can be cast as an ${\cal O}(n)$-step ascent method on a hypercube. It is shown empirically that the centroid decomposition provides a measurement of second order statistical information of the original data in the direction of the corresponding left centroid vectors. One major purpose of this work is to show fundamental relationships between the singular value, centroid, and semidiscrete decompositions. This unifies an entire class of truncated SVD approximations. Applications include semantic indexing in information retrieval.

[1]  B. Price A First Course in Factor Analysis , 1993 .

[2]  Tamara G. Kolda,et al.  A semidiscrete matrix decomposition for latent semantic indexing information retrieval , 1998, TOIS.

[3]  Charles R. Johnson,et al.  Matrix analysis , 1985, Statistical Inference for Engineers and Data Scientists.

[4]  P. Sander Decision and estimation theory , 1980 .

[5]  Gene H. Golub,et al.  A Rank-One Reduction Formula and Its Applications to Matrix Factorizations , 1995, SIAM Rev..

[6]  Howard B. Lee,et al.  A First Course in Factor Analysis 2nd Ed , 1973 .

[7]  Willem J. Heiser,et al.  Two Purposes for Matrix Factorization: A Historical Appraisal , 2000, SIAM Rev..

[8]  G KoldaTamara,et al.  A semidiscrete matrix decomposition for latent semantic indexing information retrieval , 1998 .

[9]  C. Eckart,et al.  The approximation of one matrix by another of lower rank , 1936 .

[10]  Michael R. Anderberg,et al.  Cluster Analysis for Applications , 1973 .

[11]  H. Hotelling Analysis of a complex of statistical variables into principal components. , 1933 .

[12]  G. W. STEWARTt ON THE EARLY HISTORY OF THE SINGULAR VALUE DECOMPOSITION * , 2022 .

[13]  E. B. Andersen,et al.  Modern factor analysis , 1961 .

[14]  Paul Horst,et al.  Factor analysis of data matrices , 1965 .

[15]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[16]  D. Luenberger Optimization by Vector Space Methods , 1968 .

[17]  P. B. Ballard The distribution and relations of educational abilities. , 1918 .

[18]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[19]  J. Tukey,et al.  Multiple-Factor Analysis , 1947 .

[20]  R. C. Durfee,et al.  MULTIPLE FACTOR ANALYSIS. , 1967 .