Nonnegative Matrix Factorization and its application to pattern analysis and text mining

Nonnegative Matrix Factorization (NMF) is one of the most promising techniques to reduce the dimensionality of the data. This presentation compares the method with other popular matrix decomposition approaches for various pattern analysis tasks. Among others, NMF has been also widely applied for clustering and latent feature extraction. Several types of the objective functions have been used for NMF in the literature. Instead of minimizing the common Euclidean Distance (EucD) error, we review an alternative method that maximizes the correntropy similarity measure to produce the factorization. Correntropy is an entropy-based criterion defined as a nonlinear similarity measure. Following the discussion of maximization of the correntropy function, we use it to cluster document data set and compare the clustering performance with the EucD-based NMF. Our approach was applied and illustrated for the clustering of documents in the 20-Newsgroups data set. The comparison is illustrated with 20-Newsgroups data set. The results show that our approach produces per average better clustering compared with other methods which use EucD as an objective function.

[1]  Michael W. Berry,et al.  Document clustering using nonnegative matrix factorization , 2006, Inf. Process. Manag..

[2]  BertinNancy,et al.  Nonnegative matrix factorization with the itakura-saito divergence , 2009 .

[3]  Jacek M. Zurada,et al.  Correntropy-Based Document Clustering via Nonnegative Matrix Factorization , 2012, ICANN.

[4]  Ran He,et al.  Maximum Correntropy Criterion for Robust Face Recognition , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Jiang-She Zhang,et al.  Large margin based nonnegative matrix factorization and partial least squares regression for face recognition , 2011, Pattern Recognit. Lett..

[6]  Weifeng Liu,et al.  Correntropy: Properties and Applications in Non-Gaussian Signal Processing , 2007, IEEE Transactions on Signal Processing.

[7]  José Carlos Príncipe,et al.  Self organizing maps with the correntropy induced metric , 2010, The 2010 International Joint Conference on Neural Networks (IJCNN).

[8]  Sang Uk Lee,et al.  Occlusion invariant face recognition using selective local non-negative matrix factorization basis images , 2008, Image Vis. Comput..

[9]  Patrik O. Hoyer,et al.  Non-negative Matrix Factorization with Sparseness Constraints , 2004, J. Mach. Learn. Res..

[10]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[11]  Jérôme Idier,et al.  Algorithms for Nonnegative Matrix Factorization with the β-Divergence , 2010, Neural Computation.

[12]  Ning Li,et al.  A new non-negative matrix factorization algorithm with sparseness constraints , 2011, 2011 International Conference on Machine Learning and Cybernetics.

[13]  Xuelong Li,et al.  Constrained Nonnegative Matrix Factorization for Image Representation , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[15]  Ran He,et al.  Robust Principal Component Analysis Based on Maximum Correntropy Criterion , 2011, IEEE Transactions on Image Processing.

[16]  Chih-Jen Lin,et al.  Projected Gradient Methods for Nonnegative Matrix Factorization , 2007, Neural Computation.

[17]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[18]  Michael W. Berry,et al.  Algorithms and applications for approximate nonnegative matrix factorization , 2007, Comput. Stat. Data Anal..

[19]  P. Paatero Least squares formulation of robust non-negative factor analysis , 1997 .

[20]  Bhavin J. Shastri,et al.  Face recognition using localized features based on non-negative sparse coding , 2006, Machine Vision and Applications.

[21]  Jacek M. Zurada,et al.  Occluded Face Recognition Using Correntropy-Based Nonnegative Matrix Factorization , 2012, 2012 11th International Conference on Machine Learning and Applications.

[22]  Moongu Jeon,et al.  Face Recognition Using Region-Based Nonnegative Matrix Factorization , 2009, FGIT-FGCN.