Coordinate Ranking Regularized Non-negative Matrix Factorization

Non-negative Matrix Factorization (NMF) has become increasingly popular in many applications that require data mining techniques such as information retrieval, computer vision, and pattern recognition. NMF aims at approximating the original data matrix in a high dimensional space with the product of two non-negative matrices in a lower dimensional space. In many applications with high dimensional data such as text, data often have a global geometric structure, which typically may not be directly derived from the local information. But the existing literature of NMF completely ignores this problem. This paper proposes a novel matrix factorization algorithm called Coordinate Ranking regularized NMF (CR-NMF) in order to address this problem. The idea of the proposed algorithm is to combine NMF and manifold ranking to encode both local and global geometric structures of the data. Experimental results on two real-world datasets demonstrate the superiority of this algorithm.

[1]  Bernhard Schölkopf,et al.  Learning with Local and Global Consistency , 2003, NIPS.

[2]  Robert D. Nowak,et al.  Multi-Manifold Semi-Supervised Learning , 2009, AISTATS.

[3]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[4]  Mikhail Belkin,et al.  Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[5]  Bernhard Schölkopf,et al.  Ranking on Data Manifolds , 2003, NIPS.

[6]  Xin Liu,et al.  Document clustering based on non-negative matrix factorization , 2003, SIGIR.

[7]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[8]  Hujun Bao,et al.  Understanding the Power of Clause Learning , 2009, IJCAI.

[9]  Jiawei Han,et al.  Non-negative Matrix Factorization on Manifold , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[10]  Xiaojun Wan,et al.  Manifold-Ranking Based Topic-Focused Multi-Document Summarization , 2007, IJCAI.

[11]  Jingrui He,et al.  Manifold-ranking based image retrieval , 2004, MULTIMEDIA '04.

[12]  Luo Si,et al.  Non-Negative Matrix Factorization Clustering on Multiple Manifolds , 2010, AAAI.

[13]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[14]  Markus Breitenbach,et al.  Clustering through ranking on manifolds , 2005, ICML '05.

[15]  Chris H. Q. Ding,et al.  K-nearest-neighbor consistency in data clustering: incorporating local information into global optimization , 2004, SAC '04.

[16]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[17]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.