Semisupervised Nonnegative Matrix Factorization for learning the semantics

In real world there are a lot of unlabeled data, and relatively few labeled data. Unlabeled data help to learn a statistical model that can fully describe the global property of data, while labeled data help to minimize the gap between the statistical property and human beings' perception, i.e. labeled data can help to learn the semantics. Nonnegative Matrix Factorization is a popular technique in data analysis, since a lot of real world data are nonnegative. However, traditional NMF is an unsupervised learning algorithm, which means that it cannot make use of the label information. To enable NMF to make use of both labeled and unlabeled data samples, we propose a novel semisupervised Nonnegative Matrix Factorization technique for learning the semantics. The proposed algorithm extracts prior information from the labeled data, and then uses it to guide the later processing. Experimental results with different settings prove the efficacy of the proposed algorithm.

[1]  Patrik O. Hoyer,et al.  Non-negative sparse coding , 2002, Proceedings of the 12th IEEE Workshop on Neural Networks for Signal Processing.

[2]  Yanhua Chen,et al.  Non-Negative Matrix Factorization for Semisupervised Heterogeneous Data Coclustering , 2010, IEEE Transactions on Knowledge and Data Engineering.

[3]  Thomas S. Huang,et al.  Graph Regularized Nonnegative Matrix Factorization for Data Representation. , 2011, IEEE transactions on pattern analysis and machine intelligence.

[4]  Xin Liu,et al.  Document clustering based on non-negative matrix factorization , 2003, SIGIR.

[5]  Chris H. Q. Ding,et al.  Convex and Semi-Nonnegative Matrix Factorizations , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Hujun Bao,et al.  Understanding the Power of Clause Learning , 2009, IJCAI.

[7]  Xiaojin Zhu,et al.  --1 CONTENTS , 2006 .

[8]  Luo Si,et al.  Non-Negative Matrix Factorization Clustering on Multiple Manifolds , 2010, AAAI.

[9]  Fillia Makedon,et al.  Learning from Incomplete Ratings Using Non-negative Matrix Factorization , 2006, SDM.

[10]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[11]  Haesun Park,et al.  Sparse Nonnegative Matrix Factorization for Clustering , 2008 .

[12]  Quanquan Gu,et al.  Neighborhood Preserving Nonnegative Matrix Factorization , 2009, BMVC.