Symmetric and Nonnegative Latent Factor Models for Undirected, High-Dimensional, and Sparse Networks in Industrial Applications

Undirected, high-dimensional, and sparse (HiDS) networks are frequently encountered in industrial applications. They contain rich knowledge regarding various useful patterns. Nonnegative latent factor (NLF) models are effective and efficient in extracting useful knowledge from directed networks. However, they cannot describe the symmetry of an undirected network. For addressing this issue, this paper analyzes the extraction process of NLFs on asymmetric and symmetric matrices, respectively, thereby innovatively achieving the symmetric and nonnegative latent factor (SNLF) models for undirected, HiDS networks. The proposed SNLF models are equipped with: 1) high efficiency; 2) nonnegativity; and 3) symmetry. Experimental results on real networks show that the SNLF models are able to: 1) describe the symmetry of the target network rigorously; 2) ensure the nonnegativity of resultant latent factors; and 3) achieve high computational efficiency when addressing data analysis tasks like missing data estimation.

[1]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[2]  Damian Szklarczyk,et al.  The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored , 2010, Nucleic Acids Res..

[3]  Chih-Jen Lin,et al.  Projected Gradient Methods for Nonnegative Matrix Factorization , 2007, Neural Computation.

[4]  Zoubin Ghahramani,et al.  Probabilistic machine learning and artificial intelligence , 2015, Nature.

[5]  MengChu Zhou,et al.  An Efficient Second-Order Approach to Factorize Sparse Matrices in Recommender Systems , 2015, IEEE Transactions on Industrial Informatics.

[6]  Davide Heller,et al.  STRING v10: protein–protein interaction networks, integrated over the tree of life , 2014, Nucleic Acids Res..

[7]  Zhu-Hong You,et al.  Using manifold embedding for assessing and predicting protein interactions from high-throughput experimental data , 2010, Bioinform..

[8]  P. Paatero,et al.  Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values† , 1994 .

[9]  Domonkos Tikk,et al.  Scalable Collaborative Filtering Approaches for Large Recommender Systems , 2009, J. Mach. Learn. Res..

[10]  Patrick Seemann,et al.  Matrix Factorization Techniques for Recommender Systems , 2014 .

[11]  Zhaoshui He,et al.  Symmetric Nonnegative Matrix Factorization: Algorithms and Applications to Probabilistic Clustering , 2011, IEEE Transactions on Neural Networks.

[12]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[13]  Christian von Mering,et al.  SVD-phy: improved prediction of protein functional associations through singular value decomposition of phylogenetic profiles , 2015, Bioinform..

[14]  Hyunsoo Kim,et al.  Sparse Non-negative Matrix Factorizations via Alternating Non-negativity-constrained Least Squares , 2006 .

[15]  Hareton K. N. Leung,et al.  A Highly Efficient Approach to Protein Interactome Mapping Based on Collaborative Filtering Framework , 2015, Scientific Reports.

[16]  Damian Szklarczyk,et al.  STRING v9.1: protein-protein interaction networks, with increased coverage and integration , 2012, Nucleic Acids Res..

[17]  Patrik O. Hoyer,et al.  Non-negative Matrix Factorization with Sparseness Constraints , 2004, J. Mach. Learn. Res..

[18]  MengChu Zhou,et al.  An Efficient Non-Negative Matrix-Factorization-Based Approach to Collaborative Filtering for Recommender Systems , 2014, IEEE Transactions on Industrial Informatics.

[19]  Pierre Geurts,et al.  DMFSGD: A Decentralized Matrix Factorization Algorithm for Network Distance Prediction , 2012, IEEE/ACM Transactions on Networking.

[20]  Ruslan Salakhutdinov,et al.  Probabilistic Matrix Factorization , 2007, NIPS.

[21]  Andrew M. Gross,et al.  Network-based stratification of tumor mutations , 2013, Nature Methods.

[22]  Yixin Cao,et al.  Identifying overlapping communities as well as hubs and outliers via nonnegative matrix factorization , 2013, Scientific Reports.

[23]  Yoan Shin,et al.  Matrix Completion Optimization for Localization in Wireless Sensor Networks for Intelligent IoT , 2016, Sensors.

[24]  Nikos D. Sidiropoulos,et al.  Non-Negative Matrix Factorization Revisited: Uniqueness and Algorithm for Symmetric Decomposition , 2014, IEEE Transactions on Signal Processing.

[25]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[26]  MengChu Zhou,et al.  A Nonnegative Latent Factor Model for Large-Scale Sparse Matrices in Recommender Systems via Alternating Direction Method , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[27]  Junbin Gao,et al.  Correlated Spatio-Temporal Data Collection in Wireless Sensor Networks Based on Low Rank Matrix Approximation and Optimized Node Sampling , 2014, Sensors.

[28]  Xiaochun Cao,et al.  A Unified Semi-Supervised Community Detection Framework Using Latent Space Graph Regularization , 2015, IEEE Transactions on Cybernetics.

[29]  Zibin Zheng,et al.  QoS-Aware Web Service Recommendation by Collaborative Filtering , 2011, IEEE Transactions on Services Computing.

[30]  Chris H. Q. Ding,et al.  Multi-document summarization via sentence-level semantic analysis and symmetric matrix factorization , 2008, SIGIR '08.

[31]  Yung-Yu Chuang,et al.  Collaborative video reindexing via matrix factorization , 2012, TOMCCAP.

[32]  Wai Lok Woo,et al.  Unsupervised Diagnostic and Monitoring of Defects Using Waveguide Imaging With Adaptive Sparse Representation , 2016, IEEE Transactions on Industrial Informatics.

[33]  MengChu Zhou,et al.  Generating Highly Accurate Predictions for Missing QoS Data via Aggregating Nonnegative Latent Factor Models , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[34]  Andrzej Cichocki,et al.  A Multiplicative Algorithm for Convolutive Non-Negative Matrix Factorization Based on Squared Euclidean Distance , 2009, IEEE Transactions on Signal Processing.

[35]  Weixiong Zhang,et al.  Identification of hybrid node and link communities in complex networks , 2013, Scientific Reports.

[36]  Ponnuthurai N. Suganthan,et al.  Ensemble and Arithmetic Recombination-Based Speciation Differential Evolution for Multimodal Optimization , 2016, IEEE Transactions on Cybernetics.

[37]  Zhaohui Wu,et al.  An Efficient Recommendation Method for Improving Business Process Modeling , 2014, IEEE Transactions on Industrial Informatics.

[38]  Timothy A. Davis,et al.  The university of Florida sparse matrix collection , 2011, TOMS.

[39]  Qiang Yang,et al.  Tracking Mobile Users in Wireless Networks via Semi-Supervised Colocalization , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.