A Momentum-Accelerated Hessian-Vector-Based Latent Factor Analysis Model

Service-oriented applications commonly involve high-dimensional and sparse (HiDS) interactions among users and service-related entities, e.g., user-item interactions from a personalized recommendation services system. How to perform precise and efficient representation learning on such HiDS interactions data is a hot yet thorny issue. An efficient approach to it is latent factor analysis (LFA), which commonly depends on large-scale non-convex optimization. Hence, it is vital to implement an LFA model able to approximate second-order stationary points efficiently for enhancing its representation learning ability. However, existing second-order LFA models suffer from high computational cost, which significantly reduces its practicability. To address this issue, this paper presents a Momentum-accelerated Hessian-vector algorithm (MH) for precise and efficient LFA on HiDS data. Its main ideas are two-fold: a) adopting the principle of a Hessian-vector-product-based method to utilize the second-order information without manipulating a Hessian matrix directly, and b) incorporating a generalized momentum method into its parameter learning scheme for accelerating its convergence rate to a stationary point. Experimental results on nine industrial datasets demonstrate that compared with state-of-the-art LFA models, an MH-based LFA model achieves gains in both accuracy and convergence rate. These positive outcomes also indicate that a generalized momentum method is compatible with the algorithms, e.g., a second-order algorithm, which implicitly rely on gradients.

[1]  Xin Luo,et al.  Generalized Nesterov's Acceleration-Incorporated, Non-Negative and Adaptive Latent Factor Analysis , 2022, IEEE Transactions on Services Computing.

[2]  Lun Hu,et al.  A Distributed Framework for Large-scale Protein-protein Interaction Data Analysis and Prediction Using MapReduce , 2022, IEEE/CAA Journal of Automatica Sinica.

[3]  Mengchu Zhou,et al.  Symmetric Nonnegative Matrix Factorization-Based Community Detection Models and Their Convergence Analysis , 2021, IEEE Transactions on Neural Networks and Learning Systems.

[4]  Qing Tao,et al.  Momentum Acceleration in the Individual Convergence of Nonsmooth Convex Optimization With Constraints , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[5]  Wei Yuan,et al.  A new non-adaptive optimization method: Stochastic gradient descent with momentum and difference , 2021, Applied Intelligence.

[6]  Haitao Yuan,et al.  Profit-Maximized Collaborative Computation Offloading and Resource Allocation in Distributed Cloud and Edge Computing Systems , 2021, IEEE Transactions on Automation Science and Engineering.

[7]  Yuntao Qian,et al.  Multi-stage stochastic gradient method with momentum acceleration , 2021, Signal Process..

[8]  Xin Luo,et al.  Robust Latent Factor Analysis for Precise Representation of High-Dimensional and Sparse Data , 2021, IEEE/CAA Journal of Automatica Sinica.

[9]  Martin Jaggi,et al.  Quasi-Global Momentum: Accelerating Decentralized Deep Learning on Heterogeneous Data , 2021, ICML.

[10]  Mingsheng Shang,et al.  A Fast Non-Negative Latent Factor Model Based on Generalized Momentum Method , 2021, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[11]  Mohsen Guizani,et al.  Trust-Based Cloud Machine Learning Model Selection for Industrial IoT and Smart City Services , 2020, IEEE Internet of Things Journal.

[12]  Aili Wang,et al.  SOLAR: Services-Oriented Deep Learning Architectures-Deep Learning as a Service , 2017, IEEE Transactions on Services Computing.

[13]  Xin Luo,et al.  A Generalized-Momentum-Accelerated Hessian-Vector Algorithm for High-Dimensional and Sparse Data , 2020, 2020 IEEE International Conference on Data Mining (ICDM).

[14]  Minyi Guo,et al.  Learning User Preference from Heterogeneous Information for Store-Type Recommendation , 2020, IEEE Transactions on Services Computing.

[15]  Xuanzhe Liu,et al.  Decentralized Services Computing Paradigm for Blockchain-Based Data Governance: Programmability, Interoperability, and Intelligence , 2020, IEEE Transactions on Services Computing.

[16]  Xiong Luo,et al.  A Parallel Recommender System Using a Collaborative Filtering Algorithm with Correntropy for Social Networks , 2020, IEEE Transactions on Network Science and Engineering.

[17]  Jia Zhang,et al.  DLTSR: A Deep Learning Framework for Recommendations of Long-Tail Web Services , 2020, IEEE Transactions on Services Computing.

[18]  Feng Jiang,et al.  A cross-domain collaborative filtering algorithm with expanding user and item features via the latent factor space of auxiliary domains , 2019, Pattern Recognit..

[19]  Laurence T. Yang,et al.  A Double Deep Q-Learning Model for Energy-Efficient Edge Scheduling , 2019, IEEE Transactions on Services Computing.

[20]  Dietmar Jannach,et al.  Measuring the Business Value of Recommender Systems , 2019, ACM Trans. Manag. Inf. Syst..

[21]  Hong Shen,et al.  ISLF: Interest Shift and Latent Factors Combination Model for Session-based Recommendation , 2019, IJCAI.

[22]  MengChu Zhou,et al.  An Effective Scheme for QoS Estimation via Alternating Direction Method-Based Matrix Factorization , 2019, IEEE Transactions on Services Computing.

[23]  Zibin Zheng,et al.  Learning the Evolution Regularities for BigService-Oriented Online Reliability Prediction , 2019, IEEE Transactions on Services Computing.

[24]  Changjun Jiang,et al.  Correlated Matrix Factorization for Recommendation with Implicit Feedback , 2019, IEEE Transactions on Knowledge and Data Engineering.

[25]  Ming Zhang,et al.  Overfitting remedy by sparsifying regularization on fully-connected layers of CNNs , 2019, Neurocomputing.

[26]  Shiliang Sun,et al.  An Accelerated Linearly Convergent Stochastic L-BFGS Algorithm , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[27]  Yu Xie,et al.  Cost-Efficient Consolidating Service for Aliyun’s Cloud-Scale Computing , 2019, IEEE Transactions on Services Computing.

[28]  Mingsheng Shang,et al.  DCCR: Deep Collaborative Conjunctive Recommender for Rating Prediction , 2019, IEEE Access.

[29]  MengChu Zhou,et al.  Incorporation of Efficient Second-Order Solvers Into Latent Factor Models for Accurate Prediction of Missing QoS Data , 2018, IEEE Transactions on Cybernetics.

[30]  Jia Chen,et al.  Randomized latent factor model for high-dimensional and sparse matrices from industrial applications , 2018, 2018 IEEE 15th International Conference on Networking, Sensing and Control (ICNSC).

[31]  Mohamed Elshenawy,et al.  Towards a service-oriented cyber-physical systems of systems for smart city mobility applications , 2018, Future Gener. Comput. Syst..

[32]  Joseph A. Konstan,et al.  Rating-Based Collaborative Filtering: Algorithms and Evaluation , 2018, Social Information Access.

[33]  Yu-Jun Zheng,et al.  A Pythagorean-Type Fuzzy Deep Denoising Autoencoder for Industrial Accident Early Warning , 2017, IEEE Transactions on Fuzzy Systems.

[34]  Tat-Seng Chua,et al.  Neural Collaborative Filtering , 2017, WWW.

[35]  Michael I. Jordan,et al.  How to Escape Saddle Points Efficiently , 2017, ICML.

[36]  Ji Feng,et al.  Deep Forest: Towards An Alternative to Deep Neural Networks , 2017, IJCAI.

[37]  Cho-Jui Hsieh,et al.  HogWild++: A New Mechanism for Decentralized Asynchronous Stochastic Gradient Descent , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[38]  Ling Liu,et al.  Services Computing: From Cloud Services, Mobile Services to Internet of Services , 2016, IEEE Trans. Serv. Comput..

[39]  Daniel P. Robinson,et al.  A trust region algorithm with a worst-case iteration complexity of O ((cid:2) − 3 / 2 ) for nonconvex optimization , 2016 .

[40]  Sheng Li,et al.  Deep Collaborative Filtering via Marginalized Denoising Auto-encoder , 2015, CIKM.

[41]  Scott Sanner,et al.  AutoRec: Autoencoders Meet Collaborative Filtering , 2015, WWW.

[42]  MengChu Zhou,et al.  An Efficient Non-Negative Matrix-Factorization-Based Approach to Collaborative Filtering for Recommender Systems , 2014, IEEE Transactions on Industrial Informatics.

[43]  Zhaohui Wu,et al.  An Efficient Recommendation Method for Improving Business Process Modeling , 2014, IEEE Transactions on Industrial Informatics.

[44]  A. Hanjalic,et al.  Collaborative Filtering beyond the User-Item Matrix: A Survey of the State of the Art and Future Challenges , 2014, ACM Comput. Surv..

[45]  Geoffrey E. Hinton,et al.  On the importance of initialization and momentum in deep learning , 2013, ICML.

[46]  Zibin Zheng,et al.  Predicting Quality of Service for Selection by Neighborhood-Based Collaborative Filtering , 2013, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[47]  Haralambos Sarimveis,et al.  Radial Basis Function Network Training Using a Nonsymmetric Partition of the Input Space and Particle Swarm Optimization , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[48]  Ilya Sutskever,et al.  Training Deep and Recurrent Networks with Hessian-Free Optimization , 2012, Neural Networks: Tricks of the Trade.

[49]  Quoc V. Le,et al.  On optimization methods for deep learning , 2011, ICML.

[50]  Zibin Zheng,et al.  QoS-Aware Web Service Recommendation via Collaborative Filtering , 2014, Web Services Foundations.

[51]  James Martens,et al.  Deep learning via Hessian-free optimization , 2010, ICML.

[52]  Zibin Zheng,et al.  Collaborative reliability prediction of service-oriented systems , 2010, 2010 ACM/IEEE 32nd International Conference on Software Engineering.

[53]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[54]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[55]  Steven M. LaValle,et al.  On the Relationship between Classical Grid Search and Probabilistic Roadmaps , 2004, Int. J. Robotics Res..

[56]  Raymond J. Mooney,et al.  Content-boosted collaborative filtering for improved recommendations , 2002, AAAI/IAAI.

[57]  D K Smith,et al.  Numerical Optimization , 2001, J. Oper. Res. Soc..

[58]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[59]  Bradley N. Miller,et al.  GroupLens: applying collaborative filtering to Usenet news , 1997, CACM.

[60]  Y. Nesterov A method for solving the convex programming problem with convergence rate O(1/k^2) , 1983 .

[61]  C. G. Broyden The Convergence of a Class of Double-rank Minimization Algorithms 2. The New Algorithm , 1970 .