Distributed randomized singular value decomposition using count sketch

Compared with other recommendation algorithms, Matrix decomposition is frequently used in the current recommendation system. It can not only lead to better results, but also can fully take the influence of various factors into account, which explains its good scalability. Matrix decomposition includ-es SVD(Singular Value Decomposition), non-negative matrix decomposition, Latent Factor Model and some other traditional matrix decomposition techniques is designed to approximate a high-dimensional matrix with low-dimensional. As a perfect technique in recommendation system, SVD is traditionally expert at dense matrix decomposition. However, real rating matrix are sparse, and have high time complexity of SVD, if the matrix size increases rapidly, the efficiency must become unacceptable. The combination of random algorithm and matrix decomposition turns traditional matrix decomposition into random matrix decomposition technique under distributed system environment. The random singular value decomposition technique illustrated in the following content can be at the expense of little accuracy under the premise of greatly improving the efficiency of the calculation.

[1]  Emmanuel J. Candès,et al.  Randomized Algorithms for Low-Rank Matrix Factorizations: Sharp Performance Bounds , 2013, Algorithmica.

[2]  Zhenyu He,et al.  Robust Object Tracking via Key Patch Sparse Representation , 2017, IEEE Transactions on Cybernetics.

[3]  Rasmus Pagh,et al.  Fast and scalable polynomial kernels via explicit feature maps , 2013, KDD.

[4]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[5]  Eric Price,et al.  Improved Concentration Bounds for Count-Sketch , 2012, SODA.

[6]  Volker Markl,et al.  Distributed matrix factorization with mapreduce using a series of broadcast-joins , 2013, RecSys.

[7]  Heikki Mannila,et al.  Random projection in dimensionality reduction: applications to image and text data , 2001, KDD '01.

[8]  David P. Woodruff,et al.  Low rank approximation and regression in input sparsity time , 2013, STOC '13.

[9]  Jack J. Dongarra,et al.  Randomized algorithms to update partial singular value decomposition on a hybrid CPU/GPU cluster , 2015, SC15: International Conference for High Performance Computing, Networking, Storage and Analysis.

[10]  Inderjit S. Dhillon,et al.  Parallel matrix factorization for recommender systems , 2014, Knowl. Inf. Syst..

[11]  Hao Huang,et al.  Streaming Anomaly Detection Using Randomized Matrix Sketching , 2015, Proc. VLDB Endow..

[12]  Nathan Halko,et al.  Finding Structure with Randomness: Probabilistic Algorithms for Constructing Approximate Matrix Decompositions , 2009, SIAM Rev..

[13]  V. Rokhlin,et al.  A fast randomized algorithm for the approximation of matrices ✩ , 2007 .

[14]  V. Rokhlin,et al.  A randomized algorithm for the approximation of matrices , 2006 .

[15]  Amir Averbuch,et al.  Randomized LU decomposition using sparse projections , 2016, Comput. Math. Appl..

[16]  Edo Liberty,et al.  Simple and deterministic matrix sketching , 2012, KDD.

[17]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[18]  Marios Hadjieleftheriou,et al.  Methods for finding frequent items in data streams , 2010, The VLDB Journal.

[19]  Zhenyu He,et al.  Connected Component Model for Multi-Object Tracking , 2016, IEEE Transactions on Image Processing.