Large-scale recommender system with compact latent factor model

A compact latent factor model for query-based recommendation is proposed.The proposed model can process data incrementally.The proposed model can deal with cold-start problems.The proposed model can extend to context-aware recommendation algorithm.The proposed algorithm outperforms other alternatives on three datasets. This work devises a factorization model called compact latent factor model, in which we propose a compact representation to consider query, user and item in the model. The blend of information retrieval and collaborative filtering is a typical setting in many applications. The proposed model can incorporate various features into the model, and this work demonstrates that the proposed model can incorporate context-aware and content-based features to handle context-aware recommendation and cold-start problems, respectively. Besides recommendation accuracy, a critical problem concerning the computational cost emerges in practical situations. To tackle this problem, this work uses a buffer update scheme to allow the proposed model to process data incrementally, and provide a means to use historical data instances. Meanwhile, we use stochastic gradient descent algorithm along with sampling technique to optimize ranking loss, giving a competitive performance while considering scalability and deployment issues. The experimental results indicate that the proposed algorithm outperforms other alternatives on four datasets.

[1]  Kuan-Wei Wu,et al.  A Two-Stage Ensemble of Diverse Models for Advertisement Ranking in KDD Cup 2012 , 2012 .

[2]  Greg Linden,et al.  Amazon . com Recommendations Item-to-Item Collaborative Filtering , 2001 .

[3]  Qiudan Li,et al.  A recommender system based on tag and time information for social tagging systems , 2011, Expert Syst. Appl..

[4]  Rafail Ostrovsky,et al.  Efficient search for approximate nearest neighbor in high dimensional spaces , 1998, STOC '98.

[5]  Marko Tkalcic,et al.  Database for contextual personalization , 2011 .

[6]  Nuria Oliver,et al.  Multiverse recommendation: n-dimensional tensor factorization for context-aware collaborative filtering , 2010, RecSys '10.

[7]  Wan-Shiou Yang,et al.  A location-aware recommender system for mobile shopping environments , 2008, Expert Syst. Appl..

[8]  Robert Krauthgamer,et al.  Navigating nets: simple algorithms for proximity search , 2004, SODA '04.

[9]  J. Bobadilla,et al.  Recommender systems survey , 2013, Knowl. Based Syst..

[10]  Deepak Agarwal,et al.  Latent OLAP: data cubes over latent variables , 2011, SIGMOD '11.

[11]  Chong Wang,et al.  Latent Collaborative Retrieval , 2012, ICML.

[12]  Rafael Valencia-García,et al.  RecomMetz: A context-aware knowledge-based mobile recommender system for movie showtimes , 2015, Expert Syst. Appl..

[13]  Abhinandan Das,et al.  Google news personalization: scalable online collaborative filtering , 2007, WWW '07.

[14]  Xavier Amatriain,et al.  Building industrial-scale real-world recommender systems , 2012, RecSys.

[15]  Ping Li,et al.  Asymmetric LSH (ALSH) for Sublinear Time Maximum Inner Product Search (MIPS) , 2014, NIPS.

[16]  Léon Bottou,et al.  Large-Scale Machine Learning with Stochastic Gradient Descent , 2010, COMPSTAT.

[17]  Steffen Rendle,et al.  Factorization Machines , 2010, 2010 IEEE International Conference on Data Mining.

[18]  Jurij F. Tasic,et al.  Predicting and Detecting the Relevant Contextual Information in a Movie-Recommender System , 2013, Interact. Comput..

[19]  Angshul Majumdar,et al.  Matrix completion incorporating auxiliary information for recommender system design , 2015, Expert Syst. Appl..

[20]  Masataka Goto,et al.  An Efficient Hybrid Music Recommender System Using an Incrementally Trainable Probabilistic Generative Model , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[21]  Qingsheng Zhu,et al.  Incremental Collaborative Filtering recommender based on Regularized Matrix Factorization , 2012, Knowl. Based Syst..

[22]  Hod Lipson,et al.  Learning symbolic representations of hybrid dynamical systems , 2012, J. Mach. Learn. Res..

[23]  Lars Schmidt-Thieme,et al.  Fast context-aware recommendations with factorization machines , 2011, SIGIR.

[24]  Jason Weston,et al.  Large-Scale Music Annotation and Retrieval: Learning to Rank in Joint Semantic Spaces , 2011, ArXiv.

[25]  Piotr Indyk,et al.  Approximate nearest neighbors: towards removing the curse of dimensionality , 1998, STOC '98.

[26]  Yoon Ho Cho,et al.  A personalized recommender system based on web usage mining and decision tree induction , 2002, Expert Syst. Appl..

[27]  Sophia Ananiadou,et al.  Stochastic Gradient Descent Training for L1-regularized Log-linear Models with Cumulative Penalty , 2009, ACL.

[28]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[29]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[30]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[31]  Steffen Rendle,et al.  Factorization Machines with libFM , 2012, TIST.

[32]  Patrick Seemann,et al.  Matrix Factorization Techniques for Recommender Systems , 2014 .

[33]  Yoram Singer,et al.  Pegasos: primal estimated sub-gradient solver for SVM , 2011, Math. Program..

[34]  Deepak Agarwal,et al.  Regression-based latent factor models , 2009, KDD.

[35]  Geoffrey E. Hinton,et al.  Restricted Boltzmann machines for collaborative filtering , 2007, ICML '07.

[36]  Yong Yu,et al.  SVDFeature: a toolkit for feature-based collaborative filtering , 2012, J. Mach. Learn. Res..

[37]  Stathes Hadjiefthymiades,et al.  Facing the cold start problem in recommender systems , 2014, Expert Syst. Appl..

[38]  Chien-Liang Liu,et al.  Fast recommendation on latent collaborative relations , 2016, Knowl. Based Syst..

[39]  Jason Weston,et al.  WSABIE: Scaling Up to Large Vocabulary Image Annotation , 2011, IJCAI.

[40]  Steffen Rendle Scaling Factorization Machines to Relational Data , 2013, Proc. VLDB Endow..

[41]  Chih-Jen Lin,et al.  A fast parallel SGD for matrix factorization in shared memory systems , 2013, RecSys.