Improving neighbor-based collaborative filtering by using a hybrid similarity measurement

Abstract Memory-based collaborative filtering is one of the recommendation system methods used to predict a user’s rating or preference by exploring historic ratings, but without incorporating any content information about users or items. It can be either item-based or user-based. Taking item-based Collaborative Filtering (CF) as an example, the way it makes predictions is accomplished in 2 steps: first, it selects based on pair-wise similarities a number of most similar items to the predicting item from those that the user has already rated on. Second, it aggregates the user’s opinions on those most similar items to predict a rating on the predicting item. Thus, similarity measurement determines which items are similar, and plays an important role on how accurate the predictions are. Many studies have been conducted on memory-based CFs to improve prediction accuracy, but none of them have achieved better prediction accuracy than state-of-the-art model-based CFs. In this paper, we proposed a new approach that combines both structural and rating-based similarity measurement. We found that memory-based CF using combined similarity measurement can achieve better prediction accuracy than model-based CFs in terms of lower MAE and reduce memory and time by using less neighbors than traditional memory-based CFs on MovieLens and Netflix datasets.

[1]  Ning Zhang,et al.  An improved collaborative filtering method based on similarity , 2018, PloS one.

[2]  Taghi M. Khoshgoftaar,et al.  A Survey of Collaborative Filtering Techniques , 2009, Adv. Artif. Intell..

[3]  Xiaoyuan Su,et al.  Hybrid Collaborative Filtering Algorithms Using a Mixture of Experts , 2007, IEEE/WIC/ACM International Conference on Web Intelligence (WI'07).

[4]  George Karypis,et al.  A Comprehensive Survey of Neighborhood-based Recommendation Methods , 2011, Recommender Systems Handbook.

[5]  Liang Luo,et al.  An Efficient Similarity Measure for Collaborative Filtering , 2018, IIKI.

[6]  Fernando Ortega,et al.  A collaborative filtering approach to mitigate the new user cold start problem , 2012, Knowl. Based Syst..

[7]  Greg Linden,et al.  Amazon . com Recommendations Item-to-Item Collaborative Filtering , 2001 .

[8]  Hyung Jun Ahn,et al.  A new similarity measure for collaborative filtering to alleviate the new user cold-starting problem , 2008, Inf. Sci..

[9]  Markus Schedl,et al.  Local and global scaling reduce hubs in space , 2012, J. Mach. Learn. Res..

[10]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[11]  Jesús Bobadilla,et al.  A new collaborative filtering metric that improves the behavior of recommender systems , 2010, Knowl. Based Syst..

[12]  Robin D. Burke,et al.  Hybrid Recommender Systems: Survey and Experiments , 2002, User Modeling and User-Adapted Interaction.

[13]  Masashi Sugiyama,et al.  Active Learning in Recommender Systems , 2011, Recommender Systems Handbook.

[14]  Zhi-Heng Zhang,et al.  Integrating Triangle and Jaccard similarities for recommendation , 2017, PloS one.

[15]  Tat-Seng Chua,et al.  Neural Collaborative Filtering , 2017, WWW.

[16]  Erik Brynjolfsson,et al.  Consumer Surplus in the Digital Economy: Estimating the Value of Increased Product Variety at Online Booksellers , 2003, Manag. Sci..

[17]  Hui Tian,et al.  A new user similarity model to improve the accuracy of collaborative filtering , 2014, Knowl. Based Syst..

[18]  Nikolaos Polatidis,et al.  A multi-level collaborative filtering method that improves recommendations , 2016, Expert Syst. Appl..

[19]  Ville Ollikainen,et al.  A new similarity measure using Bhattacharyya coefficient for collaborative filtering in sparse data , 2015, Knowl. Based Syst..

[20]  Jure Leskovec,et al.  Graph Convolutional Neural Networks for Web-Scale Recommender Systems , 2018, KDD.

[21]  Sophie Ahrens,et al.  Recommender Systems , 2012 .

[22]  Chunming Rong,et al.  Fast algorithms to evaluate collaborative filtering recommender systems , 2016, Knowl. Based Syst..

[23]  Michael E. Tipping,et al.  Probabilistic Principal Component Analysis , 1999 .

[24]  F. Maxwell Harper,et al.  The MovieLens Datasets: History and Context , 2016, TIIS.

[25]  Douglas B. Terry,et al.  Using collaborative filtering to weave an information tapestry , 1992, CACM.

[26]  G. F. Hughes,et al.  On the mean accuracy of statistical pattern recognizers , 1968, IEEE Trans. Inf. Theory.

[27]  Tat-Seng Chua,et al.  Neural Graph Collaborative Filtering , 2019, SIGIR.

[28]  Alexandros Nanopoulos,et al.  Hubs in Space: Popular Nearest Neighbors in High-Dimensional Data , 2010, J. Mach. Learn. Res..

[29]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[30]  Marc Leman,et al.  Content-Based Music Information Retrieval: Current Directions and Future Challenges , 2008, Proceedings of the IEEE.

[31]  Thomas Hofmann,et al.  Latent semantic models for collaborative filtering , 2004, TOIS.

[32]  C. Willmott,et al.  Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance , 2005 .

[33]  C. Willmott,et al.  Ambiguities inherent in sums-of-squares-based error statistics , 2009 .