Improving K-nearest-neighborhood based Collaborative Filtering via Similarity Support

Collaborative Filtering (CF) is the most popular choice when implementing personalized recommender systems. A classical approach to CF is based on K-nearest-neighborhood (KNN) model, where the precondition for making recommendations is the KNN construction for involved entities. However, when building KNN sets, there exits the dilemma to decide the value of K --a small value will lead to poor recommendation quality, whereas a large one will result in high computational complexity. In this work we firstly empirically validate that the suitable value of K in KNN based CF is affected by the number of the totally involved entities, and then focus on optimizing the KNN building process for providing high recommendation performance as well as maintaining acceptable KNN size. To achieve this objective, we propose a novel KNN metric named Similarity Support (SS). By taking SS into consideration, we designed a series of strategies for optimizing the KNN based CF. The empirical studies on public large, real datasets showed that due to the improvement on KNN construction brought by SS, recommender optimized by our strategies turned out to be superior to original KNN based CF in terms of both recommendation performance and computational complexity.

[1]  Yao Zhao,et al.  A Collaborative Filtering Recommendation Algorithm Based on User Interest Change and Trust Evaluation , 2010, J. Digit. Content Technol. its Appl..

[2]  Arkadiusz Paterek,et al.  Improving regularized singular value decomposition for collaborative filtering , 2007 .

[3]  Caiming Zhang,et al.  Image Interpolation using Collaborative Filtering , 2011 .

[4]  András A. Benczúr,et al.  Methods for large scale SVD with missing values , 2007 .

[5]  Yehuda Koren,et al.  Improved Neighborhood-based Collaborative Filtering , 2007 .

[6]  George Karypis,et al.  Item-based top-N recommendation algorithms , 2004, TOIS.

[7]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[8]  Yehuda Koren,et al.  Factor in the neighbors: Scalable and accurate collaborative filtering , 2010, TKDD.

[9]  Domonkos Tikk,et al.  Investigation of Various Matrix Factorization Methods for Large Recommender Systems , 2008, 2008 IEEE International Conference on Data Mining Workshops.

[10]  John Riedl,et al.  An Empirical Analysis of Design Choices in Neighborhood-Based Collaborative Filtering Algorithms , 2002, Information Retrieval.

[11]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[12]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.