A Collaborative Method to Reduce the Running Time and Accelerate the k-Nearest Neighbors Search

DOI reference number: 10.18293/SEKE2015-153 Abstract—Recommendation systems are software tools and techniques that provide customized content to users. The collaborative filtering is one of the most prominent approaches in the recommendation area. Among the collaborative algorithms, one of the most popular is the k-Nearest Neighbors (kNN) which is an instance-based learning method. The kNN generates recommendations based on the ratings of the most similar users (nearest neighbors) to the target one. Despite being quite effective, the algorithm performance drops while running on large datasets. We propose a method, called Restricted Space kNN that is based on the restriction of the neighbors search space through a fast and efficient heuristic. The heuristic builds the new search space from the most active users. As a result, we found that using only 15% of the original search space the proposed method generated recommendations almost as accurate as the standard kNN, but with almost 58% less running time.

[1]  Qi Zhang,et al.  A Novel Nearest Neighborhood Algorithm for Recommender Systems , 2012, 2012 Third Global Congress on Intelligent Systems.

[2]  Piotr Indyk,et al.  Similarity Search in High Dimensions via Hashing , 1999, VLDB.

[3]  Henk de Poot,et al.  Information overload: why some people seem to suffer more than others , 2006, NordiCHI '06.

[4]  Dianhong Wang,et al.  Survey of Improving K-Nearest-Neighbor for Classification , 2007, Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007).

[5]  Tiejian Luo,et al.  Towards an Introduction to Collaborative Filtering , 2009, 2009 International Conference on Computational Science and Engineering.

[6]  Helena Ramalhinho Dias Lourenço,et al.  Iterated Local Search , 2001, Handbook of Metaheuristics.

[7]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[8]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[9]  Fabio Stella,et al.  Towards Explaining Latent Factors with Topic Models in Collaborative Recommender Systems , 2013, 2013 24th International Workshop on Database and Expert Systems Applications.

[10]  Jon Louis Bentley,et al.  Multidimensional binary search trees used for associative searching , 1975, CACM.

[11]  Karl Aberer,et al.  LSH At Large - Distributed KNN Search in High Dimensions , 2008, WebDB.

[12]  Felipe Barbosa Araújo Ramos,et al.  A recommendation approach for digital TV systems based on multimodal features , 2014, SAC.

[13]  Patrick Grother,et al.  Fast implementations of nearest neighbor classifiers , 1997, Pattern Recognit..

[14]  Jae Kyeong Kim,et al.  A literature review and classification of recommender systems research , 2012, Expert Syst. Appl..

[15]  Qing Liao,et al.  Weight Based KNN Recommender System , 2013, 2013 5th International Conference on Intelligent Human-Machine Systems and Cybernetics.

[16]  Jon Louis Bentley,et al.  An Algorithm for Finding Best Matches in Logarithmic Expected Time , 1977, TOMS.

[17]  Nicole Immorlica,et al.  Locality-sensitive hashing scheme based on p-stable distributions , 2004, SCG '04.

[18]  Lars Schmidt-Thieme,et al.  MyMediaLite: a free recommender system library , 2011, RecSys '11.

[19]  Fionn Murtagh,et al.  Overcoming the Curse of Dimensionality in Clustering by Means of the Wavelet Transform , 2000, Comput. J..

[20]  Amos Azaria,et al.  Movie recommender system for profit maximization , 2013, AAAI.

[21]  Amine M. Boumaza,et al.  Stochastic search for global neighbors selection in collaborative filtering , 2012, SAC '12.