An entropy-based neighbor selection approach for collaborative filtering

Collaborative filtering is an emerging technology to deal with information overload problem guiding customers by offering recommendations on products of possible interest. Forming neighborhood of a user/item is the crucial part of the recommendation process. Traditional collaborative filtering algorithms solely utilize entity similarities in order to form neighborhoods. In this paper, we introduce a novel entropy-based neighbor selection approach which focuses on measuring uncertainty of entity vectors. Such uncertainty can be interpreted as how a user perceives rating domain to distinguish her tastes or diversification of items' rating distributions. The proposed method takes similarities into account along with such uncertainty values and it solves the optimization problem of gathering the most similar entities with minimum entropy difference within a neighborhood. Described optimization problem can be considered as combinatorial optimization and it is similar to 0-1 knapsack problem. We perform benchmark data sets-based experiments in order to compare our method's accuracy with the conventional user- and item-based collaborative filtering algorithms. We also investigate integration of our method with some of previously introduced studies. Empirical outcomes substantiate that the proposed method significantly improves recommendation accuracy of traditional collaborative filtering algorithms and it is possible to combine the entropy-based method with other compatible works introducing new similarity measures or novel neighbor selection methodologies.

[1]  Hyung Jun Ahn,et al.  A new similarity measure for collaborative filtering to alleviate the new user cold-starting problem , 2008, Inf. Sci..

[2]  David M. Pennock,et al.  Collaborative filtering with maximum entropy , 2004, IEEE Intelligent Systems.

[3]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[4]  Alejandro Bellogín,et al.  Predicting Neighbor Goodness in Collaborative Filtering , 2009, FQAS.

[5]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[6]  Hidetomo Ichihashi,et al.  Collaborative Filtering Using Principal Component Analysis and Fuzzy Clustering , 2001, Web Intelligence.

[7]  Juan M. Fernández-Luna,et al.  A New Criteria for Selecting Neighborhood in Memory-Based Recommender Systems , 2011, CAEPIA.

[8]  Yehuda Koren,et al.  Factor in the neighbors: Scalable and accurate collaborative filtering , 2010, TKDD.

[9]  Huseyin Polat,et al.  Privacy-Preserving Trust-Based Recommendations on Vertically Distributed Data , 2011, 2011 IEEE Fifth International Conference on Semantic Computing.

[10]  Takeo Kanade,et al.  Maximum Entropy for Collaborative Filtering , 2004, UAI.

[11]  Carlo Vercellis,et al.  Stochastic on-line knapsack problems , 1995, Math. Program..

[12]  David J. Miller,et al.  A Maximum Entropy Approach for Collaborative Filtering , 2004, J. VLSI Signal Process..

[13]  John Riedl,et al.  An Empirical Analysis of Design Choices in Neighborhood-Based Collaborative Filtering Algorithms , 2002, Information Retrieval.

[14]  Kamal Kant Bharadwaj,et al.  Utilizing various sparsity measures for enhancing accuracy of collaborative recommender systems based on local and global similarities , 2011, Expert Syst. Appl..

[15]  Jun Guo,et al.  An Approach of Selecting Right Neighbors for Collaborative Filtering , 2009, 2009 Fourth International Conference on Innovative Computing, Information and Control (ICICIC).

[16]  Francesco Ricci,et al.  Locally Adaptive Neighborhood Selection for Collaborative Filtering Recommendations , 2008, AH.

[17]  Sung-Bong Yang,et al.  An Effective Threshold-Based Neighbor Selection in Collaborative Filtering , 2007, ECIR.

[18]  Amine M. Boumaza,et al.  Stochastic search for global neighbors selection in collaborative filtering , 2012, SAC '12.

[19]  Ulrich Pferschy,et al.  Dynamic Programming Revisited: Improving Knapsack Algorithms , 1999, Computing.

[20]  Denis Trystram,et al.  An efficient parallel algorithm for solving the Knapsack problem on hypercubes , 2004, J. Parallel Distributed Comput..

[21]  Fernando Ortega,et al.  A collaborative filtering similarity measure based on singularities , 2012, Inf. Process. Manag..

[22]  David M. Pennock,et al.  A Maximum Entropy Approach to Collaborative Filtering in Dynamic, Sparse, High-Dimensional Domains , 2002, NIPS.

[23]  Kyong Joo Oh,et al.  The collaborative filtering recommendation based on SOM cluster-indexing CBR , 2003, Expert Syst. Appl..

[24]  Huseyin Polat,et al.  A comparison of clustering-based privacy-preserving collaborative filtering schemes , 2013, Appl. Soft Comput..

[25]  Amnon Meisels,et al.  Recommender System from Personal Social Networks , 2007, AWIC.

[26]  Mei-Hua Hsu,et al.  Proposing an ESL recommender teaching and learning system , 2008, Expert Syst. Appl..

[27]  Jianhua Wu,et al.  Solving 0-1 knapsack problem by a novel global harmony search algorithm , 2011, Appl. Soft Comput..

[28]  Gang Yu,et al.  JacUOD: A New Similarity Measurement for Collaborative Filtering , 2012, Journal of Computer Science and Technology.

[29]  Qiang Yang,et al.  Scalable collaborative filtering using cluster-based smoothing , 2005, SIGIR '05.

[30]  Jing Zhao,et al.  Research on entropy-based collaborative filtering algorithm and personalized recommendation in e-commerce , 2009, Service Oriented Computing and Applications.

[31]  Huseyin Polat,et al.  A scalable privacy-preserving recommendation scheme via bisecting k-means clustering , 2013, Inf. Process. Manag..

[32]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[33]  Yongmoo Suh,et al.  A new similarity function for selecting neighbors for each target item in collaborative filtering , 2013, Knowl. Based Syst..