A new similarity measure using Bhattacharyya coefficient for collaborative filtering in sparse data

Collaborative filtering (CF) is the most successful approach for personalized product or service recommendations. Neighborhood based collaborative filtering is an important class of CF, which is simple, intuitive and efficient product recommender system widely used in commercial domain. Typically, neighborhood-based CF uses a similarity measure for finding similar users to an active user or similar products on which she rated. Traditional similarity measures utilize ratings of only co-rated items while computing similarity between a pair of users. Therefore, these measures are not suitable in a sparse data. In this paper, we propose a similarity measure for neighborhood based CF, which uses all ratings made by a pair of users. Proposed measure finds importance of each pair of rated items by exploiting Bhattacharyya similarity. To show effectiveness of the measure, we compared performances of neighborhood based CFs using state-of-the-art similarity measures with the proposed measured based CF. Recommendation results on a set of real data show that proposed measure based CF outperforms existing measures based CFs in various evaluation metrics.

[1]  Bradley N. Miller,et al.  MovieLens unplugged: experiences with an occasionally connected recommender system , 2003, IUI '03.

[2]  Frank Nielsen,et al.  The Burbea-Rao and Bhattacharyya Centroids , 2010, IEEE Transactions on Information Theory.

[3]  John Riedl,et al.  Collaborative Filtering Recommender Systems , 2011, Found. Trends Hum. Comput. Interact..

[4]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[5]  Yehuda Koren,et al.  Factor in the neighbors: Scalable and accurate collaborative filtering , 2010, TKDD.

[6]  Michael J. Pazzani,et al.  Adaptive interfaces for ubiquitous web access , 2002, CACM.

[7]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[8]  Hui Tian,et al.  A new user similarity model to improve the accuracy of collaborative filtering , 2014, Knowl. Based Syst..

[9]  Michael J. Pazzani,et al.  Learning Collaborative Information Filters , 1998, ICML.

[10]  Ville Ollikainen,et al.  Exploiting Bhattacharyya Similarity Measure to Diminish User Cold-Start Problem in Sparse Data , 2014, Discovery Science.

[11]  Yongmoo Suh,et al.  A new similarity function for selecting neighbors for each target item in collaborative filtering , 2013, Knowl. Based Syst..

[12]  Thomas Hofmann,et al.  Latent semantic models for collaborative filtering , 2004, TOIS.

[13]  John Riedl,et al.  GroupLens: an open architecture for collaborative filtering of netnews , 1994, CSCW '94.

[14]  Kamal Kant Bharadwaj,et al.  Utilizing various sparsity measures for enhancing accuracy of collaborative recommender systems based on local and global similarities , 2011, Expert Syst. Appl..

[15]  George Karypis,et al.  A Comprehensive Survey of Neighborhood-based Recommendation Methods , 2011, Recommender Systems Handbook.

[16]  T. Kailath The Divergence and Bhattacharyya Distance Measures in Signal Selection , 1967 .

[17]  Fernando Ortega,et al.  A collaborative filtering similarity measure based on singularities , 2012, Inf. Process. Manag..

[18]  Ruimin Shen,et al.  A collaborative filtering framework based on both local user similarity and global user similarity , 2008, Machine Learning.

[19]  Jesús Bobadilla,et al.  A new collaborative filtering metric that improves the behavior of recommender systems , 2010, Knowl. Based Syst..

[20]  Dorin Comaniciu,et al.  Real-time tracking of non-rigid objects using mean shift , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[21]  George Karypis,et al.  Evaluation of Item-Based Top-N Recommendation Algorithms , 2001, CIKM '01.

[22]  Ken Lang,et al.  NewsWeeder: Learning to Filter Netnews , 1995, ICML.

[23]  Zuhua Jiang,et al.  Collaborative filtering based on workflow space , 2009, Expert Syst. Appl..

[24]  Paul Resnick,et al.  Recommender systems , 1997, CACM.

[25]  Zuhua Jiang,et al.  Distributed recommender for peer-to-peer knowledge sharing , 2010, Inf. Sci..

[26]  Zuhua Jiang,et al.  Recommender system based on workflow , 2009, Decis. Support Syst..

[27]  J. Bobadilla,et al.  Recommender systems survey , 2013, Knowl. Based Syst..

[28]  Mukkai S. Krishnamoorthy,et al.  A random walk method for alleviating the sparsity problem in collaborative filtering , 2008, RecSys '08.

[29]  Pattie Maes,et al.  Social information filtering: algorithms for automating “word of mouth” , 1995, CHI '95.

[30]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[31]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[32]  Fernando Ortega,et al.  A collaborative filtering approach to mitigate the new user cold start problem , 2012, Knowl. Based Syst..

[33]  Taghi M. Khoshgoftaar,et al.  A Survey of Collaborative Filtering Techniques , 2009, Adv. Artif. Intell..

[34]  Anil K. Jain ON AN ESTIMATE OF THE BHATTACHARYYA DISTANCE. , 1976 .

[35]  Neil A. Thacker,et al.  The Bhattacharyya metric as an absolute similarity measure for frequency coded data , 1998, Kybernetika.

[36]  Greg Linden,et al.  Amazon . com Recommendations Item-to-Item Collaborative Filtering , 2001 .

[37]  Hyung Jun Ahn,et al.  A new similarity measure for collaborative filtering to alleviate the new user cold-starting problem , 2008, Inf. Sci..

[38]  Abdulmotaleb El-Saddik,et al.  Collaborative error-reflected models for cold-start recommender systems , 2011, Decis. Support Syst..

[39]  Michael J. Pazzani,et al.  Learning and Revising User Profiles: The Identification of Interesting Web Sites , 1997, Machine Learning.

[40]  Yehuda Koren,et al.  Factorization meets the neighborhood: a multifaceted collaborative filtering model , 2008, KDD.

[41]  Kamal Ali,et al.  TiVo: making show recommendations using a distributed collaborative filtering architecture , 2004, KDD.