A Comparative Study of Different Similarity Metrics in Highly Sparse Rating Dataset

Recommender system has been popularly used for recommending products and services to the online buyers and users. Collaborative Filtering (CF) is one of the most popular filtering approaches used to find the preferences of users for the recommendation. CF works on the ratings given by the users for a particular item. It predicts the rating that is not explicitly given for any item and build the recommendation list for a particular user. Different similarity metrics and prediction approaches are used for this purpose. But these metrics and approaches have some issues in dealing with highly sparse datasets. In this paper, we sought to find the most accurate combinations of similarity metrics and prediction approaches for both user and item similarity based CF. In this comparative study, we deliberately instill sparsity of different magnitudes (10, 20, 30 and 40%) by deleting given ratings in an existing dataset. We then predict the deleted ratings using different combinations of similarity metrics and prediction approach. We assessed the accuracy of the prediction with the help of two evaluation metrics (MAE and RMSE).

[1]  Zhigang Luo,et al.  A novel approach for collaborative filtering to alleviate the new item cold-start problem , 2011, 2011 11th International Symposium on Communications & Information Technologies (ISCIT).

[2]  Sutheera Puntheeranurak,et al.  An Item-based collaborative filtering method using Item-based hybrid similarity , 2011, 2011 IEEE 2nd International Conference on Software Engineering and Service Science.

[3]  Jin Xu,et al.  Dictionary Learning Based on Laplacian Score in Sparse Coding , 2011, MLDM.

[4]  Fernando Ortega,et al.  Collaborative filtering based on significances , 2012, Inf. Sci..

[5]  Zibin Zheng,et al.  Predicting Quality of Service for Selection by Neighborhood-Based Collaborative Filtering , 2013, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[6]  Cihan Kaleli,et al.  A multi-criteria item-based collaborative filtering framework , 2014, 2014 11th International Joint Conference on Computer Science and Software Engineering (JCSSE).

[7]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[8]  Rong Hu,et al.  A Hybrid User and Item-Based Collaborative Filtering with Smoothing on Sparse Data , 2006, 16th International Conference on Artificial Reality and Telexistence--Workshops (ICAT'06).

[9]  Sahin Albayrak,et al.  User-centric evaluation of a K-furthest neighbor collaborative filtering recommender algorithm , 2013, CSCW.

[10]  Martin Szomszor,et al.  Comparison of implicit and explicit feedback from an online music recommendation service , 2010, HetRec '10.

[11]  Ludovico Boratto,et al.  Using Collaborative Filtering to Overcome the Curse of Dimensionality when Clustering Users in a Group Recommender System , 2014, ICEIS.

[12]  Hui Tian,et al.  A new user similarity model to improve the accuracy of collaborative filtering , 2014, Knowl. Based Syst..

[13]  John Riedl,et al.  Application of Dimensionality Reduction in Recommender System - A Case Study , 2000 .

[14]  Jason J. Jung,et al.  Item-Based Collaborative Filtering with Attribute Correlation: A Case Study on Movie Recommendation , 2014, ACIIDS.

[15]  Miguel Ángel García Cumbreras,et al.  Pessimists and optimists: Improving collaborative filtering through sentiment analysis , 2013, Expert Syst. Appl..

[16]  Yang Guo,et al.  A survey of collaborative filtering based social recommender systems , 2014, Comput. Commun..

[17]  Tansel Özyer,et al.  A Movie Rating Prediction Algorithm with Collaborative Filtering , 2010, 2010 International Conference on Advances in Social Networks Analysis and Mining.

[18]  Debajyoti Mukhopadhyay,et al.  An Item-Based Collaborative Filtering using Dimensionality Reduction Techniques on Mahout Framework , 2015, ArXiv.

[19]  John Riedl,et al.  An Empirical Analysis of Design Choices in Neighborhood-Based Collaborative Filtering Algorithms , 2002, Information Retrieval.

[20]  John Riedl,et al.  An Algorithmic Framework for Performing Collaborative Filtering , 1999, SIGIR Forum.