Improving Collaborative Filtering's Rating Prediction Coverage in Sparse Datasets by Exploiting User Dissimilarity

Collaborative filtering systems analyze the ratings databases to identify users with similar likings and preferences, termed as near neighbors, and then generate rating predictions for a user by examining the ratings of his near neighbors for items that the user has not yet rated; based on rating predictions, recommendations are then formulated. However, these systems are known to exhibit the "gray sheep" problem, i.e. the situation where no near neighbors can be identified for a number of users, and hence no recommendation can be formulated for them. This problem is more intense in sparse datasets, i.e. datasets with relatively small number of ratings, compared to the number of users and items. In this work, we propose a method for alleviating this problem by exploiting user dissimilarity, under the assumption that if some users have exhibited opposing preferences in the past, they are likely to do so in the future. The proposed method has been evaluated against seven widely used datasets and has been proven to be particularly effective in increasing the percentage of users for which personalized recommendations can be formulated in the context of sparse datasets, while at the same time maintaining or slightly improving rating prediction quality.

[1]  Hans-Peter Kriegel,et al.  Ieee Transactions on Knowledge and Data Engineering Probabilistic Memory-based Collaborative Filtering , 2022 .

[2]  Panagiotis Georgiadis,et al.  Query personalization using social network information and collaborative filtering techniques , 2018, Future Gener. Comput. Syst..

[3]  Qiang Yang,et al.  One-Class Collaborative Filtering , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[4]  Panagiotis Georgiadis,et al.  Recommendation information diffusion in social networks considering user influence and semantics , 2016, Social Network Analysis and Mining.

[5]  Lada A. Adamic,et al.  The role of social networks in information diffusion , 2012, WWW.

[6]  Charu C. Aggarwal,et al.  Neighborhood-Based Collaborative Filtering , 2016 .

[7]  Konstantinos G. Margaritis,et al.  A Hybrid Approach for Improving Prediction Coverage of Collaborative Filtering , 2009, AIAI.

[8]  Panagiotis Georgiadis,et al.  Knowledge-Based Leisure Time Recommendations in Social Networks , 2017 .

[9]  Ricardo Dias,et al.  Improving Music Recommendation in Session-Based Collaborative Filtering by Using Temporal Context , 2013, 2013 IEEE 25th International Conference on Tools with Artificial Intelligence.

[10]  Patrick Seemann,et al.  Matrix Factorization Techniques for Recommender Systems , 2014 .

[11]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[12]  Robin D. Burke,et al.  Hybrid Recommender Systems: Survey and Experiments , 2002, User Modeling and User-Adapted Interaction.

[13]  Dan Frankowski,et al.  Collaborative Filtering Recommender Systems , 2007, The Adaptive Web.

[14]  John Riedl,et al.  Collaborative Filtering Recommender Systems , 2011, Found. Trends Hum. Comput. Interact..

[15]  Anton van den Hengel,et al.  Image-Based Recommendations on Styles and Substitutes , 2015, SIGIR.

[16]  Abhinandan Das,et al.  Google news personalization: scalable online collaborative filtering , 2007, WWW '07.

[17]  Dionisis Margaris,et al.  Pruning and aging for user histories in collaborative filtering , 2016, 2016 IEEE Symposium Series on Computational Intelligence (SSCI).

[18]  Songjie Gong A Collaborative Filtering Recommendation Algorithm Based on User Clustering and Item Clustering , 2010, J. Softw..

[19]  F. Maxwell Harper,et al.  The MovieLens Datasets: History and Context , 2016, TIIS.

[20]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[21]  Yoav Shoham,et al.  Fab: content-based, collaborative recommendation , 1997, CACM.

[22]  Dionisis Margaris,et al.  Enhancing User Rating Database Consistency Through Pruning , 2017, Trans. Large Scale Data Knowl. Centered Syst..

[23]  Taghi M. Khoshgoftaar,et al.  A Survey of Collaborative Filtering Techniques , 2009, Adv. Artif. Intell..

[24]  Isabelle Tellier,et al.  Reducing the Cold-Start Problem in Content Recommendation through Opinion Classification , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[25]  Joemon M. Jose,et al.  Handling data sparsity in collaborative filtering using emotion and semantic based features , 2011, SIGIR.

[26]  Panagiotis Georgiadis,et al.  A collaborative filtering algorithm with clustering for personalized web service selection in business processes , 2015, 2015 IEEE 9th International Conference on Research Challenges in Information Science (RCIS).

[27]  Jure Leskovec,et al.  Inferring Networks of Substitutable and Complementary Products , 2015, KDD.