Alleviating the data sparsity problem of recommender systems by clustering nodes in bipartite networks

Abstract Recommender systems help users to find information that fits their preferences in an overloaded search space. Collaborative filtering systems suffer from increasingly severe data sparsity problem because more and more products are sold in commercial websites, which largely constrains the performance of recommendation algorithms. User clustering has already been applied to recommendation on sparse data in the literature, but in a completely different way. In most existing works, user clustering is directly used to identify the similar users of the target user to whom we want to make recommendation. More specifically, the users who are clustered in the same group of the target user are considered as similar users. However, in this paper we use user clustering to reconstruct the user-item bipartite network such that the network density is significantly improved. The recommendation made on this dense network thus can achieve much higher accuracy than on the original sparse network. The experimental results on three benchmark data sets demonstrate that, when facing the problem of data sparsity, our proposed recommendation algorithm based on node clustering achieves a significant improvement in accuracy and coverage of recommendation.

[1]  Hsinchun Chen,et al.  Applying associative retrieval techniques to alleviate the sparsity problem in collaborative filtering , 2004, TOIS.

[2]  Sophie Ahrens,et al.  Recommender Systems , 2012 .

[3]  Guoliang Li,et al.  PASS-JOIN: A Partition-based Method for Similarity Joins , 2011, Proc. VLDB Endow..

[4]  Manoj Kumar Tiwari,et al.  A noise correction-based approach to support a recommender system in a highly sparse rating environment , 2019, Decis. Support Syst..

[5]  Fernando Ortega,et al.  Robust Model-Based Reliability Approach to Tackle Shilling Attacks in Collaborative Filtering Recommender Systems , 2019, IEEE Access.

[6]  Laura Ferreira,et al.  A Comparison of Hierarchical Methods for Clustering Functional Data , 2009, Commun. Stat. Simul. Comput..

[7]  Yi-Cheng Zhang,et al.  Bipartite network projection and personal recommendation. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  Zhang Yu-fang Collaborative filtering recommendation algorithm based on item classification and cloud model , 2012 .

[9]  Fuguo Zhang,et al.  Improving information filtering via network manipulation , 2012, ArXiv.

[10]  Fei Wu,et al.  Modeling recommender systems via weighted bipartite network , 2017, Concurr. Comput. Pract. Exp..

[11]  Luis Martínez,et al.  Fuzzy Tools in Recommender Systems: A Survey , 2017, Int. J. Comput. Intell. Syst..

[12]  Wei Wang,et al.  Recommender system application developments: A survey , 2015, Decis. Support Syst..

[13]  Mahdi Jalili,et al.  Recommender systems based on collaborative filtering and resource allocation , 2014, Social Network Analysis and Mining.

[14]  Paul Resnick,et al.  Recommender systems , 1997, CACM.

[15]  Tao Zhou,et al.  Negative ratings play a positive role in information filtering , 2011 .

[16]  Wolfgang Nejdl,et al.  Preventing shilling attacks in online recommender systems , 2005, WIDM '05.

[17]  Xingyuan Li Collaborative filtering recommendation algorithm based on cluster , 2011, Proceedings of 2011 International Conference on Computer Science and Network Technology.

[18]  Yehuda Koren,et al.  Advances in Collaborative Filtering , 2011, Recommender Systems Handbook.

[19]  Tao Zhou,et al.  Relevance is more significant than correlation: Information filtering on sparse data , 2009 .

[20]  Paolo Avesani,et al.  Trust-Aware Collaborative Filtering for Recommender Systems , 2004, CoopIS/DOA/ODBASE.

[21]  Fei Yu,et al.  Network-based recommendation algorithms: A review , 2015, Physica A: Statistical Mechanics and its Applications.

[22]  Yi-Cheng Zhang,et al.  Effect of initial configuration on network-based recommendation , 2007, 0711.2506.

[23]  Luis M. de Campos,et al.  Using past-prediction accuracy in recommender systems , 2012, Inf. Sci..

[24]  Philipp Koehn,et al.  Pharaoh: A Beam Search Decoder for Phrase-Based Statistical Machine Translation Models , 2004, AMTA.

[25]  Zahra Yusefi Hafshejani,et al.  Improving sparsity and new user problems in collaborative filtering by clustering the personality factors , 2018, Electron. Commer. Res..

[26]  J. Bobadilla,et al.  Recommender systems survey , 2013, Knowl. Based Syst..

[27]  Tinghuai Ma,et al.  Social Network and Tag Sources Based Augmenting Collaborative Recommender System , 2015, IEICE Trans. Inf. Syst..

[28]  Jiming Liu,et al.  Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence Social Collaborative Filtering by Trust , 2022 .

[29]  Paul D. McNicholas,et al.  Cluster-weighted $$t$$t-factor analyzers for robust model-based clustering and dimension reduction , 2015, Stat. Methods Appl..

[30]  Kamal Kant Bharadwaj,et al.  Utilizing various sparsity measures for enhancing accuracy of collaborative recommender systems based on local and global similarities , 2011, Expert Syst. Appl..

[31]  Yang Sok Kim,et al.  An empirical study on the effect of data sparsity and data overlap on cross domain collaborative filtering performance , 2017, Expert Syst. Appl..

[32]  Wang Bing-hong,et al.  Overview of the Evaluated Algorithms for the Personal Recommendation Systems , 2009 .

[33]  John Riedl,et al.  Application of Dimensionality Reduction in Recommender System - A Case Study , 2000 .

[34]  Jie Liu,et al.  Personal Recommendation Based on Weighted Bipartite Networks , 2009, 2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery.

[35]  Fabian Lecron,et al.  Weighting strategies for a recommender system using item clustering based on genres , 2017, Expert Syst. Appl..

[36]  Mehrbakhsh Nilashi,et al.  A recommender system based on collaborative filtering using ontology and dimensionality reduction techniques , 2018, Expert Syst. Appl..

[37]  Chen Honghui,et al.  User-based Clustering with Top-N Recommendation on Cold-Start Problem , 2013, 2013 Third International Conference on Intelligent System Design and Engineering Applications.

[38]  Qiang Yang,et al.  Scalable collaborative filtering using cluster-based smoothing , 2005, SIGIR '05.

[39]  Hui Tian,et al.  A new user similarity model to improve the accuracy of collaborative filtering , 2014, Knowl. Based Syst..

[40]  Yi-Cheng Zhang,et al.  Solving the apparent diversity-accuracy dilemma of recommender systems , 2008, Proceedings of the National Academy of Sciences.

[41]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.