Improving Collaborative Filtering’s Rating Prediction Coverage in Sparse Datasets through the Introduction of Virtual Near Neighbors

Collaborative filtering creates personalized recommendations by considering ratings entered by users. Collaborative filtering algorithms initially detect users whose likings are alike, by exploring the similarity between ratings that have insofar been submitted. Users having a high degree of similarity regarding their ratings are termed near neighbors, and in order to formulate a recommendation for a user, her near neighbors’ ratings are extracted and form the basis for the recommendation. Collaborative filtering algorithms however exhibit the problem commonly referred to as “gray sheep this pertains to the case where for some users no near neighbors can be identified, and hence no personalized recommendations can be computed. The “gray sheep” problem is more severe in sparse datasets, i.e. datasets where the number of ratings is small, compared to the number of items and users. In this paper, we address the “gray sheep” problem by introducing the concept of virtual near neighbors and a related algorithm for their creation on the basis of the existing ones. We evaluate the proposed algorithm, which is termed as CFVNN, using eight widely used datasets and considering two correlation metrics which are widely used in Collaborative Filtering research, namely the Pearson Correlation Coefficient and the Cosine Similarity. The results show that the proposed algorithm considerably leverages the capability of a Collaborative Filtering system to compute personalized recommendations in the context of sparse datasets, tackling thus efficiently the “gray sheep” problem. In parallel, the CFVNN algorithm achieves improvements in rating prediction quality, as this is expressed through the Mean Absolute Error and the Root Mean Square Error metrics.

[1]  Dionisis Margaris,et al.  Using Time Clusters for Following Users' Shifts in Rating Practices , 2017, Complex Syst. Informatics Model. Q..

[2]  Ricardo Dias,et al.  Improving Music Recommendation in Session-Based Collaborative Filtering by Using Temporal Context , 2013, 2013 IEEE 25th International Conference on Tools with Artificial Intelligence.

[3]  Panagiotis Georgiadis,et al.  Recommendation information diffusion in social networks considering user influence and semantics , 2016, Social Network Analysis and Mining.

[4]  Dionisis Margaris,et al.  Exploiting Rating Abstention Intervals for Addressing Concept Drift in Social Network Recommender Systems , 2018, Informatics.

[5]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[6]  Hans-Peter Kriegel,et al.  Ieee Transactions on Knowledge and Data Engineering Probabilistic Memory-based Collaborative Filtering , 2022 .

[7]  Matthias Jarke,et al.  A Clustering Approach for Collaborative Filtering Recommendation Using Social Network Analysis , 2011, J. Univers. Comput. Sci..

[8]  S. Ioannidis,et al.  Social media analysis during political turbulence , 2017, PloS one.

[9]  Carlos Delgado Kloos,et al.  A Collaborative Recommender System Based on Space-Time Similarities , 2010, IEEE Pervasive Computing.

[10]  C. Vassilakis,et al.  Experimental results for considering Virtual Near Neighbors in Collaborative Filtering’s Rating Prediction , 2019 .

[11]  Joemon M. Jose,et al.  Handling data sparsity in collaborative filtering using emotion and semantic based features , 2011, SIGIR.

[12]  Dionisis Margaris,et al.  Enhancing Rating Prediction Quality Through Improving the Accuracy of Detection of Shifts in Rating Practices , 2018, Trans. Large Scale Data Knowl. Centered Syst..

[13]  Panagiotis Georgiadis,et al.  A collaborative filtering algorithm with clustering for personalized web service selection in business processes , 2015, 2015 IEEE 9th International Conference on Research Challenges in Information Science (RCIS).

[14]  Rong Yan,et al.  Social influence in social advertising: evidence from field experiments , 2012, EC '12.

[15]  Panagiotis Georgiadis,et al.  An integrated framework for QoS-based adaptation and exception resolution in WS-BPEL scenarios , 2013, SAC '13.

[16]  Sotiris Ioannidis,et al.  Investigating the complete corpus of referendum and elections tweets , 2016, 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[17]  Dionisis Margaris,et al.  Improving collaborative filtering's rating prediction quality in dense datasets, by pruning old ratings , 2017, 2017 IEEE Symposium on Computers and Communications (ISCC).

[18]  Anton van den Hengel,et al.  Image-Based Recommendations on Styles and Substitutes , 2015, SIGIR.

[19]  Yoav Shoham,et al.  Fab: content-based, collaborative recommendation , 1997, CACM.

[20]  Dionisis Margaris,et al.  Enhancing User Rating Database Consistency Through Pruning , 2017, Trans. Large Scale Data Knowl. Centered Syst..

[21]  Mehrbakhsh Nilashi,et al.  Collaborative filtering recommender systems , 2013 .

[22]  Panagiotis Georgiadis,et al.  Query personalization using social network information and collaborative filtering techniques , 2018, Future Gener. Comput. Syst..

[23]  Dionisis Margaris,et al.  Exploiting Internet of Things information to enhance venues’ recommendation accuracy , 2017, Service Oriented Computing and Applications.

[24]  Panagiotis Georgiadis,et al.  Knowledge-Based Leisure Time Recommendations in Social Networks , 2017 .

[25]  A KonstanJoseph,et al.  The MovieLens Datasets , 2015 .

[26]  F. Maxwell Harper,et al.  The MovieLens Datasets: History and Context , 2016, TIIS.

[27]  Dionisis Margaris,et al.  Improving Collaborative Filtering's Rating Prediction Coverage in Sparse Datasets by Exploiting User Dissimilarity , 2018, 2018 IEEE 16th Intl Conf on Dependable, Autonomic and Secure Computing, 16th Intl Conf on Pervasive Intelligence and Computing, 4th Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech).

[28]  Dionisis Margaris,et al.  Pruning and aging for user histories in collaborative filtering , 2016, 2016 IEEE Symposium Series on Computational Intelligence (SSCI).

[29]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[30]  Isabelle Tellier,et al.  Reducing the Cold-Start Problem in Content Recommendation through Opinion Classification , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[31]  Konstantinos G. Margaritis,et al.  A Hybrid Approach for Improving Prediction Coverage of Collaborative Filtering , 2009, AIAI.

[32]  Dionisis Margaris,et al.  Improving Collaborative Filtering's Rating Prediction Accuracy by Considering Users' Rating Variability , 2018, 2018 IEEE 16th Intl Conf on Dependable, Autonomic and Secure Computing, 16th Intl Conf on Pervasive Intelligence and Computing, 4th Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech).

[33]  Panagiotis Georgiadis,et al.  Adapting WS-BPEL scenario execution using collaborative filtering techniques , 2013, IEEE 7th International Conference on Research Challenges in Information Science (RCIS).

[34]  Dionisis Margaris,et al.  Improving Collaborative Filtering's Rating Prediction Quality by Considering Shifts in Rating Practices , 2017, 2017 IEEE 19th Conference on Business Informatics (CBI).