Performance Comparison of Clustering Algorithm Based Collaborative Filtering Recommendation System

Importance of recommendation Systems (RS), based on collaborative filtering, is escalating with exponential growth of e-commerce application, e.g., on-line shopping, internet movie and music sites, tourism, news portals, to name a few. The target of recommendation system is to predict user preferences based on their previous activities, and associating users of similar behavior. Of the two main approaches, Content Based (CB) and Collaborative Filtering (CF), CF is increasingly popular because there is no need of domain knowledge and it scales well for large datasets. CF uses archives, like user-item purchase data or user-movie review data, which are large and sparse. Traditional approaches using matrix factorization like singular valued decomposition (SVD), is inefficient, for very a large review data. It is more efficient to cluster item-vectors, where elements of an item-vector are entries from users. As the dimension of an item-vector is large and elements are sparse, conventional clustering algorithms fails. We created item-item adjacency matrix, where the elements are similarity between item-vectors. There are various clustering algorithms that work directly on the adjacency matrix. We used spectral clustering, K-means++, Agglomerative Clustering. Considering item-vectors as nodes and adjacency matrix elements as link weights, we performed graph-clustering using Louvain Algorithm, to discover groups. We used synthetic data, added different levels of noise and run different algorithms to compare their accuracies to restore the original data from the noisy one. Louvain algorithm and Spectral clustering could achieve the highest accuracies.

[1]  Liu Chao,et al.  A new clustering method for collaborative filtering , 2010, 2010 International Conference on Networking and Information Technology.

[2]  Xiangliang Zhang,et al.  Clustering Recommenders in Collaborative Filtering Using Explicit Trust Information , 2011, IFIPTM.

[3]  Dongre Deepak Mahapatrav,et al.  LARS*: An Efficient and Scalable Location-Aware Recommender System , 2017 .

[4]  Duen-Ren Liu,et al.  Hybrid approaches to product recommendation based on customer lifetime value and purchase preferences , 2005, J. Syst. Softw..

[5]  Panagiotis Symeonidis,et al.  MusicBox: Personalized Music Recommendation Based on Cubic Analysis of Social Tags , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[6]  Yongmoo Suh,et al.  A new similarity function for selecting neighbors for each target item in collaborative filtering , 2013, Knowl. Based Syst..

[7]  Anand Shanker Tewari Generating Items Recommendations by Fusing Content and User-Item based Collaborative Filtering , 2020 .

[8]  José Juan Pazos-Arias,et al.  Exploring synergies between content-based filtering and Spreading Activation techniques in knowledge-based recommender systems , 2011, Inf. Sci..

[9]  SongJie Gong,et al.  Mining User Interest Change for Improving Collaborative Filtering , 2008, 2008 Second International Symposium on Intelligent Information Technology Application.

[10]  Zafar Ali,et al.  Recommender Systems: Issues, Challenges, and Research Opportunities , 2016 .

[11]  John Riedl,et al.  Recommender Systems for Large-scale E-Commerce : Scalable Neighborhood Formation Using Clustering , 2002 .

[12]  H. Edelsbrunner,et al.  Efficient algorithms for agglomerative hierarchical clustering methods , 1984 .

[13]  F. O. Isinkaye,et al.  Recommendation systems: Principles, methods and evaluation , 2015 .

[14]  Ahmed Eldawy,et al.  LARS*: An Efficient and Scalable Location-Aware Recommender System , 2014, IEEE Transactions on Knowledge and Data Engineering.

[15]  Enrique Herrera-Viedma,et al.  A google wave-based fuzzy recommender system to disseminate information in University Digital Libraries 2.0 , 2011, Inf. Sci..

[16]  Sergei Vassilvitskii,et al.  k-means++: the advantages of careful seeding , 2007, SODA '07.

[17]  Konstantinos G. Margaritis,et al.  Using SVD and demographic data for the enhancement of generalized Collaborative Filtering , 2007, Inf. Sci..

[18]  Fernando Ortega,et al.  A balanced memory‐based collaborative filtering similarity measure , 2012, Int. J. Intell. Syst..

[19]  Ali Selamat,et al.  A scalable collaborative recommender algorithm based on user density-based clustering , 2011, The 3rd International Conference on Data Mining and Intelligent Information Technology Applications.

[20]  Xiao Ma,et al.  An explicit trust and distrust clustering based collaborative filtering recommendation approach , 2017, Electron. Commer. Res. Appl..

[21]  Greg Linden,et al.  Amazon . com Recommendations Item-to-Item Collaborative Filtering , 2001 .

[22]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[23]  Sung-Bong Yang,et al.  An Effective Recommendation Algorithm for Clustering-Based Recommender Systems , 2005, Australian Conference on Artificial Intelligence.

[24]  Johanne Saint-Charles,et al.  Predicting semantic preferences in a socio-semantic system with collaborative filtering: A case study , 2020, Int. J. Inf. Manag..

[25]  Enrique Herrera-Viedma,et al.  A multi-disciplinar recommender system to advice research resources in University Digital Libraries , 2009, Expert Syst. Appl..

[26]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[27]  Rafael Valencia-García,et al.  Solving the cold-start problem in recommender systems with social tags , 2010, Expert Syst. Appl..

[28]  Chun Chen,et al.  Using rich social media information for music recommendation via hypergraph model , 2011, TOMCCAP.

[29]  J. Bobadilla,et al.  Recommender systems survey , 2013, Knowl. Based Syst..

[30]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[31]  Ken Lang,et al.  NewsWeeder: Learning to Filter Netnews , 1995, ICML.

[32]  Yong Wang,et al.  A Novel K-medoids clustering recommendation algorithm based on probability distribution for collaborative filtering , 2019, Knowl. Based Syst..