A Hybrid Multigroup Coclustering Recommendation Framework Based on Information Fusion

Collaborative Filtering (CF) is one of the most successful algorithms in recommender systems. However, it suffers from data sparsity and scalability problems. Although many clustering techniques have been incorporated to alleviate these two problems, most of them fail to achieve further significant improvement in recommendation accuracy. First of all, most of them assume each user or item belongs to a single cluster. Since usually users can hold multiple interests and items may belong to multiple categories, it is more reasonable to assume that users and items can join multiple clusters (groups), where each cluster is a subset of like-minded users and items they prefer. Furthermore, most of the clustering-based CF models only utilize historical rating information in the clustering procedure but ignore other data resources in recommender systems such as the social connections of users and the correlations between items. In this article, we propose HMCoC, a Hybrid Multigroup CoClustering recommendation framework, which can cluster users and items into multiple groups simultaneously with different information resources. In our framework, we first integrate information of user--item rating records, user social networks, and item features extracted from the DBpedia knowledge base. We then use an optimization method to mine meaningful user--item groups with all the information. Finally, we apply the conventional CF method in each cluster to make predictions. By merging the predictions from each cluster, we generate the top-n recommendations to the target users for return. Extensive experimental results demonstrate the superior performance of our approach in top-n recommendation in terms of MAP, NDCG, and F1 compared with other clustering-based CF models.

[1]  George Karypis,et al.  SLIM: Sparse Linear Methods for Top-N Recommender Systems , 2011, 2011 IEEE 11th International Conference on Data Mining.

[2]  Chun Chen,et al.  An exploration of improving collaborative recommender systems via user-item subgroups , 2012, WWW.

[3]  George Karypis,et al.  A Comprehensive Survey of Neighborhood-based Recommendation Methods , 2011, Recommender Systems Handbook.

[4]  Stuart E. Middleton,et al.  Ontology-based Recommender Systems , 2004, Handbook on Ontologies.

[5]  Daniel Dajun Zeng,et al.  Collaborative filtering in social tagging systems based on joint item-tag recommendations , 2010, CIKM.

[6]  Songjie Gong A Collaborative Filtering Recommendation Algorithm Based on User Clustering and Item Clustering , 2010, J. Softw..

[7]  Roberto Turrin,et al.  Performance of recommender algorithms on top-n recommendation tasks , 2010, RecSys '10.

[8]  Chung-Kon Shi,et al.  Exploring Movie Recommendation System Using Cultural Metadata , 2008, Trans. Edutainment.

[9]  Songjie Gong,et al.  An Efficient Collaborative Recommendation Algorithm Based on Item Clustering , 2010 .

[10]  Kenneth Wai-Ting Leung,et al.  CLR: a collaborative location recommendation framework based on co-clustering , 2011, SIGIR.

[11]  George Karypis,et al.  Item-based top-N recommendation algorithms , 2004, TOIS.

[12]  Markus Zanker,et al.  Linked open data to support content-based recommender systems , 2012, I-SEMANTICS '12.

[13]  Jun Wang,et al.  Unifying user-based and item-based collaborative filtering approaches by similarity fusion , 2006, SIGIR.

[14]  Panagiotis Symeonidis,et al.  Tag recommendations based on tensor dimensionality reduction , 2008, RecSys '08.

[15]  John Riedl,et al.  Application of Dimensionality Reduction in Recommender System - A Case Study , 2000 .

[16]  John F. Kolen,et al.  Reducing the time complexity of the fuzzy c-means algorithm , 2002, IEEE Trans. Fuzzy Syst..

[17]  Qiang Yang,et al.  Scalable collaborative filtering using cluster-based smoothing , 2005, SIGIR '05.

[18]  Fei Wang,et al.  Social contextual recommendation , 2012, CIKM.

[19]  Tommaso Di Noia,et al.  Top-N recommendations from implicit feedback leveraging linked open data , 2013, IIR.

[20]  Srujana Merugu,et al.  A scalable collaborative filtering framework based on co-clustering , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[21]  Jacek M. Leski,et al.  Towards a robust fuzzy clustering , 2003, Fuzzy Sets Syst..

[22]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[23]  John Riedl,et al.  Recommender Systems for Large-scale E-Commerce : Scalable Neighborhood Formation Using Clustering , 2002 .

[24]  Thomas Hofmann,et al.  Latent Class Models for Collaborative Filtering , 1999, IJCAI.

[25]  Hai Yang,et al.  ACM Transactions on Intelligent Systems and Technology - Special Section on Urban Computing , 2014 .

[26]  Thomas Hofmann,et al.  Latent semantic models for collaborative filtering , 2004, TOIS.

[27]  Yehuda Koren,et al.  Factorization meets the neighborhood: a multifaceted collaborative filtering model , 2008, KDD.

[28]  Tommaso Di Noia,et al.  Exploiting the web of data in model-based recommender systems , 2012, RecSys.

[29]  Inderjit S. Dhillon,et al.  Co-clustering documents and words using bipartite spectral graph partitioning , 2001, KDD '01.

[30]  Paolo Avesani,et al.  Trust-aware recommender systems , 2007, RecSys '07.

[31]  Martin Ester,et al.  A matrix factorization technique with trust propagation for recommendation in social networks , 2010, RecSys '10.

[32]  Bradley N. Miller,et al.  Using filtering agents to improve prediction quality in the GroupLens research collaborative filtering system , 1998, CSCW '98.

[33]  Ruslan Salakhutdinov,et al.  Probabilistic Matrix Factorization , 2007, NIPS.

[34]  Chun Chen,et al.  Music recommendation by unified hypergraph: combining social media information and music content , 2010, ACM Multimedia.

[35]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[36]  Martial Hebert,et al.  A spectral technique for correspondence problems using pairwise constraints , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[37]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[38]  J. MacDonald,et al.  Successive Approximations by the Rayleigh-Ritz Variation Method , 1933 .

[39]  Yi-Cheng Zhang,et al.  Personalized Recommendation via Integrated Diffusion on User-Item-Tag Tripartite Graphs , 2009, ArXiv.

[40]  Hsinchun Chen,et al.  A Comparison of Collaborative-Filtering Recommendation Algorithms for E-commerce , 2007, IEEE Intelligent Systems.

[41]  Chun Chen,et al.  Locally Discriminative Coclustering , 2012, IEEE Transactions on Knowledge and Data Engineering.

[42]  Xi Zhang,et al.  TopRec: domain-specific recommendation through community topic mining in social network , 2013, WWW '13.

[43]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.