Trustworthy Collaborative Filtering through Downweighting Noise and Redundancy

Proliferation of Electronic Commerce (EC) has revolutionized the way people purchase online. Web-based technologies enable people to more actively interact with merchants and service providers. Such purchasing logs and comments further lead to proliferation of recommender systems. Existing recommendation algorithms exploit either prior transactions or customer reviews to predict user interests towards certain items. Vast noise may be introduced into such information by fake raters, and information redundancy also makes recommender system entangled. In this work, we first examine user reviews and prior transactions to estimate user credibility and item importance to reduce effect from content polluters. Then we propose to alleviate the redundant information from homogeneous users based on link analysis. A unified framework is finally proposed to incorporate them in a mathematical formulation, which can be efficiently optimized. Experimental results on real world data reveal that our model can significantly outperform other baselines.

[1]  Michael R. Lyu,et al.  Probabilistic factor models for web site recommendation , 2011, SIGIR.

[2]  Akhmed Umyarov,et al.  Improving Collaborative Filtering Recommendations Using External Data , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[3]  Xuan Li,et al.  Robust Nonnegative Matrix Factorization via Half-Quadratic Minimization , 2012, 2012 IEEE 12th International Conference on Data Mining.

[4]  Michael R. Lyu,et al.  Effective missing data prediction for collaborative filtering , 2007, SIGIR.

[5]  Yehuda Koren,et al.  Factorization meets the neighborhood: a multifaceted collaborative filtering model , 2008, KDD.

[6]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Leon Danon,et al.  Comparing community structure identification , 2005, cond-mat/0505245.

[8]  Stuart E. Middleton,et al.  Ontological user profiling in recommender systems , 2004, TOIS.

[9]  Jure Leskovec,et al.  Supervised random walks: predicting and recommending links in social networks , 2010, WSDM '11.

[10]  Huan Liu,et al.  Social Spammer Detection in Microblogging , 2013, IJCAI.

[11]  Peter Brusilovsky,et al.  Open user profiles for adaptive news systems: help or harm? , 2007, WWW '07.

[12]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[13]  Bing Liu,et al.  Review spam detection , 2007, WWW '07.

[14]  Deepak Agarwal,et al.  Generalizing matrix factorization through flexible regression priors , 2011, RecSys '11.

[15]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[16]  Hsinchun Chen,et al.  Applying associative retrieval techniques to alleviate the sparsity problem in collaborative filtering , 2004, TOIS.

[17]  Shuang-Hong Yang,et al.  Functional matrix factorizations for cold-start recommendation , 2011, SIGIR.

[18]  Juan-Zi Li,et al.  Typicality-Based Collaborative Filtering Recommendation , 2014, IEEE Transactions on Knowledge and Data Engineering.

[19]  Alejandro Bellogín,et al.  News@hand: A Semantic Web Approach to Recommending News , 2008, AH.

[20]  Kenneth Wai-Ting Leung,et al.  CLR: a collaborative location recommendation framework based on co-clustering , 2011, SIGIR.

[21]  Pasquale Lops,et al.  Content-based Recommender Systems: State of the Art and Trends , 2011, Recommender Systems Handbook.

[22]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[23]  Michael J. Pazzani,et al.  Learning and Revising User Profiles: The Identification of Interesting Web Sites , 1997, Machine Learning.

[24]  Pasquale Lops,et al.  A content-collaborative recommender that exploits WordNet-based user profiles for neighborhood formation , 2007, User Modeling and User-Adapted Interaction.

[25]  Yifan Hu,et al.  Collaborative Filtering for Implicit Feedback Datasets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[26]  Jon Kleinberg,et al.  Authoritative sources in a hyperlinked environment , 1999, SODA '98.

[27]  Wolfgang Nejdl,et al.  The Benefit of Using Tag-Based Profiles , 2007 .

[28]  Russell Greiner,et al.  Does Wikipedia Information Help Netflix Predictions? , 2008, 2008 Seventh International Conference on Machine Learning and Applications.

[29]  Huan Liu,et al.  Relational learning via latent social dimensions , 2009, KDD.

[30]  J. J. Rocchio,et al.  Relevance feedback in information retrieval , 1971 .