PredictingTrust from User Ratings

Abstract Trust relationships between usersinvariousonline communitiesare notoriouslyhardtomodelfor computer scientists.It canbeeasilyverifiedthattryingtoinfer trustbasedonthe socialnetworkaloneisoftenineffcient. Therefore, theavenue we explore is applying Data Mining algorithms to unearth latent relationships and patterns from background data. In this paper, we focus on a case where the background data is user ratings for online product reviews. We consider as a testinggroundalarge datasetprovidedby Epinions.comthat containsa trustnetworkaswellas userratingsforreviews on products from a wide range of categories. In order to predict trust we define and compute a critical set of features, which we show to be highly effective in providing the basis for trust predictions. Then, we show that state-of-the-art classifierscandoan impressivejobin predicting trustbasedonourextracted features.Forthis,weemploy avarietyof measurestoevaluatethe classification basedon these features.Weshow thatby carefully collectingand synthesizing readily available background information, such as ratings for online reviews, one can accurately predict social links based on trust.

[1]  Douglas B. Terry,et al.  Using collaborative filtering to weave an information tapestry , 1992, CACM.

[2]  Paolo Avesani,et al.  A trust-enhanced recommender system application: Moleskiing , 2005, SAC '05.

[3]  John A. Swets,et al.  Effectiveness of information retrieval methods , 1969 .

[4]  Ee-Peng Lim,et al.  Trust relationship prediction using online product review data , 2009, CIKM-CNIKM.

[5]  Paolo Avesani,et al.  Trust-Aware Collaborative Filtering for Recommender Systems , 2004, CoopIS/DOA/ODBASE.

[6]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[7]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[8]  Joseph Sill,et al.  Feature-Weighted Linear Stacking , 2009, ArXiv.

[9]  Cyril W. Cleverdon,et al.  Factors determining the performance of indexing systems , 1966 .

[10]  Jennifer Golbeck,et al.  Computing and Applying Trust in Web-based Social Networks , 2005 .

[11]  Yehuda Koren,et al.  Factor in the neighbors: Scalable and accurate collaborative filtering , 2010, TKDD.

[12]  Jennifer Golbeck,et al.  SUNNY: A New Algorithm for Trust Inference in Social Networks Using Probabilistic Confidence Models , 2007, AAAI.

[13]  N. Graham,et al.  Areas beneath the relative operating characteristics (ROC) and relative operating levels (ROL) curves: Statistical significance and interpretation , 2002 .

[14]  Yehuda Koren,et al.  Scalable Collaborative Filtering with Jointly Derived Neighborhood Interpolation Weights , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[15]  Ee-Peng Lim,et al.  To Trust or Not to Trust? Predicting Online Trusts Using Trust Antecedent Framework , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[16]  Jaideep Srivastava,et al.  Predicting trusts among users of online communities: an epinions case study , 2008, EC '08.

[17]  J. Ross Quinlan,et al.  Improved Use of Continuous Attributes in C4.5 , 1996, J. Artif. Intell. Res..

[18]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[19]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[20]  Yehuda Koren,et al.  Modeling relationships at multiple scales to improve accuracy of large recommender systems , 2007, KDD '07.