User-generated content is quickly becoming the largest source of information on the World Wide Web. Shared content items are initially unconfirmed, in the sense that their credibility has not yet been established. Centralized confirmation of credibility is infeasible at Internet scale, so it is essential to rely on the annotators themselves to evaluate each item. However, users often differ in their opinions of the same item, and bias, variance, and malicious behaviour make the problem of aggregating opinions more difficult. To address this problem, we propose an Author-Annotator model with an iterative algorithm, called ScoreFinder, that infers credibility by ranking shared items. To reduce the influence of various error sources, we identify reliable users on each topic and adaptively aggregate scores from them. Moreover, we transform users' input to remove errors and anomalies, based on patterns of misbehaviour learned from a real data set. We evaluate our algorithm on both real and synthetic data sets and show that it achieves a significant improvement.
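To illustrate the general idea of adaptively aggregating scores from reliable annotators, the following is a minimal sketch of one plausible iterative reliability-weighted aggregation scheme. It is an illustrative assumption, not the paper's ScoreFinder algorithm: the function names, the agreement measure (inverse mean deviation from the current consensus), and the fixed iteration count are all choices made here for exposition.

```python
# Minimal sketch of an iterative reliability-weighted aggregation scheme.
# Assumed structure for illustration only; this is NOT the ScoreFinder
# algorithm from the paper. `ratings` maps (user, item) -> score in [0, 1].

def aggregate(ratings, n_iter=20):
    users = {u for u, _ in ratings}
    items = {i for _, i in ratings}
    reliability = {u: 1.0 for u in users}   # start with uniform trust
    score = {i: 0.5 for i in items}         # neutral prior per item

    for _ in range(n_iter):
        # Item update: reliability-weighted mean of the ratings on each item.
        for i in items:
            num = den = 0.0
            for (u, j), r in ratings.items():
                if j == i:
                    num += reliability[u] * r
                    den += reliability[u]
            if den > 0:
                score[i] = num / den

        # Annotator update: trust is the inverse of the mean deviation
        # between a user's ratings and the current consensus scores.
        for u in users:
            devs = [abs(r - score[j]) for (v, j), r in ratings.items() if v == u]
            if devs:
                reliability[u] = 1.0 / (sum(devs) / len(devs) + 1e-6)

    return score, reliability


if __name__ == "__main__":
    toy = {("alice", "post1"): 0.9, ("bob", "post1"): 0.8,
           ("mallory", "post1"): 0.1, ("alice", "post2"): 0.2,
           ("bob", "post2"): 0.3, ("mallory", "post2"): 0.95}
    scores, trust = aggregate(toy)
    print(scores)  # consensus leans toward alice/bob; mallory is down-weighted
```

Under this kind of scheme, an annotator whose scores consistently deviate from the emerging consensus is progressively down-weighted, which is one simple way the influence of biased or malicious users could be reduced.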