Online reviews provide valuable information about products and services to consumers. However, spammers are joining the community trying to mislead readers by writing fake reviews. Previous attempts for spammer detection used reviewers' behaviors, text similarity, linguistics features and rating patterns. Those studies are able to identify certain types of spammers, e.g., those who post many similar reviews about one target entity. However, in reality, there are other kinds of spammers who can manipulate their behaviors to act just like genuine reviewers, and thus cannot be detected by the available techniques. In this paper, we propose a novel concept of a heterogeneous review graph to capture the relationships among reviewers, reviews and stores that the reviewers have reviewed. We explore how interactions between nodes in this graph can reveal the cause of spam and propose an iterative model to identify suspicious reviewers. This is the first time such intricate relationships have been identified for review spam detection. We also develop an effective computation method to quantify the trustiness of reviewers, the honesty of reviews, and the reliability of stores. Different from existing approaches, we don't use review text information. Our model is thus complementary to existing approaches and able to find more difficult and subtle spamming activities, which are agreed upon by human judges after they evaluate our results.
[1]
Jacob Cohen,et al.
The Equivalence of Weighted Kappa and the Intraclass Correlation Coefficient as Measures of Reliability
,
1973
.
[2]
J. R. Landis,et al.
The measurement of observer agreement for categorical data.
,
1977,
Biometrics.
[3]
M. KleinbergJon.
Authoritative sources in a hyperlinked environment
,
1999
.
[4]
Bing Liu,et al.
Mining and summarizing customer reviews
,
2004,
KDD.
[5]
Philip S. Yu,et al.
Truth Discovery with Multiple Conflicting Information Providers on the Web
,
2007,
IEEE Transactions on Knowledge and Data Engineering.
[6]
Bing Liu,et al.
Opinion spam and analysis
,
2008,
WSDM '08.
[7]
Ee-Peng Lim,et al.
Detecting product review spammers using rating behaviors
,
2010,
CIKM.
[8]
Ee-Peng Lim,et al.
Finding unusual review patterns using unexpected rules
,
2010,
CIKM.
[9]
Claire Cardie,et al.
Finding Deceptive Opinion Spam by Any Stretch of the Imagination
,
2011,
ACL.
[10]
Yi Yang,et al.
Learning to Identify Review Spam
,
2011,
IJCAI.