Our society is increasingly digitalized. Every day, a tremendous amount of information is being created, shared, and digested through all kinds of cyber channels. Although people can easily acquire information from various sources (social media, news articles, etc.), the truthfulness of most received information remains unverified. In many real-life scenarios, false information has become the de facto cause that leads to detrimental decision makings, and techniques that can automatically filter false information are highly demanded. However, verifying whether a piece of information is trustworthy is difficult because: (1) selecting candidate snippets for fact checking is nontrivial; and (2) detecting supporting evidences, i.e. stances, suffers from the difficulty of measuring the similarity between claims and related evidences. We build ClaimVerif, a claim verification system that not only provides credibility assessment for any user-given query claim, but also rationales the assessment results with supporting evidences. ClaimVerif can automatically select the stances from millions of documents and employs two-step training to justify the opinions of the stances. Furthermore, combined with the credibility of stances sources, ClaimVerif degrades the score of stances from untrustworthy sources and alleviates the negative effects from rumor spreaders. Our empirical evaluations show that ClaimVerif achieves both high accuracy and efficiency in different claim verification tasks. It can be highly useful in practical applications by providing multi-dimension analysis for the suspicious statements, including the stances, opinions, source credibility and estimated judgements.
[1]
Gerhard Weikum,et al.
Credibility Assessment of Textual Claims on the Web
,
2016,
CIKM.
[2]
Fan Yang,et al.
Automatic detection of rumor on Sina Weibo
,
2012,
MDS '12.
[3]
Gerhard Weikum,et al.
Where the Truth Lies: Explaining the Credibility of Emerging Claims on the Web and Social Media
,
2017,
WWW.
[4]
M. Gentzkow,et al.
Social Media and Fake News in the 2016 Election
,
2017
.
[5]
Mona T. Diab,et al.
Rumor Detection and Classification for Twitter Data
,
2015,
ArXiv.
[6]
Quoc V. Le,et al.
Distributed Representations of Sentences and Documents
,
2014,
ICML.
[7]
Daniel Jurafsky,et al.
Linguistic Models for Analyzing and Detecting Biased Language
,
2013,
ACL.
[8]
Yongdong Zhang,et al.
News Credibility Evaluation on Microblog with a Hierarchical Propagation Model
,
2014,
2014 IEEE International Conference on Data Mining.
[9]
Jeffrey Dean,et al.
Distributed Representations of Words and Phrases and their Compositionality
,
2013,
NIPS.
[10]
Wei Gao,et al.
Detect Rumors Using Time Series of Social Context Information on Microblogging Websites
,
2015,
CIKM.