Understanding Information Credibility on Twitter

Increased popularity of microblogs in recent years brings about a need for better mechanisms to extract credible or otherwise useful information from noisy and large data. While there are a great number of studies that introduce methods to find credible data, there is no accepted credibility benchmark. As a result, it is hard to compare different studies and generalize from their findings. In this paper, we argue for a methodology for making such studies more useful to the research community. First, the underlying ground truth values of credibility must be reliable. The specific constructs used to define credibility must be carefully defined. Secondly, the underlying network context must be quantified and documented. To illustrate these two points, we conduct a unique credibility study of two different data sets on the same topic, but with different network characteristics. We also conduct two different user surveys, and construct two additional indicators of credibility based on retweet behavior. Through a detailed statistical study, we first show that survey based methods can be extremely noisy and results may vary greatly from survey to survey. However, by combining such methods with retweet behavior, we can incorporate two signals that are noisy but uncorrelated, resulting in ground truth measures that can be predicted with high accuracy and are stable across different data sets and survey methods. Newsworthiness of tweets can be a useful frame for specific applications, but it is not necessary for achieving reliable credibility ground truth measurements.

[1]  Brendan T. O'Connor,et al.  Cheap and Fast – But is it Good? Evaluating Non-Expert Annotations for Natural Language Tasks , 2008, EMNLP.

[2]  A. Sanfey,et al.  Friend or foe: The effect of implicit trustworthiness judgments in social decision-making , 2008, Cognition.

[3]  Scott Counts,et al.  Tweeting is believing?: understanding microblog credibility perceptions , 2012, CSCW.

[4]  Barbara Poblete,et al.  Twitter under crisis: can we trust what we RT? , 2010, SOMA '10.

[5]  Daniel Z. Levin,et al.  The Strength of Weak Ties You Can Trust: The Mediating Role of Trust in Effective Knowledge Transfer , 2004, Manag. Sci..

[6]  Yu Suzuki A Credibility Assessment for Message Streams on Microblogs , 2010, 2010 International Conference on P2P, Parallel, Grid, Cloud and Internet Computing.

[7]  Sibel Adali,et al.  Modeling Trust Context in Networks , 2013, SpringerBriefs in Computer Science.

[8]  P. Zoë Stavri,et al.  Perception of Quality and Trustworthiness of Internet Resources by Personal Health Information Seekers , 2003, AMIA.

[9]  Alexander Todorov,et al.  Modeling Social Perception of Faces , 2011 .

[10]  Panagiotis Takis Metaxas,et al.  What Edited Retweets Reveal about Online Political Discourse , 2011, Analyzing Microtext.

[11]  Sibel Adali,et al.  Actions speak as loud as words: predicting relationships from social behavior data , 2012, WWW.

[12]  Danah Boyd,et al.  Tweet, Tweet, Retweet: Conversational Aspects of Retweeting on Twitter , 2010, 2010 43rd Hawaii International Conference on System Sciences.

[13]  Luke J. Chang,et al.  Seeing is believing: Trustworthiness as a dynamic belief , 2010, Cognitive Psychology.

[14]  Soo Young Rieh,et al.  Developing a unifying framework of credibility assessment: Construct, heuristics, and interaction in context , 2008, Inf. Process. Manag..

[15]  Kevin Robert Canini,et al.  Finding Credible Information Sources in Social Networks Based on Content and Social Structure , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[16]  Mor Naaman,et al.  Unfolding the event landscape on twitter: classification and exploration of user categories , 2012, CSCW '12.

[17]  Mohammed J. Zaki,et al.  Finding Hidden Group Structure in a Stream of Communications , 2006, ISI.

[18]  Barbara Poblete,et al.  Predicting information credibility in time-sensitive social media , 2013, Internet Res..

[19]  Ponnurangam Kumaraguru,et al.  Credibility ranking of tweets during high impact events , 2012, PSOSM '12.

[20]  Krishna P. Gummadi,et al.  Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[21]  Sibel Adali,et al.  Measuring behavioral trust in social networks , 2010, 2010 IEEE International Conference on Intelligence and Security Informatics.

[22]  Daniel Gayo-Avello,et al.  "I Wanted to Predict Elections with Twitter and all I got was this Lousy Paper" - A Balanced Survey on Election Prediction using Twitter Data , 2012, ArXiv.

[23]  Barbara Poblete,et al.  Information credibility on twitter , 2011, WWW.

[24]  A. Todorov,et al.  Inferences of Competence from Faces Predict Election Outcomes , 2005, Science.

[25]  Mor Naaman,et al.  Finding and assessing social media information sources in the context of journalism , 2012, CHI.

[26]  Tobias Höllerer,et al.  Modeling topic specific credibility on twitter , 2012, IUI '12.