What kind of #conversation is Twitter? Mining #psycholinguistic cues for emergency coordination

The information overload created by social media messages in emergency situations challenges response organizations to find targeted content and users. We aim to select useful messages by detecting the presence of conversation as an indicator of coordinated citizen action. Using simple linguistic indicators drawn from conversation analysis in social science, we model the presence of coordination in the communication landscape of Twitter using a corpus of 1.5 million tweets for various disaster and non-disaster events spanning different periods, lengths of time, and varied social significance. Within replies, retweets and tweets that mention other Twitter users, we found that domain-independent, linguistic cues distinguish likely conversation from non-conversation in this online form of mediated communication. We demonstrate that these likely conversation subsets potentially contain more information than non-conversation subsets, whether or not the tweets are replies, retweets, or mention other Twitter users, as long as they reflect conversational properties. From a practical perspective, we have developed a model for trimming the candidate tweet corpus to identify a much smaller subset of data for submission to deeper, domain-dependent semantic analyses for the identification of actionable information nuggets for coordinated emergency response.

[1]  E. Hovy,et al.  Contextual Bearing on Linguistic Variation in Social Media , 2011 .

[2]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[3]  Amit P. Sheth,et al.  A Qualitative Examination of Topical Tweet and Retweet Practices , 2010, ICWSM.

[4]  Alan Ritter,et al.  Unsupervised Modeling of Twitter Conversations , 2010, NAACL.

[5]  David A. Landgrebe,et al.  A survey of decision tree classifier methodology , 1991, IEEE Trans. Syst. Man Cybern..

[6]  Amit P. Sheth,et al.  Citizen sensor data mining, social media analytics and development centric web applications , 2011, WWW.

[7]  Danah Boyd,et al.  Tweet, Tweet, Retweet: Conversational Aspects of Retweeting on Twitter , 2010, 2010 43rd Hawaii International Conference on System Sciences.

[8]  Gloria Mark,et al.  Extreme collaboration , 2002, CACM.

[9]  Philip R. Cohen,et al.  Referring as a Collaborative Process , 2003 .

[10]  Amit P. Sheth,et al.  Citizen Sensing, Social Signals, and Enriching Human Experience , 2009, IEEE Internet Computing.

[11]  Susan T. Dumais,et al.  Mark my words!: linguistic style accommodation in social media , 2011, WWW.

[12]  W. Chafe Cognitive constraints on information flow , 1984 .

[13]  Jean-Paul de Vooght Citizen Sensing , 2014, Encyclopedia of Social Network Analysis and Mining.

[14]  Amit P. Sheth,et al.  Twitris+: Social Media Analytics Platform for Effective Coordination , 2012 .

[15]  Amit P. Sheth,et al.  Framework for the Analysis of Coordination in Crisis Response , 2012 .

[16]  Amit P. Sheth,et al.  Spatio-Temporal-Thematic Analysis of Citizen Sensor Data: Challenges and Experiences , 2009, WISE.

[18]  Susan C. Herring,et al.  Beyond Microblogging: Conversation and Collaboration via Twitter , 2009, 2009 42nd Hawaii International Conference on System Sciences.

[19]  J. Chatwin Conversation analysis. , 2004, Complementary therapies in medicine.

[20]  Amit P. Sheth,et al.  Continuous Semantics to Analyze Real-Time Data , 2010, IEEE Internet Computing.

[21]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .