Earthquake shakes Twitter users: real-time event detection by social sensors

Twitter, a popular microblogging service, has received much attention recently. An important characteristic of Twitter is its real-time nature. For example, when an earthquake occurs, people make many Twitter posts (tweets) related to the earthquake, which enables detection of earthquake occurrence promptly, simply by observing the tweets. As described in this paper, we investigate the real-time interaction of events such as earthquakes in Twitter and propose an algorithm to monitor tweets and to detect a target event. To detect a target event, we devise a classifier of tweets based on features such as the keywords in a tweet, the number of words, and their context. Subsequently, we produce a probabilistic spatiotemporal model for the target event that can find the center and the trajectory of the event location. We consider each Twitter user as a sensor and apply Kalman filtering and particle filtering, which are widely used for location estimation in ubiquitous/pervasive computing. The particle filter works better than other comparable methods for estimating the centers of earthquakes and the trajectories of typhoons. As an application, we construct an earthquake reporting system in Japan. Because of the numerous earthquakes and the large number of Twitter users throughout the country, we can detect an earthquake with high probability (96% of earthquakes of Japan Meteorological Agency (JMA) seismic intensity scale 3 or more are detected) merely by monitoring tweets. Our system detects earthquakes promptly and sends e-mails to registered users. Notification is delivered much faster than the announcements that are broadcast by the JMA.

[1]  Dieter Fox,et al.  Bayesian Filtering for Location Estimation , 2003, IEEE Pervasive Comput..

[2]  Timothy W. Finin,et al.  Why we twitter: understanding microblogging usage and communities , 2007, WebKDD/SNA-KDD '07.

[3]  Neil J. Gordon,et al.  A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking , 2002, IEEE Trans. Signal Process..

[4]  Bernard J. Jansen,et al.  Twitter power: Tweets as electronic word of mouth , 2009, J. Assoc. Inf. Sci. Technol..

[5]  Jure Leskovec,et al.  The dynamics of viral marketing , 2005, EC '06.

[6]  Dirk Schulz,et al.  Bayesian Filters for Location Estimation , 2003 .

[7]  Chao Liu,et al.  A probabilistic approach to spatiotemporal theme pattern mining on weblogs , 2006, WWW '06.

[8]  Pavel Serdyukov,et al.  Placing flickr photos on a map , 2009, SIGIR.

[9]  Alexandre Passant,et al.  Microblogging: A Semantic Web and Distributed Approach , 2008 .

[10]  E. M. Scordilis,et al.  Accelerating seismic crustal deformation before strong mainshocks in Adriatic and its importance for earthquake prediction , 2004 .

[11]  John G. Breslin,et al.  Microblogging: A Semantic Web and Distributed Approach , 2008 .

[12]  M. Weiser The Computer for the Twenty-First Century , 1991 .

[13]  Martin Ebner,et al.  Microblogging - more than fun? , 2008 .

[14]  Carmen Holotescu,et al.  INDICATORS FOR THE ANALYSIS OF LEARNING AND PRACTICE COMMUNITIES FROM THE PERSPECTIVE OF MICROBLOGGING AS A PROVOCATIVE SOCIOLECT IN VIRTUAL SPACE , 2009 .

[15]  Danah Boyd,et al.  Tweet, Tweet, Retweet: Conversational Aspects of Retweeting on Twitter , 2010, 2010 43rd Hawaii International Conference on System Sciences.

[16]  Mor Naaman,et al.  Towards automatic extraction of event and place semantics from flickr tags , 2007, SIGIR.

[17]  Thorsten Joachims,et al.  Text categorization with support vector machines , 1999 .

[18]  Yutaka Matsuo,et al.  Community gravity: measuring bidirectional effects by trust and rating on online social networks , 2009, WWW '09.

[19]  Gaetano Borriello,et al.  Particle Filters for Location Estimation in Ubiquitous Computing: A Case Study , 2004, UbiComp.

[20]  Jon M. Kleinberg,et al.  Spatial variation in search engine queries , 2008, WWW.

[21]  Neil J. Gordon,et al.  A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking , 2002, IEEE Trans. Signal Process..

[22]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[23]  Ruimin Shen,et al.  Microblogging for Language Learning: Using Twitter to Train Communicative and Cultural Competence , 2009, ICWL.

[24]  Fang Wu,et al.  Social Networks that Matter: Twitter Under the Microscope , 2008, First Monday.

[25]  Gaetano Borriello,et al.  Location Systems for Ubiquitous Computing , 2001, Computer.

[26]  T. Bleier,et al.  Earthquake [earthquake warning systems] , 2005, IEEE Spectrum.

[27]  Mor Naaman,et al.  Is it really about me?: message content in social awareness streams , 2010, CSCW '10.