Spatiotemporal anomaly detection through visual analysis of geolocated Twitter messages

Analyzing message streams from social blogging services such as Twitter is challenging because of the vast number of documents produced daily. At the same time, geolocated, real-time, manually created status updates are an invaluable data source for situational awareness scenarios. In this work we present an approach that allows for interactive analysis of location-based microblog messages in real time by means of scalable aggregation and geolocated text visualization. For this purpose, we use a novel cluster analysis approach and distinguish between local event reports and global media reaction to detect spatiotemporal anomalies automatically. A workbench allows scalable visual examination and analysis of messages, featuring perspective and semantic layers on a world map representation. Analysts can use these techniques to classify the presented event candidates and examine them on a global scale.
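To make the distinction between local event reports and global media reaction more concrete, the following is a minimal sketch and not the cluster analysis approach of this work. It assumes that a term whose geolocated messages lie close together indicates a local event, while a term spread over large distances indicates global reaction. The function names, the 50 km threshold, and the minimum message count are hypothetical choices for illustration only.

```python
import math
from collections import defaultdict


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two lat/lon points."""
    r = 6371.0  # mean Earth radius in km
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))


def classify_terms(messages, max_local_spread_km=50.0, min_count=3):
    """Label each frequent term as 'local' or 'global' by its spatial spread.

    messages: iterable of (lat, lon, text) tuples.
    Both thresholds are illustrative assumptions, not values from the paper.
    """
    by_term = defaultdict(list)
    for lat, lon, text in messages:
        # Count each term once per message.
        for term in set(text.lower().split()):
            by_term[term].append((lat, lon))

    labels = {}
    for term, points in by_term.items():
        if len(points) < min_count:
            continue  # too few messages to judge
        # Naive centroid: averaging longitudes is inaccurate near the antimeridian.
        clat = sum(p[0] for p in points) / len(points)
        clon = sum(p[1] for p in points) / len(points)
        # Mean distance of the messages from their centroid.
        spread = sum(haversine_km(clat, clon, la, lo) for la, lo in points) / len(points)
        labels[term] = "local" if spread <= max_local_spread_km else "global"
    return labels
```

Under these assumptions, three messages mentioning the same term within a few kilometres of each other would be labeled local, whereas the same term posted from different continents would be labeled global. A real system would also need a temporal window and a baseline of normal term usage, which this sketch omits.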
