Visualizing Collective Attention Using Association Networks

The socialization of the Web changes the ways we behave both online and offline, leading to a novel emergent phenomenon called “collective attention” in which people’s attention is suddenly concentrated on a particular real-life event. Visualizing collective attention is fundamental to understand human behavior in the digital age. Here we propose “association networks” to visualize usage-based, term-association patterns in a large dataset of tweets (short text messages) during collective attention events. First, we train the word2vec model to obtain vector representations of terms (words) based on semantic similarities, and then construct association networks: given some terms as seeds, the associated terms are linked with each other using the trained word2vec model, and considering the resulting terms as new seeds, the same procedure is repeated. Using two sets of Twitter data—the 2011 Japan earthquake and the 2011 FIFA Women’s World Cup—we demonstrate how association networks visualize collective attention on these events. Provided the Japan earthquake dataset, the association networks that emerged from the most frequently used terms exhibit distinct network structure related to people’s attention during the earthquake, whereas one that emerged from emotion-related terms, such as great and terrible, shows a large connected cluster of negative terms and small clusters of positive terms. Furthermore, we compare association networks in different datasets, using the same seed terms. These results indicate the proposed method to be a useful tool for visualizing the implicit nature of collective attention that is otherwise invisible.

[1]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[2]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[3]  Ciro Cattuto,et al.  Dynamical classes of collective attention in twitter , 2011, WWW.

[4]  Lin Zhong,et al.  Human as Real-Time Sensors of Social and Physical Events: A Case Study of Twitter and Sports Games , 2011, ArXiv.

[5]  G. Miller Sociology. Social scientists wade into the tweet stream. , 2011, Science.

[6]  Bu-Sung Lee,et al.  Event Detection in Twitter , 2011, ICWSM.

[7]  Kazuyuki Aihara,et al.  Quantifying Collective Attention from Tweet Stream , 2013, PloS one.

[8]  Takaya Arita,et al.  Twitter as Social Sensor: Dynamics and Structure in Major Sporting Events , 2014, ALIFE.

[9]  Mathieu Bastian,et al.  Gephi: An Open Source Software for Exploring and Manipulating Networks , 2009, ICWSM.

[10]  S. T. Dumais,et al.  Human factors and behavioral science: Statistical semantics: Analysis of the potential performance of key-word information systems , 1983, The Bell System Technical Journal.

[11]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[12]  Ciro Cattuto,et al.  Mining Concurrent Topical Activity in Microblog Streams , 2014, #MSM.

[13]  Susan T. Dumais,et al.  Statistical semantics: analysis of the potential performance of keyword information systems , 1984 .

[14]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[15]  Nello Cristianini,et al.  Nowcasting Events from the Social Web with Statistical Learning , 2012, TIST.