TwitterCrowds: Techniques for Exploring Topic and Sentiment in Microblogging Data

Analysts and social scientists in the humanities and industry require techniques to help visualize large quantities of microblogging data. Methods for the automated analysis of large scale social media data (on the order of tens of millions of tweets) are widely available, but few visualization techniques exist to support interactive exploration of the results. In this paper, we present extended descriptions of ThemeCrowds and SentireCrowds, two tag-based visualization techniques for this data. We subsequently introduce a new list equivalent for both of these techniques and present a number of case studies showing them in operation. Finally, we present a formal user study to evaluate the effectiveness of these list interface equivalents when comparing them to ThemeCrowds and SentireCrowds. We find that discovering topics associated with areas of strong positive or negative sentiment is faster when using a list interface. In terms of user preference, multilevel tag clouds were found to be more enjoyable to use. Despite both interfaces being usable for all tested tasks, we have evidence to support that list interfaces can be more efficient for tasks when an appropriate ordering is known beforehand.

[1]  Jimeng Sun,et al.  FacetAtlas: Multifaceted Visualization for Rich Text Corpora , 2010, IEEE Transactions on Visualization and Computer Graphics.

[2]  David A. Shamma,et al.  Tweet the debates: understanding community annotation of uncollected sources , 2009, WSM@MM.

[3]  Stuart J. Rose,et al.  Describing story evolution from dynamic information streams , 2009, 2009 IEEE Symposium on Visual Analytics Science and Technology.

[4]  Lucy T. Nowell,et al.  ThemeRiver: Visualizing Thematic Changes in Large Document Collections , 2002, IEEE Trans. Vis. Comput. Graph..

[5]  Michael J. Muller,et al.  Getting our head in the clouds: toward evaluation studies of tagclouds , 2007, CHI.

[6]  Miguel Rios,et al.  Distilling Massive Amounts of Data into Simple Visualizations : Twitter Case Studies , 2012 .

[7]  Martin Halvey,et al.  An assessment of tag presentation techniques , 2007, WWW '07.

[8]  Jean-Daniel Fekete,et al.  Hierarchical Aggregation for Information Visualization: Overview, Techniques, and Design Guidelines , 2010, IEEE Transactions on Visualization and Computer Graphics.

[9]  Timothy W. Finin,et al.  Why we twitter: understanding microblogging usage and communities , 2007, WebKDD/SNA-KDD '07.

[10]  Ari Rappoport,et al.  Enhanced Sentiment Learning Using Twitter Hashtags and Smileys , 2010, COLING.

[11]  Derek Greene,et al.  ThemeCrowds: multiresolution summaries of twitter usage , 2011, SMUC '11.

[12]  Jeffrey Heer,et al.  Scented Widgets: Improving Navigation Cues with Embedded Visualizations , 2007, IEEE Transactions on Visualization and Computer Graphics.

[13]  M. Sheelagh T. Carpendale,et al.  A Visual Backchannel for Large-Scale Events , 2010, IEEE Transactions on Visualization and Computer Graphics.

[14]  Xin Tong,et al.  TextFlow: Towards Better Understanding of Evolving Topics in Text , 2011, IEEE Transactions on Visualization and Computer Graphics.

[15]  Xiaohua Sun,et al.  Whisper: Tracing the Spatiotemporal Process of Information Diffusion in Real Time , 2012, IEEE Transactions on Visualization and Computer Graphics.

[16]  David Auber,et al.  Tulip - A Huge Graph Visualization Framework , 2004, Graph Drawing Software.

[17]  Martin Wattenberg,et al.  TIMELINESTag clouds and the case for vernacular visualization , 2008, INTR.

[18]  Daniel M. Best,et al.  Web-Based Visual Analytics for Social Media , 2012, Proceedings of the International AAAI Conference on Web and Social Media.

[19]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[20]  Yaneer Bar-Yam,et al.  An exploration of social identity: The geography and politics of news-sharing communities in twitter , 2012, Complex..

[21]  Deborah A. Payne,et al.  Turning the Bucket of Text into a Pipe , 2005, INFOVIS.

[22]  Wolfgang Kienreich,et al.  Evaluating a System for Interactive Exploration of Large, Hierarchically Structured Document Repositories , 2004 .

[23]  Chris H. Q. Ding,et al.  Cluster merging and splitting in hierarchical clustering algorithms , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[24]  M. Sheelagh T. Carpendale,et al.  SparkClouds: Visualizing Trends in Tag Clouds , 2010, IEEE Transactions on Visualization and Computer Graphics.

[25]  Jarke J. van Wijk,et al.  Squarified Treemaps , 2000, VisSym.

[26]  Jean-Daniel Fekete,et al.  User-Supplied Sentiments in Tweets , 2012 .

[27]  Derek Greene,et al.  Deriving Insights from National Happiness Indices , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[28]  Edward R. Tufte,et al.  Envisioning Information , 1990 .

[29]  Lei Shi,et al.  Understanding text corpora with multiple facets , 2010, 2010 IEEE Symposium on Visual Analytics Science and Technology.

[30]  Steven Skiena,et al.  Watch the Story Unfold with TextWheel: Visualization of Large-Scale News Streams , 2012, TIST.

[31]  Tobias Schreck,et al.  Topic Tracker : Shape-based Visualization for Trend and Sentiment Tracking in Twitter , 2012 .

[32]  John Hannon,et al.  Recommending twitter users to follow using content and collaborative filtering approaches , 2010, RecSys '10.

[33]  Hila Becker,et al.  Beyond Trending Topics: Real-World Event Identification on Twitter , 2011, ICWSM.

[34]  Isabell M. Welpe,et al.  Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment , 2010, ICWSM.

[35]  Derek Greene,et al.  Identifying Representative Textual Sources in Blog Networks , 2021, ICWSM.

[36]  Barbara Tversky,et al.  Animation: can it facilitate? , 2002, Int. J. Hum. Comput. Stud..

[37]  Arjan Kuijper,et al.  Visual Analysis of Large Graphs , 2010, Eurographics.

[38]  David R. Karger,et al.  Scatter/Gather: a cluster-based approach to browsing large document collections , 1992, SIGIR '92.

[39]  Patrick Paroubek,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2010, LREC.

[40]  James J. Thomas,et al.  Visualizing the non-visual: spatial analysis and interaction with information from text documents , 1995, Proceedings of Visualization 1995 Conference.

[41]  Adam D. I. Kramer An unobtrusive behavioral model of "gross national happiness" , 2010, CHI.