Automatic selection of social media responses to news

Social media responses to news have increasingly gained in importance as they can enhance a consumer's news reading experience, promote information sharing and aid journalists in assessing their readership's response to a story. Given that the number of responses to an online news article may be huge, a common challenge is that of selecting only the most interesting responses for display. This paper addresses this challenge by casting message selection as an optimization problem. We define an objective function which jointly models the messages' utility scores and their entropy. We propose a near-optimal solution to the underlying optimization problem, which leverages the submodularity property of the objective function. Our solution first learns the utility of individual messages in isolation and then produces a diverse selection of interesting messages by maximizing the defined objective function. The intuitions behind our work are that an interesting selection of messages contains diverse, informative, opinionated and popular messages referring to the news article, written mostly by users that have authority on the topic. Our intuitions are embodied by a rich set of content, social and user features capturing the aforementioned aspects. We evaluate our approach through both human and automatic experiments, and demonstrate it outperforms the state of the art. Additionally, we perform an in-depth analysis of the annotated ``interesting'' responses, shedding light on the subjectivity around the selection process and the perception of interestingness.

[1]  Fernando Diaz,et al.  Time is of the essence: improving recency ranking using Twitter data , 2010, WWW '10.

[2]  Ravi Kumar,et al.  Visualizing tags over time , 2006, WWW '06.

[3]  Munmun De Choudhury,et al.  What makes conversations interesting?: themes, participants and consequences of conversations in online social media , 2009, WWW '09.

[4]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[5]  S. Ye Measuring message propagation and social influence on Twitter , 2013 .

[6]  Krishna P. Gummadi,et al.  Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[7]  Hila Becker,et al.  Selecting Quality Twitter Content for Events , 2011, ICWSM.

[8]  Maurice Queyranne,et al.  An Exact Algorithm for Maximum Entropy Sampling , 1995, Oper. Res..

[9]  Thomas Gottron,et al.  Bad news travel fast: a content-based analysis of interestingness on Twitter , 2011, WebSci '11.

[10]  M. L. Fisher,et al.  An analysis of approximations for maximizing submodular set functions—I , 1978, Math. Program..

[11]  Shyhtsun Felix Wu,et al.  Measuring message propagation and social influence on Twitter.com , 2013, Int. J. Commun. Networks Distributed Syst..

[12]  Ana-Maria Popescu,et al.  "Dancing with the Stars, " NBA Games, Politics: An Exploration of Twitter Users' Response to Events , 2011, ICWSM.

[13]  Mor Naaman,et al.  Is it really about me?: message content in social awareness streams , 2010, CSCW '10.

[14]  Rizal Setya Perdana What is Twitter , 2013 .

[15]  Deepa Paranjpe,et al.  Learning document aboutness from implicit user feedback and document structure , 2009, CIKM.

[16]  Fabio Massimo Zanzotto,et al.  Linguistic Redundancy in Twitter , 2011, EMNLP.

[17]  Chong Long,et al.  A new approach for multi-document update summarization , 2010 .

[18]  Hila Becker,et al.  Hip and trendy: Characterizing emerging trends on Twitter , 2011, J. Assoc. Inf. Sci. Technol..

[19]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[20]  Jugal Kalita,et al.  Evaluating Methods for Summarizing Twitter Posts , 2011 .

[21]  Juan-Zi Li,et al.  Social context summarization , 2011, SIGIR.

[22]  Jeffrey Nichols,et al.  Summarizing sporting events using twitter , 2012, IUI '12.

[23]  Yang Liu,et al.  Why is “SXSW” trending? Exploring Multiple Text Sources for Twitter Topic Summarization , 2011 .

[24]  Jugal K. Kalita,et al.  Experiments in Microblog Summarization , 2010, 2010 IEEE Second International Conference on Social Computing.

[25]  Ana-Maria Popescu,et al.  Detecting controversial events from twitter , 2010, CIKM.

[26]  Duncan J. Watts,et al.  Everyone's an influencer: quantifying influence on twitter , 2011, WSDM '11.

[27]  Mary Czerwinski,et al.  Find Me the Right Content! Diversity-Based Sampling of Social Media Spaces for Topic-Centric Search , 2011, ICWSM.

[28]  Ting Wang,et al.  Opinion Retrieval in Twitter , 2012, ICWSM.

[29]  Subhajit Sanyal,et al.  Multi-objective ranking of comments on web , 2012, WWW.

[30]  Gilad Mishne,et al.  Finding high-quality content in social media , 2008, WSDM '08.

[31]  Bernardo A. Huberman,et al.  The Pulse of News in Social Media: Forecasting Popularity , 2012, ICWSM.

[32]  J. Fleiss Measuring nominal scale agreement among many raters. , 1971 .

[33]  ChengXiang Zhai,et al.  Micropinion generation: an unsupervised approach to generating ultra-concise summaries of opinions , 2012, WWW.

[34]  Michael S. Bernstein,et al.  A Torrent of Tweets: Managing Information Overload in Online Social Streams , 2010 .

[35]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[36]  Jacob Cohen,et al.  Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. , 1968 .

[37]  Alexander K. Kelmans,et al.  Multiplicative submodularity of a matrix's principal minor as a function of the set of its rows and some combinatorial applications , 1983, Discret. Math..