Characterizing and curating conversation threads: expansion, focus, volume, re-entry

Discussion threads form a central part of the experience on many Web sites, including social networking sites such as Facebook and Google Plus and knowledge creation sites such as Wikipedia. To help users manage the challenge of allocating their attention among the discussions that are relevant to them, there has been a growing need for the algorithmic curation of on-line conversations: the development of automated methods to select a subset of discussions to present to a user. Here we consider two key sub-problems inherent in conversational curation: length prediction (predicting the number of comments a discussion thread will receive) and the novel task of re-entry prediction (predicting whether a user who has participated in a thread will later contribute another comment to it). The first of these sub-problems arises in estimating how interesting a thread is, in the sense of generating a lot of conversation; the second can help determine whether users should be kept notified of the progress of a thread to which they have already contributed. We develop and evaluate a range of approaches for these tasks, based on an analysis of the network structure and arrival pattern among the participants, as well as a novel dichotomy in the structure of long threads. We find that for both tasks, learning-based approaches using these sources of information yield improvements for all the performance metrics we used.
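To make the two task definitions concrete, the following minimal Python sketch derives prediction targets from a single thread. It is only an illustration under stated assumptions, not the method evaluated in the paper: a thread is assumed to be a chronologically ordered list of commenter identifiers, only the first `observed` comments are assumed to be visible at prediction time, and the function names `length_target` and `reentry_targets` are hypothetical.

```python
from typing import List, Tuple


def length_target(thread: List[str], observed: int) -> int:
    """Length prediction target: how many comments arrive after the
    first `observed` comments (assumed observation window)."""
    return max(len(thread) - observed, 0)


def reentry_targets(thread: List[str], observed: int) -> List[Tuple[str, bool]]:
    """Re-entry prediction targets: for each user who commented within the
    observed prefix, whether that user comments again later in the thread."""
    prefix = thread[:observed]
    later = set(thread[observed:])
    # Keep users in order of first appearance, without duplicates.
    users = list(dict.fromkeys(prefix))
    return [(user, user in later) for user in users]


# Hypothetical thread: each entry is the author of one comment, in arrival order.
thread = ["ann", "bob", "ann", "cat", "bob", "dan"]
print(length_target(thread, observed=3))    # 3
print(reentry_targets(thread, observed=3))  # [('ann', False), ('bob', True)]
```

A learned model would then be trained to predict these targets from features of the observed prefix, such as the arrival pattern and the network structure among the participants.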
