Anticipating Discussion Activity on Community Forums

Attention economics is a vital component of the Social Web, where the sheer magnitude and rate at which social data is published forces web users to decide on what content to focus their attention on. By predicting popular posts on the Social Web, that contain lengthy discussions and debates, analysts can focus their attention more effectively on content that is deemed more influential. In this paper we present a two-step approach to anticipate discussions in community forums by a) identifying seed posts - i.e., posts that generate discussions, and b) predicting the length of these discussions. We explore the effectiveness of a range of features in anticipating discussions such as user and content features, and present 'focus' features that capture the topical concentration of a user. For identifying seed posts we show that content features are better predictors than user features, while achieving an F1 value of 0.792 when using all features. For predicting discussion activity we find a positive correlation between the focus of the user and discussion volumes, and achieve an nDCG@1 value of 0.89 when predicting using user features.

[1]  Eugene Agichtein,et al.  Learning to recognize reliable users and content in social media with coupled mutual reinforcement , 2009, WWW '09.

[2]  Elizabeth M. Daly,et al.  Decomposing Discussion Forums using Common User Roles , 2010 .

[3]  Krishna P. Gummadi,et al.  Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[4]  Ed H. Chi,et al.  Want to be Retweeted? Large Scale Analytics on Factors Impacting Retweet in Twitter Network , 2010, 2010 IEEE Second International Conference on Social Computing.

[5]  Matthew Rowe,et al.  Predicting Discussions on the Social Semantic Web , 2011, ESWC.

[6]  Lada A. Adamic,et al.  Knowledge sharing and yahoo answers: everyone knows something , 2008, WWW.

[7]  Brian D. Davison,et al.  Predicting popular messages in Twitter , 2011, WWW.

[8]  James Caverlee,et al.  Ranking Comments on the Social Web , 2009, 2009 International Conference on Computational Science and Engineering.

[9]  Huzefa Rangwala,et al.  Defining a Coparticipation Network Using Comments on Digg , 2010, IEEE Intelligent Systems.

[10]  R. Gunning The Technique of Clear Writing. , 1968 .

[11]  Bernardo A. Huberman,et al.  Predicting the popularity of online content , 2008, Commun. ACM.

[12]  Wolfgang Nejdl,et al.  How useful are your comments?: analyzing and predicting youtube comments and comment ratings , 2010, WWW '10.

[13]  Vicenç Gómez,et al.  Statistical analysis of the social network and discussion threads in slashdot , 2008, WWW.