Content diffusion prediction in social networks

Social networks are valuable resources for analyzing users' natural behavior. User profile information, social links and interchanging opinions among users in these networks can be used by social analyzers to discover mental and behavioral patterns of users in social networks. In this paper, news agencies are used as the social media to detect effective factors of diffusing contents in public. We believe that the volume of comments on content show how well the content has spread and attracted attentions. As a result, we extract features of contents to predict volume of comments. To achieve this goal, content of the news articles and its publication time are considered as two critical factors. A novel method for prediction of content diffusion is proposed and its accuracy is evaluated. The promising results of our experiments indicate that these factors can gain accuracy of at least 70%.

[1]  H WittenIan,et al.  The WEKA data mining software , 2009 .

[2]  Beibei Li,et al.  Enhancing clustering blog documents by utilizing author/reader comments , 2007, ACM-SE 45.

[3]  Jacki O'Neill,et al.  Text chat in action , 2003, GROUP.

[4]  Huzefa Rangwala,et al.  Digging Digg: Comment Mining, Popularity Prediction, and Social Network Analysis , 2009, 2009 International Conference on Web Information Systems and Mining.

[5]  Josep Blat,et al.  Homogeneous Temporal Activity Patterns in a Large Online Communication Space , 2007, SAW.

[6]  Gilad Mishne,et al.  Leave a Reply: An Analysis of Weblog Comments , 2006 .

[7]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[8]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[9]  Jon Crowcroft,et al.  Rhythm and Randomness in Human Contact , 2010, 2010 International Conference on Advances in Social Networks Analysis and Mining.

[10]  Gilad Mishne,et al.  Applied text analytics for blogs , 2007 .

[11]  Noah A. Smith,et al.  Predicting Response to Political Blog Posts with Topic Models , 2009, NAACL.

[12]  Elizabeth M. Daly,et al.  Decomposing Discussion Forums and Boards Using User Roles , 2010, ICWSM.

[13]  Ee-Peng Lim,et al.  Comments-oriented blog summarization by sentence extraction , 2007, CIKM '07.

[14]  Yu Wu,et al.  Property description and comparison of BLOG community at different scales , 2011, 2011 3rd International Conference on Computer Research and Development.

[15]  Mark Pendergast,et al.  GROUP'03 : proceedings of the 2003 International ACM SIGGROUP Conference on Supporting Group Work : November 9-12, 2003, Sundial Resort on Sanibel Island, Florida, USA , 2003 .

[16]  Ronan Symoneaux,et al.  Comment analysis of consumer’s likes and dislikes as an alternative tool to preference mapping. A case study on apples , 2012 .

[17]  Douglas W. Oard,et al.  Context-based Message Expansion for Disentanglement of Interleaved Text Conversations , 2009, NAACL.

[18]  M. de Rijke,et al.  Predicting the volume of comments on online news stories , 2009, CIKM.

[19]  Suhuan Sun,et al.  Polarity Words Distance-Weight Count For Opinion Analysis Of Online News Comments , 2011 .

[20]  Ajay Mehra The Development of Social Network Analysis: A Study in the Sociology of Science , 2005 .

[21]  Michelle Gumbrecht,et al.  Blogs as “Protected Space” , 2004 .