Predicting Community Evolution in Social Networks

Nowadays, sustained development of different social media can be observed worldwide. One of the relevant research domains intensively explored recently is analysis of social communities existing in social media as well as prediction of their future evolution taking into account collected historical evolution chains. These evolution chains proposed in the paper contain group states in the previous time frames and its historical transitions that were identified using one out of two methods: Stable Group Changes Identification (SGCI) and Group Evolution Discovery (GED). Based on the observed evolution chains of various length, structural network features are extracted, validated and selected as well as used to learn classification models. The experimental studies were performed on three real datasets with different profile: DBLP, Facebook and Polish blogosphere. The process of group prediction was analysed with respect to different classifiers as well as various descriptive feature sets extracted from evolution chains of different length. The results revealed that, in general, the longer evolution chains the better predictive abilities of the classification models. However, chains of length 3 to 7 enabled the GED-based method to almost reach its maximum possible prediction quality. For SGCI, this value was at the level of 3–5 last periods.

[1]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[2]  Przemyslaw Kazienko,et al.  Identification of Group Changes in Blogosphere , 2012, 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining.

[3]  Przemyslaw Kazienko,et al.  Multi-label classification using error correcting output codes , 2012, Int. J. Appl. Math. Comput. Sci..

[4]  Przemyslaw Kazienko,et al.  Relational large scale multi-label classification method for video categorization , 2012, Multimedia Tools and Applications.

[5]  Nitesh V. Chawla,et al.  New perspectives and methods in link prediction , 2010, KDD.

[6]  Nagarajan Natarajan,et al.  Exploiting longer cycles for link prediction in signed networks , 2011, CIKM '11.

[7]  Malik Magdon-Ismail,et al.  Tracking and Predicting Evolution of Social Communities , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[8]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[9]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994 .

[10]  Przemyslaw Kazienko,et al.  Predicting Group Evolution in the Social Network , 2012, SocInfo.

[11]  Vladimir Batagelj,et al.  Centrality in Social Networks , 1993 .

[12]  David Liben-Nowell,et al.  The link-prediction problem for social networks , 2007 .

[13]  Jure Leskovec,et al.  Predicting positive and negative links in online social networks , 2010, WWW '10.

[14]  Przemyslaw Kazienko,et al.  Different approaches to community evolution prediction in blogosphere , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[15]  P. Bonacich Factoring and weighting approaches to status scores and clique identification , 1972 .

[16]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[17]  T. Vicsek,et al.  Clique percolation in random networks. , 2005, Physical review letters.

[18]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[19]  Lise Getoor,et al.  Using Friendship Ties and Family Circles for Link Prediction , 2008, SNAKDD.

[20]  Matjaz Perc,et al.  The Matthew effect in empirical data , 2014, Journal of The Royal Society Interface.

[21]  Krishna P. Gummadi,et al.  On the evolution of user interaction in Facebook , 2009, WOSN '09.

[22]  Christian Bauckhage,et al.  The slashdot zoo: mining a social network with negative edges , 2009, WWW.

[23]  Nitesh V. Chawla,et al.  Supervised methods for multi-relational link prediction , 2013, Social Network Analysis and Mining.

[24]  Jure Leskovec,et al.  The life and death of online groups: predicting group growth and longevity , 2012, WSDM '12.

[25]  Przemyslaw Kazienko,et al.  GED: the method for group evolution discovery in social networks , 2012, Social Network Analysis and Mining.

[26]  Michael Ley,et al.  The DBLP Computer Science Bibliography: Evolution, Research Issues, Perspectives , 2002, SPIRE.

[27]  Aleksander Byrski,et al.  Graphical analysis of social group dynamics , 2012, 2012 Fourth International Conference on Computational Aspects of Social Networks (CASoN).

[28]  Jie Gao,et al.  Predicting group stability in online social networks , 2013, WWW.

[29]  Osmar R. Zaïane,et al.  Community evolution prediction in dynamic social networks , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[30]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[31]  Xin Yao,et al.  A novel evolutionary data mining algorithm with applications to churn prediction , 2003, IEEE Trans. Evol. Comput..

[32]  Przemyslaw Kazienko,et al.  Key Person Analysis in Social Communities within the Blogosphere , 2012, J. Univers. Comput. Sci..

[33]  Yossi Richter,et al.  Predicting Customer Churn in Mobile Networks through Analysis of Social Groups , 2010, SDM.

[34]  Geoffrey J. McLachlan,et al.  Analyzing Microarray Gene Expression Data , 2004 .

[35]  Panagiotis Symeonidis,et al.  Transitive node similarity for link prediction in social networks with positive and negative links , 2010, RecSys '10.

[36]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.