Social Convos: A New Approach to Modeling Information Diffusion in Social Media

A common approach, adopted by most current research, represents users of a social media platform as nodes in a network, connected by various types of links indicating the different kinds of inter-user relationships and interactions. However, social media dynamics and the observed behavioral phenomena do not conform to this user-node-centric view, partly because it ignores the behavioral impact of connected user collectives. GitHub is unique in the social media setting in this respect: it is organized into “repositories”, which along with the users who contribute to them, form highly-interactive task-oriented “social collectives”. In this paper, we recast our understanding of all social media as a landscape of collectives, or “convos”: sets of users connected by a common interest in an (possibly evolving) information artifact, such as a repository in GitHub, a subreddit in Reddit or a group of hashtags in Twitter. We describe a computational approach to classifying convos at different stages of their “lifespan” into distinct collective behavioral classes. We then train a Multi-layer Perceptron (MLP) to learn transition probabilities between behavioral classes to predict, with high-degree of accuracy, future behavior and activity levels of these convos.

[1]  Yoshua Bengio,et al.  Deep Sparse Rectifier Neural Networks , 2011, AISTATS.

[2]  A. Agresti An introduction to categorical data analysis , 1997 .

[3]  Duncan J. Watts,et al.  Exploring Limits to Prediction in Complex Social Systems , 2016, WWW.

[4]  Frank Rosenblatt,et al.  PRINCIPLES OF NEURODYNAMICS. PERCEPTRONS AND THE THEORY OF BRAIN MECHANISMS , 1963 .

[5]  James D. Herbsleb,et al.  Influence of social and technical factors for evaluating contribution in GitHub , 2014, ICSE.

[6]  Tomek Strzalkowski,et al.  Modeling Sociocultural phenomena in discourse , 2012, Natural Language Engineering.

[7]  Jure Leskovec,et al.  Can cascades be predicted? , 2014, WWW.

[8]  Leland McInnes,et al.  Accelerated Hierarchical Density Based Clustering , 2017, 2017 IEEE International Conference on Data Mining Workshops (ICDMW).

[9]  N. L. Johnson,et al.  Discrete Multivariate Distributions , 1998 .

[10]  Tomek Strzalkowski,et al.  GitHub as a Social Network , 2018 .

[11]  Duncan J. Watts,et al.  Everyone's an influencer: quantifying influence on twitter , 2011, WSDM '11.

[12]  Alandeom W. Oliveira,et al.  Student leadership in small group science inquiry , 2014 .

[13]  Rob J Hyndman,et al.  Another look at measures of forecast accuracy , 2006 .

[14]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[15]  David A. Huffaker,et al.  Dimensions of leadership and social influence in online communities , 2010 .

[16]  Jake M. Hofman,et al.  Prediction and explanation in social systems , 2017, Science.

[17]  Premkumar T. Devanbu,et al.  Developer onboarding in GitHub: the role of prior social links and language experience , 2015, ESEC/SIGSOFT FSE.

[18]  Richard Hans Robert Hahnloser,et al.  Digital selection and analogue amplification coexist in a cortex-inspired silicon circuit , 2000, Nature.