Generative models of online discussion threads: state of the art and research challenges

Online discussion in form of written comments is a core component of many social media platforms. It has attracted increasing attention from academia, mainly because theories from social sciences can be explored at an unprecedented scale. This interest has led to the development of statistical models which are able to characterize the dynamics of threaded online conversations.In this paper, we review research on statistical modeling of online discussions, in particular, we describe current generative models of the structure and growth of discussion threads. These are parametrized network formation models that are able to generate synthetic discussion threads that reproduce certain features of the real discussions present in different online platforms. We aim to provide a clear overview of the state of the art and to motivate future work in this relevant research field.

[1]  David C. DeAndrea,et al.  The Influence of Online Comments on Perceptions of Antimarijuana Public Service Announcements on YouTube , 2010 .

[2]  Emilio Ferrara,et al.  Measuring Emotional Contagion in Social Media , 2015, PloS one.

[3]  Shyhtsun Felix Wu,et al.  Measuring message propagation and social influence on Twitter.com , 2013, Int. J. Commun. Networks Distributed Syst..

[4]  Christian Borgs,et al.  Emergence of tempered preferential attachment from optimization , 2007, Proceedings of the National Academy of Sciences.

[5]  Ravi Kumar,et al.  Dynamics of conversations , 2010, KDD.

[6]  David Laniado,et al.  Emotions under Discussion: Gender, Status and Communication in Online Collaboration , 2014, PloS one.

[7]  Mike Thelwall,et al.  Homophily in MySpace , 2009, J. Assoc. Inf. Sci. Technol..

[8]  N. Christakis,et al.  Dynamic spread of happiness in a large social network: longitudinal analysis over 20 years in the Framingham Heart Study , 2008, BMJ : British Medical Journal.

[9]  R. Bagozzi,et al.  A Social Influence Model of Consumer Participation in Network- and Small-Group-Based Virtual Communities , 2004 .

[10]  Arkaitz Zubiaga,et al.  Analysing How People Orient to and Spread Rumours in Social Media by Looking at Conversational Threads , 2015, PloS one.

[11]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[12]  Mark S. Granovetter Threshold Models of Collective Behavior , 1978, American Journal of Sociology.

[13]  Ke Xu,et al.  Higher contagion and weaker ties mean anger spreads faster than joy in social media , 2016, 1608.03656.

[14]  Karolin Kappler,et al.  Communication dynamics in twitter during political campaigns: The case of the 2011 Spanish national election , 2013 .

[15]  Michael Mitzenmacher,et al.  A Brief History of Generative Models for Power Law and Lognormal Distributions , 2004, Internet Math..

[16]  Jan-Erik Lönnqvist,et al.  Homogeneity of personal values and personality traits in Facebook social networks , 2016 .

[17]  H. Simon,et al.  ON A CLASS OF SKEW DISTRIBUTION FUNCTIONS , 1955 .

[18]  Michael S. Bernstein,et al.  Anyone Can Become a Troll: Causes of Trolling Behavior in Online Discussions , 2017, CSCW.

[19]  Omprakash Gnawali,et al.  Language independent analysis and classification of discussion threads in Coursera MOOC forums , 2014, Proceedings of the 2014 IEEE 15th International Conference on Information Reuse and Integration (IEEE IRI 2014).

[20]  A. Pentland,et al.  Life in the network: The coming age of computational social science: Science , 2009 .

[21]  Nicholas A. Christakis,et al.  Social contagion theory: examining dynamic social networks and human behavior , 2011, Statistics in medicine.

[22]  Karolin Kappler,et al.  Gender homophily in online dyadic and triadic relationships , 2016, EPJ Data Science.

[23]  M. de Rijke,et al.  Predicting the volume of comments on online news stories , 2009, CIKM.

[24]  Vicenç Gómez,et al.  Modeling the structure and evolution of discussion cascades , 2010, HT '11.

[25]  Daantje Derks,et al.  The role of emotion in computer-mediated communication: A review , 2008, Comput. Hum. Behav..

[26]  Tad Hogg,et al.  Using a model of social dynamics to predict popularity of news , 2010, WWW '10.

[27]  Lada A. Adamic,et al.  The political blogosphere and the 2004 U.S. election: divided they blog , 2005, LinkKDD '05.

[28]  Arvid Kappas,et al.  Collective Emotions Online and Their Influence on Community Life , 2011, PloS one.

[29]  P. Lazarsfeld,et al.  Friendship as Social process: a substantive and methodological analysis , 1964 .

[30]  Yana Volkovich,et al.  When the Wikipedians Talk: Network and Tree Structure of Wikipedia Discussion Pages , 2011, ICWSM.

[31]  Karolin Kappler,et al.  Jointly They Edit: Examining the Impact of Community Identification on Political Interaction in Wikipedia , 2012, PloS one.

[32]  Frank Schweitzer,et al.  Online privacy as a collective phenomenon , 2014, COSN '14.

[33]  Jure Leskovec,et al.  Antisocial Behavior in Online Discussion Communities , 2015, ICWSM.

[34]  Behram F. T. Mistree,et al.  Gaydar: Facebook Friendships Expose Sexual Orientation , 2009, First Monday.

[35]  Arvid Kappas,et al.  Social regulation of emotion: messy layers , 2013, Front. Psychology.

[36]  Jon M. Kleinberg,et al.  Characterizing and curating conversation threads: expansion, focus, volume, re-entry , 2013, WSDM.

[37]  Jeffrey T. Hancock,et al.  Experimental evidence of massive-scale emotional contagion through social networks , 2014, Proceedings of the National Academy of Sciences.

[38]  Frank Schweitzer,et al.  Ideological and Temporal Components of Network Polarization in Online Political Participatory Media , 2015, ArXiv.

[39]  Daryl J. Daley,et al.  An Introduction to the Theory of Point Processes , 2013 .

[40]  Vicenç Gómez,et al.  To Thread or Not to Thread: The Impact of Conversation Threading on Online Discussion , 2017, ICWSM.

[41]  Alberto Lumbreras,et al.  Automatic role detection in online forums , 2016 .

[42]  Arvid Kappas,et al.  The dynamics of emotions in online interaction , 2016, Royal Society Open Science.

[43]  Michael Stefanone,et al.  Social information sharing in a CSCL community , 2002, CSCL.

[44]  John T. Cacioppo,et al.  Emotional Contagion by Elaine Hatfield , 1993 .

[45]  D. Watts,et al.  Influentials, Networks, and Public Opinion Formation , 2007 .

[46]  John W. Emerson,et al.  Nonparametric Goodness-of-Fit Tests for Discrete Null Distributions , 2011, R J..

[47]  David A. Huffaker,et al.  Dimensions of leadership and social influence in online communities , 2010 .

[48]  Vicenç Gómez,et al.  Detecting Platform Effects in Online Discussions , 2017 .

[49]  H. Kelman Compliance, identification, and internalization three processes of attitude change , 1958 .

[50]  Peng Zhang,et al.  Characterizing and Modeling the Dynamics of Activity and Popularity , 2013, PloS one.

[51]  Vicenç Gómez,et al.  A likelihood-based framework for the analysis of discussion threads , 2012, World Wide Web.

[52]  Lada A. Adamic,et al.  Exposure to ideologically diverse news and opinion on Facebook , 2015, Science.

[53]  Le Song,et al.  Variational Policy for Guiding Point Processes , 2017, ICML.

[54]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.

[55]  Johan Bollen,et al.  Happiness Is Assortative in Online Social Networks , 2011, Artificial Life.

[56]  Julien Brailly,et al.  Exponential Random Graph Models for Social Networks , 2014 .

[57]  Jiawei Han,et al.  An exploration of discussion threads in social news sites: A case study of the Reddit community , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[58]  Vicenç Gómez,et al.  Action selection in growing state spaces: Control of Network Structure Growth , 2016, ArXiv.

[59]  David Laniado,et al.  Co-authorship 2.0: patterns of collaboration in Wikipedia , 2011, HT '11.

[60]  N. Smirnov Table for Estimating the Goodness of Fit of Empirical Distributions , 1948 .

[61]  Philip Fei Wu,et al.  Online Community Response to Major Disaster: A Study of Tianya Forum in the 2008 Sichuan Earthquake , 2009 .

[62]  M. Browne,et al.  Alternative Ways of Assessing Model Fit , 1992 .

[63]  F. Schweitzer,et al.  Emotional persistence in online chatting communities , 2012, Scientific Reports.

[64]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[65]  Lada A. Adamic,et al.  The Anatomy of Large Facebook Cascades , 2013, ICWSM.

[66]  Krishna P. Gummadi,et al.  A measurement-driven analysis of information propagation in the flickr social network , 2009, WWW '09.

[67]  Christopher M. Danforth,et al.  Twitter reciprocal reply networks exhibit assortativity with respect to happiness , 2011, J. Comput. Sci..

[68]  Chin-Lung Hsu,et al.  Acceptance of blog usage: The roles of technology acceptance, social influence and knowledge sharing motivation , 2008, Inf. Manag..

[69]  R. M. Alvarez Birds of the Same Feather Tweet Together: Bayesian Ideal Point Estimation Using Twitter Data , 2014 .

[70]  M. McPherson,et al.  Birds of a Feather: Homophily in Social Networks , 2001 .

[71]  Ken-ichi Kawarabayashi,et al.  Reply trees in Twitter: data analysis and branching process models , 2016, Social Network Analysis and Mining.

[72]  Charles Anderson,et al.  The end of theory: The data deluge makes the scientific method obsolete , 2008 .

[73]  Eli Pariser,et al.  The Filter Bubble: What the Internet Is Hiding from You , 2011 .

[74]  E. David,et al.  Networks, Crowds, and Markets: Reasoning about a Highly Connected World , 2010 .

[75]  Arun Sundararajan,et al.  Distinguishing influence-based contagion from homophily-driven diffusion in dynamic networks , 2009, Proceedings of the National Academy of Sciences.

[76]  Frank Schweitzer,et al.  Emotional Divergence Influences Information Spreading in Twitter , 2012, ICWSM.

[77]  Hamid R. Rabiee,et al.  RedQueen: An Online Algorithm for Smart Broadcasting in Social Networks , 2016, WSDM.

[78]  Frank Schweitzer,et al.  An agent-based model of collective emotions in online communities , 2010, ArXiv.

[79]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[80]  M. Newman,et al.  Mixing patterns in networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[81]  M. Deutsch,et al.  A study of normative and informational social influences upon individual judgement. , 1955, Journal of abnormal psychology.

[82]  F. Galton,et al.  On the Probability of the Extinction of Families , 1875 .

[83]  Peter Sprent,et al.  Data Driven Statistical Methods , 1997 .

[84]  Matthew Richardson,et al.  Yes, there is a correlation: - from social networks to personal behavior on the web , 2008, WWW.

[85]  Mao Ye,et al.  From user comments to on-line conversations , 2012, KDD.

[86]  Arnim Bleier,et al.  When Politicians Talk: Assessing Online Conversational Practices of Political Parties on Twitter , 2014, ICWSM.

[87]  Costa Alceu Ferraz,et al.  Vote-and-Comment: Modeling the Coevolution of User Interactions in Social Voting Web Sites , 2016 .

[88]  Tim Berners-Lee,et al.  Information Management: A Proposal , 1990 .

[89]  Mike Thelwall,et al.  Negative emotions boost user activity at BBC forum , 2010, 1011.5459.

[90]  AN Kolmogorov-Smirnov,et al.  Sulla determinazione empírica di uma legge di distribuzione , 1933 .

[91]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[92]  Edoardo M. Airoldi,et al.  A Survey of Statistical Network Models , 2009, Found. Trends Mach. Learn..

[93]  C. Sunstein Republic.com 2.0 , 2007 .

[94]  Guido Caldarelli,et al.  Opinion dynamics on interacting networks: media competition and social influence , 2014, Scientific Reports.

[95]  Jacob Ratkiewicz,et al.  Political Polarization on Twitter , 2011, ICWSM.

[96]  Clayton Fink,et al.  Complex contagions and the diffusion of popular Twitter hashtags in Nigeria , 2015, Social Network Analysis and Mining.

[97]  Adam D. I. Kramer,et al.  Detecting Emotional Contagion in Massive Social Networks , 2014, PloS one.

[98]  Yamir Moreno,et al.  Sentiment cascades in the 15M movement , 2015, EPJ Data Science.

[99]  Marika Carrieri,et al.  Gender Differences in Sleep Deprivation Effects on Risk and Inequality Aversion: Evidence from an Economic Experiment , 2015, PloS one.

[100]  Rui Fan,et al.  Anger Is More Influential than Joy: Sentiment Correlation in Weibo , 2013, PloS one.

[101]  Kristina Lerman,et al.  Analysis of social voting patterns on digg , 2008, WOSN '08.

[102]  Jimeng Sun,et al.  Social action tracking via noise tolerant time-varying factor graphs , 2010, KDD.

[103]  Antonio Lima,et al.  Personalized routing for multitudes in smart cities , 2015, EPJ Data Science.