Combining structure, content and meaning in online social networks: The analysis of public's early reaction in social media to newly launched movies

In this paper we present a methodology to assess moviegoers' early reactions to movies' premieres through the extraction of analytics from Twitter conversations that take place in the weekend in which a movie is released. We then apply data mining techniques to a sample of 22 movies to identify models able to predict box-office sales in the first weekend. We show that better predictions are obtained when traffic metrics are combined with social network or conversational indicators rather than with sentiment and that online sentiment achieves the lowest explanatory power among all the considered variables. Our findings confirm that the importance of commonly used buzz-metrics, such as sentiment, is probably overstated, and that conversational analytics can contribute significantly to explain the variance of box office revenues in the first week end of release. More broadly, our work adds to research on information diffusion in online networks by providing evidence that diffusion of messages is not content-neutral and that the analysis of conversational dynamics can help to understand the interplay between collective generation and diffusion of content in social networks as well as to obtain insights on whether information diffusion influences off-line behavior.

[1]  Jure Leskovec,et al.  The dynamics of viral marketing , 2005, EC '06.

[2]  Hinrich Schütze,et al.  Introduction to Information Retrieval: XML retrieval , 2008 .

[3]  John T. Stasko,et al.  Casual Information Visualization: Depictions of Data in Everyday Life , 2007, IEEE Transactions on Visualization and Computer Graphics.

[4]  Steven Skiena,et al.  Improving Movie Gross Prediction through News Analysis , 2009, 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology.

[5]  Michael S. Bernstein,et al.  Twitinfo: aggregating and visualizing microblogs for event exploration , 2011, CHI.

[6]  Jure Leskovec,et al.  Meme-tracking and the dynamics of the news cycle , 2009, KDD.

[7]  Herbert H. Clark,et al.  Grounding in communication , 1991, Perspectives on socially shared cognition.

[8]  Robin Cowan,et al.  The dynamics of collective invention , 2003 .

[9]  Michael Gertz,et al.  Mining email social networks , 2006, MSR '06.

[10]  Axel Bruns,et al.  More than a backchannel : Twitter and television , 2013 .

[11]  R. Burt Structural Holes and Good Ideas1 , 2004, American Journal of Sociology.

[12]  M. Sarvary,et al.  Network Effects and Personal Influences: The Diffusion of an Online Social Network , 2011 .

[13]  Wu He,et al.  International Journal of Information Management Social Media Competitive Analysis and Text Mining: a Case Study in the Pizza Industry , 2022 .

[14]  Herbert H. Clark,et al.  Contributing to Discourse , 1989, Cogn. Sci..

[15]  Harith Alani,et al.  Semantic Sentiment Analysis of Twitter , 2012, SEMWEB.

[16]  E. Rogers Diffusion of Innovations , 1962 .

[17]  Choochart Haruechaiyasak,et al.  Discovering Consumer Insight from Twitter via Sentiment Analysis , 2012, J. Univers. Comput. Sci..

[18]  Hisashi Arakaki,et al.  The ‘hit’ phenomenon: a mathematical model of human dynamics interactions as a stochastic process , 2012 .

[19]  Stanley Wasserman,et al.  Social Network Analysis: Social Network Data , 1994 .

[20]  John P. Robinson,et al.  Are Parents Investing Less in Children? Trends in Mothers’ and Fathers’ Time with Children1 , 2004, American Journal of Sociology.

[21]  Anna Sidorova,et al.  The Dual Micro/Macro Informing Role of Social Network Sites: Can Twitter Macro Messages Help Predict Stock Prices? , 2012 .

[22]  Babajide Osatuyi,et al.  Information sharing on social media sites , 2013, Comput. Hum. Behav..

[23]  Anupam Joshi,et al.  Detection of Real-Time Intentions from Micro-blogs , 2013, MobiQuitous.

[24]  Eric D. Brown Will Twitter Make You a Better Investor? A Look at Sentiment, User Reputation and Their Effect on the Stock Market , 2012 .

[25]  M. Alvesson Understanding organizational culture , 2002 .

[26]  Stephen B. Seidman,et al.  Network structure and minimum degree , 1983 .

[27]  Vincent Larivière,et al.  Tweets as impact indicators: Examining the implications of automated “bot” accounts on Twitter , 2014, J. Assoc. Inf. Sci. Technol..

[28]  J. Perry-Smith Social Yet Creative: The Role Of Social Relationships In Facilitating Individual Creativity , 2006 .

[29]  Ray Reagans,et al.  Network Structure and Knowledge Transfer: The Effects of Cohesion and Range , 2003 .

[30]  J. Eliashberg,et al.  The Motion Picture Industry: Critical Issues in Practice, Current Research, and New Research Directions , 2006 .

[31]  Duncan J. Watts,et al.  Who says what to whom on twitter , 2011, WWW.

[32]  B. Allen A Stochastic Interactive Model for the Diffusion of Information , 1982 .

[33]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[34]  Bernardo A. Huberman,et al.  Predicting the Future with Social Media , 2010, Web Intelligence.

[35]  Brian J. McNely Backchannel persistence and collaborative meaning-making , 2009, SIGDOC '09.

[36]  M. Sheelagh T. Carpendale,et al.  A Visual Backchannel for Large-Scale Events , 2010, IEEE Transactions on Visualization and Computer Graphics.

[37]  Yi Feng,et al.  A dynamic theory of the transition to democracy , 2003 .

[38]  Matthew Michelson,et al.  Tweet Disambiguate Entities Retrieve Folksonomy SubTree Step 1 : Discover Categories Generate Topic Profile from SubTrees Step 2 : Discover Profile Topic Profile : “ English Football ” “ World Cup ” , 2010 .

[39]  Joshua Introne,et al.  Analyzing the flow of knowledge in computer mediated teams , 2013, CSCW.

[40]  Warren Sack,et al.  Conversation Map: An Interface for Very Large-Scale Conversations , 2000, J. Manag. Inf. Syst..

[41]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[42]  Yale Song,et al.  #FluxFlow: Visual Analysis of Anomalous Information Spreading on Social Media , 2014, IEEE Transactions on Visualization and Computer Graphics.

[43]  James Allan,et al.  Topic Detection and Tracking , 2002, The Information Retrieval Series.

[44]  Olivia Parr Rud,et al.  Business Intelligence Success Factors: Tools for Aligning Your Business in the Global Economy , 2009 .

[45]  Kazutoshi Sumiya,et al.  Towards better TV viewing rates: exploiting crowd's media life logs over Twitter for TV rating , 2011, ICUIMC '11.

[46]  Alessandro Vespignani Behind enemy lines , 2005 .

[47]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[48]  Jianfeng Guo,et al.  How Does Market Concern Derived from the Internet Affect Oil Prices? , 2013 .

[49]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[50]  Damon Centola An Experimental Study of Homophily in the Adoption of Health Behavior , 2011, Science.

[51]  Thomas Gottron,et al.  Searching microblogs: coping with sparsity and document quality , 2011, CIKM '11.

[52]  Huan Liu,et al.  Text Analytics in Social Media , 2012, Mining Text Data.

[53]  Jo Royle,et al.  The digital marketing skills gap: Developing a Digital Marketer Model for the communication industries , 2014, Int. J. Inf. Manag..

[54]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..