Movie Revenue Prediction Based on Purchase Intention Mining Using YouTube Trailer Reviews

Abstract The increase in acceptability and popularity of social media has made extracting information from the data generated on social media an emerging field of research. An important branch of this field is predicting future events using social media data. This paper is focused on predicting box-office revenue of a movie by mining people's intention to purchase a movie ticket, termed purchase intention, from trailer reviews. Movie revenue prediction is important due to risks involved in movie production despite the high cost involved in the production. Previous studies in this domain focus on the use of twitter data and IMDB reviews for the prediction of movies that have already been released. In this paper, we build a model for movie revenue prediction prior to the movie's release using YouTube trailer reviews. Our model consists of novel methods of calculating purchase intention, positive-to-negative sentiment ratio, and like-to-dislike ratio for movie revenue prediction. Our experimental results prove the superiority of our approach compared to three baseline approaches and achieved a relative absolute error of 29.65%.

[1]  Dai Yao,et al.  Seeking the support of the silent majority: are lurking users valuable to UGC platforms? , 2018, Journal of the Academy of Marketing Science.

[2]  Jinglan Zhang,et al.  Integration of Sentiment Analysis into Customer Relational Model: The Importance of Feature Ontology and Synonym , 2013, ICEEI 2013.

[3]  Ramesh Sharda,et al.  Predicting box-office success of motion pictures with neural networks , 2006 .

[4]  Sameena Shah,et al.  Learning Stock Market Sentiment Lexicon and Sentiment-Oriented Word Vector from StockTwits , 2017, CoNLL.

[5]  Li Zhang,et al.  Forecasting box office revenue of movies with BP neural network , 2009, Expert Syst. Appl..

[6]  Kyung Jae Lee,et al.  Bayesian belief network for box-office performance: A case study on Korean movies , 2009, Expert Syst. Appl..

[7]  Shanzhi Chen,et al.  An inside look into the complexity of box-office revenue prediction in China , 2017, Int. J. Distributed Sens. Networks.

[8]  Azuraliza Abu Bakar,et al.  Hybrid N-gram model using Naïve Bayes for classification of political sentiments on Twitter , 2019, Neural Computing and Applications.

[9]  Anirban Dutta,et al.  Identifying the causal relationship between social media content of a Bollywood movie and its box-office success - a text mining approach , 2017, Int. J. Bus. Inf. Syst..

[10]  Kang Zhao,et al.  Early Prediction of Movie Success - What, Who, and When , 2015, SBP.

[11]  Salwani Abdullah,et al.  Arabic senti-lexicon: Constructing publicly available language resources for Arabic sentiment analysis , 2018, J. Inf. Sci..

[12]  Ross Maciejewski,et al.  Business Intelligence from Social Media: A Study from the VAST Box Office Challenge , 2014, IEEE Computer Graphics and Applications.

[13]  Salwani Abdullah,et al.  Approaches to Cross-Domain Sentiment Analysis: A Systematic Literature Review , 2017, IEEE Access.

[14]  Bernardo A. Huberman,et al.  Predicting the Future with Social Media , 2010, Web Intelligence.

[15]  Brian O'Neill,et al.  A lexical database for public textual cyberbullying detection , 2017 .

[16]  Jingfei Du,et al.  Box office prediction based on microblog , 2014, Expert Syst. Appl..

[17]  Kim Schouten,et al.  An Information Gain-Driven Feature Study for Aspect-Based Sentiment Analysis , 2016, NLDB.

[18]  Ting Liu,et al.  Predicting movie Box-office revenues by exploiting large-scale social media content , 2014, Multimedia Tools and Applications.

[19]  Zhang Yi,et al.  Predicting movie box-office revenues using deep neural networks , 2019, Neural Computing and Applications.

[20]  Mohammad Pourhomayoun,et al.  Predicting Movie Market Revenue Using Social Media Data , 2017, 2017 IEEE International Conference on Information Reuse and Integration (IRI).

[21]  Barry Litman Predicting Success of Theatrical Movies: An Empirical Study , 1983 .

[22]  Joshua Green,et al.  YouTube: Online Video and Participatory Culture , 2009 .

[23]  Luca Iandoli,et al.  Combining structure, content and meaning in online social networks: The analysis of public's early reaction in social media to newly launched movies , 2016 .

[24]  Ting Liu,et al.  A Gaussian copula regression model for movie box-office revenues prediction , 2015, Science China Information Sciences.

[25]  Brian Moon,et al.  Pre-production forecasting of movie revenues with a dynamic artificial neural network , 2015, Expert Syst. Appl..

[26]  Jianping Chai,et al.  An effective daily box office prediction model based on deep neural networks , 2018, Cognitive Systems Research.

[27]  Mohd Ridzwan Yaakub,et al.  Metaheuristic algorithms for feature selection in sentiment analysis , 2015, 2015 Science and Information Conference (SAI).

[28]  Amri Napolitano,et al.  A comparative study of iterative and non-iterative feature selection techniques for software defect prediction , 2013, Information Systems Frontiers.

[29]  Ickjai Lee,et al.  Extracting features with medical sentiment lexicon and position encoding for drug reviews , 2019, Health Information Science and Systems.

[30]  Manabu Okumura,et al.  Social Media Mining , 2013 .

[31]  Ting Liu,et al.  Domain Adaptation via Tree Kernel Based Maximum Mean Discrepancy for User Consumption Intention Identification , 2018, IJCAI.

[32]  Chandan Dasgupta,et al.  Using Twitter data to predict the performance of Bollywood movies , 2015, Ind. Manag. Data Syst..

[33]  Namita Mittal,et al.  Optimal Feature Selection for Sentiment Analysis , 2013, CICLing.

[34]  Charles R. Plott,et al.  Two information aggregation mechanisms for predicting the opening weekend box office revenues of films: Boxoffice Prophecy and Guess of Guesses , 2015 .

[35]  Prashant Rajput,et al.  Box Office Revenue Prediction Using Dual Sentiment Analysis , 2017 .