Predicting long-term returns of individual stocks with online reviews

Abstract Predicting long-term returns is essential for getting a full view of market efficiency. It is, additionally, more challenging for both human and algorithms, especially at the level of individual stocks, and competent solutions are still missing in previous effort. Considering the profound connections between stock prices and consumer opinions, asking experienced buyers can be a promising direction. In this paper, which is based exclusively on online reviews, a framework for predicting long-term returns of individual stocks is established. Specifically, 6,246 features of 13 categories inferred from more than 18 million product reviews are elaborated to build the prediction model. A satisfactory increase in accuracy, 13.94%, was achieved compared to the cutting-edge solution with 10 technical indicators being features. Additionally, we conduct a trading simulation on real-world stocks and our model obtains the highest return 13.12% among all baselines in half year. The robustness of the model is further evaluated and testified. Our approaches, in both feature inference and prediction model, will offer insightful advices to investors in long-term investments on individual stocks.

[1]  Ke Xu,et al.  Tales of emotion and stock in China: volatility, causality and prediction , 2017, World Wide Web.

[2]  Gerard J. Tellis,et al.  Does Chatter Really Matter? Dynamics of User-Generated Content and Stock Performance , 2011, Mark. Sci..

[3]  Yong Liu Word-of-Mouth for Movies: Its Dynamics and Impact on Box Office Revenue , 2006 .

[4]  Steve Y. Yang,et al.  Stock portfolio selection using learning-to-rank algorithms with news sentiment , 2017, Neurocomputing.

[5]  S. Sénécal,et al.  The influence of online product recommendations on consumers' online choices , 2004 .

[6]  Stephen J. Hoch,et al.  Managing What Consumers Learn from Experience , 1989 .

[7]  Brian T. Parker,et al.  A comparison of brand personality and brand user‐imagery congruence , 2009 .

[8]  Huan Liu,et al.  Feature selection for classification: A review , 2014 .

[9]  Edward C. Malthouse,et al.  How Mobile Shopping Affects Customer Purchase Behavior: A Retailer’s Perspective , 2016 .

[10]  Chih-Ming Hsu,et al.  A hybrid procedure for stock price prediction by integrating self-organizing map and genetic programming , 2011, Expert Syst. Appl..

[11]  X. Zhang,et al.  Impact of Online Consumer Reviews on Sales: The Moderating Role of Product and Consumer Characteristics , 2010 .

[12]  Jennifer Rowley,et al.  Mobile shopping behaviour: Insights into attitudes, shopping process involvement and location , 2013 .

[13]  Ling Liu,et al.  The effect of news and public mood on stock movements , 2014, Inf. Sci..

[14]  Pat Langley,et al.  Selection of Relevant Features and Examples in Machine Learning , 1997, Artif. Intell..

[15]  M. Urde Core value-based corporate brand building , 2003 .

[16]  M. J. Houston,et al.  Goal-Oriented Experiences and the Development of Knowledge , 1993 .

[17]  R. Thaler,et al.  Does the Stock Market Overreact , 1985 .

[18]  Neil A. Morgan,et al.  The Value of Different Customer Satisfaction and Loyalty Metrics in Predicting Business Performance , 2006 .

[19]  Dongsong Zhang,et al.  What Online Reviewer Behaviors Really Matter? Effects of Verbal and Nonverbal Behaviors on Detection of Fake Online Reviews , 2016, J. Manag. Inf. Syst..

[20]  Kyoung-jae Kim,et al.  Financial time series forecasting using support vector machines , 2003, Neurocomputing.

[21]  Yingjun Chen,et al.  Integrating principle component analysis and weighted support vector machine for stock trading signals prediction , 2018, Neurocomputing.

[22]  Xueming Luo,et al.  Quantifying the Long-Term Impact of Negative Word of Mouth on Cash Flows and Stock Prices , 2009, Mark. Sci..

[23]  Zhi Xiao,et al.  A multiple support vector machine approach to stock index forecasting with mixed frequency sampling , 2017, Knowl. Based Syst..

[24]  Milad Jasemi,et al.  A modern neural network model to do stock market timing on the basis of the ancient investment technique of Japanese Candlestick , 2011, Expert Syst. Appl..

[25]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[26]  T. Meenaghan The role of advertising in brand image development , 1995 .

[27]  Shan Lu,et al.  Aggregating multiple types of complex data in stock market prediction: A model-independent framework , 2018, Knowl. Based Syst..

[28]  Yong Liu,et al.  When do Third-Party Product Reviews Affect Firm Value and what can Firms Do? The Case of Media Critics and Professional Movie Reviews , 2012 .

[29]  E. Fama EFFICIENT CAPITAL MARKETS: A REVIEW OF THEORY AND EMPIRICAL WORK* , 1970 .

[30]  Tomer Geva,et al.  Empirical evaluation of an automated intraday stock recommendation system incorporating both market data and textual news , 2014, Decis. Support Syst..

[31]  G. Tellis,et al.  The Value of Quality: Stock Market Returns to Published Quality Reviews , 2007 .

[32]  R. Belk Possessions and the Extended Self , 1988 .

[33]  Forrest V. Morgeson,et al.  Stock Returns on Customer Satisfaction Do Beat the Market: Gauging the Effect of a Marketing Intangible , 2016 .

[34]  Muzhou Hou,et al.  Forecasting time series with optimal neural networks using multi-objective optimization algorithm based on AICc , 2018, Frontiers of Computer Science.

[35]  Yoram Singer,et al.  Improved Boosting Algorithms Using Confidence-rated Predictions , 1998, COLT' 98.

[36]  Qifa Xu,et al.  Does Google search index really help predicting stock market volatility? Evidence from a modified mixed data sampling model on volatility , 2019, Knowl. Based Syst..

[37]  Lorin M. Hitt,et al.  Self Selection and Information Role of Online Product Reviews , 2007, Inf. Syst. Res..

[38]  Jun Yang,et al.  Stock Market Autoregressive Dynamics: A Multinational Comparative Study with Quantile Regression , 2016 .

[39]  Yue Zhang,et al.  Using Structured Events to Predict Stock Price Movement: An Empirical Investigation , 2014, EMNLP.

[40]  Lukas Menkhoff The use of technical analysis by fund managers: International evidence , 2010 .

[41]  Chung-Ho Su,et al.  A hybrid fuzzy time series model based on ANFIS and integrated nonlinear feature selection method for forecasting stock , 2016, Neurocomputing.

[42]  Tie-Yan Liu,et al.  Listening to Chaotic Whispers: A Deep Learning Framework for News-oriented Stock Trend Prediction , 2017, WSDM.

[43]  Wen Long,et al.  Deep learning-based feature engineering for stock price movement prediction , 2019, Knowl. Based Syst..

[44]  Amaury Lendasse,et al.  Methodology for long-term prediction of time series , 2007, Neurocomputing.

[45]  Xiaolong Wang,et al.  A novel text mining approach to financial time series forecasting , 2012, Neurocomputing.

[46]  Silke Bambauer-Sachse,et al.  Brand equity dilution through negative online word-of-mouth communication , 2011 .

[47]  Jiekun Huang The Customer Knows Best: The Investment Value of Consumer Opinions , 2018 .

[48]  A. O'Cass,et al.  Examining service experiences and post-consumption evaluations , 2004 .

[49]  Wei Xu,et al.  Enhancing intraday stock price manipulation detection by leveraging recurrent neural networks with ensemble learning , 2019, Neurocomputing.

[50]  Joseph T. Plummer How Personality Makes a Difference , 2000, Journal of Advertising Research.

[51]  Zhi Da,et al.  In Search of Attention , 2009 .

[52]  Dirk Neumann,et al.  Automated news reading: Stock price prediction based on financial news using context-capturing features , 2013, Decis. Support Syst..

[53]  Yoav Freund,et al.  Automated trading with boosting and expert weighting , 2010 .

[54]  E. Anderson,et al.  Dual Emphasis and the Long-Term Financial Impact of Customer Satisfaction , 2005 .

[55]  M. Holbrook The Millennial Consumer in the Texts of Our Times: Experience and Entertainment , 2000 .

[56]  J. Friedman Stochastic gradient boosting , 2002 .

[57]  Stephen J. Hoch,et al.  Product Experience Is Seductive , 2002 .

[58]  Narasimhan Jegadeesh,et al.  Returns to Buying Winners and Selling Losers: Implications for Stock Market Efficiency , 1993 .

[59]  Jianping Zeng,et al.  Posterior probability model for stock return prediction based on analyst's recommendation behavior , 2013, Knowl. Based Syst..

[60]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[61]  S. Ross,et al.  Economic Forces and the Stock Market , 1986 .

[62]  Jie Jennifer Zhang,et al.  Social Media and Firm Equity Value , 2013, Inf. Syst. Res..

[63]  Xueming Luo,et al.  How Do Consumer Buzz and Traffic in Social Media Marketing Predict the Value of the Firm? , 2013 .

[64]  Ligang Zhou,et al.  Predicting the listing statuses of Chinese-listed companies using decision trees combined with an improved filter feature selection method , 2017, Knowl. Based Syst..

[65]  J. Malbon Taking Fake Online Consumer Reviews Seriously , 2013 .

[66]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.