Web Media and Stock Markets: A Survey and Future Directions from a Big Data Perspective

Stock market volatility is influenced by information release, dissemination, and public acceptance. As the volume and speed of social media grow, the effects of Web information on stock markets are becoming increasingly salient. However, studies of these effects lack both depth and breadth because of the challenges of automatically acquiring and analyzing massive amounts of relevant information. In this study, we systematically reviewed 229 research articles on quantifying the interplay between Web media and stock markets from the fields of Finance, Management Information Systems, and Computer Science. We first categorized the representative works by media type and then summarized the core techniques for converting textual information into machine-friendly forms. Finally, we compared the analysis models used to capture the hidden relationships between Web media and stock movements. Our goal is to clarify current cutting-edge research and its possible future directions, in order to fully understand the mechanisms of Web information percolation and its impact on stock markets from the perspectives of investors' cognitive behaviors, corporate governance, and stock market regulation.
