Stock Chatter: Using Stock Sentiment to Predict Price Direction

This paper examines a popular stock message board and finds slight daily predictability using supervised learning algorithms when combining daily sentiment with historical price information. Additionally, with the profit potential in trading stocks, it is of no surprise that a number of popular financial websites are attempting to capture investor sentiment by providing an aggregate of this negative and positive online emotion. We question if the existence of dishonest posters are capitalizing on the popularity of the boards by writing sentiment in line with their trading goals as a means of influencing others, and therefore undermining the purpose of the boards. We exclude these posters to determine if predictability increases, but find no discernible difference.

[1]  Munmun De Choudhury,et al.  Can blog communication dynamics be correlated with stock market activity? , 2008, Hypertext.

[2]  Panagiotis G. Ipeirotis,et al.  Quality management on Amazon Mechanical Turk , 2010, HCOMP '10.

[3]  P. Gloor,et al.  Predicting Stock Market Indicators Through Twitter “I hope it is not as bad as I fear” , 2011 .

[4]  Kennedy D. Gunawardana,et al.  PREDICTING STOCK PRICE PERFORMANCE: A NEURAL NETWORK APPROACH , 2007 .

[5]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[6]  P. Gloor,et al.  Predicting Asset Value through Twitter Buzz , 2012 .

[7]  Kin Keung Lai,et al.  A novel nonlinear ensemble forecasting model incorporating GLAR and ANN for foreign exchange rates , 2005, Comput. Oper. Res..

[8]  Werner Antweiler,et al.  Is All that Talk Just Noise? The Information Content of Internet Stock Message Boards , 2001 .

[9]  Tom Fawcett,et al.  ROC Graphs: Notes and Practical Considerations for Researchers , 2007 .

[10]  D. Hirshleifer,et al.  Sidelined Investors, Trading-Generated News, and Security Returns , 2001 .

[11]  Ron Kohavi,et al.  The Power of Decision Tables , 1995, ECML.

[12]  Arno J. Knobbe,et al.  Pattern Teams , 2006, PKDD.

[13]  Susan Craw,et al.  Genetic Algorithms for Feature Selection and Weighting , 1999 .

[14]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[15]  Maxim Gusev,et al.  Predictable Markets? A News-Driven Model of the Stock Market , 2014, Algorithmic Finance.

[16]  Peter Fankhauser,et al.  Identifying Users Across Social Tagging Systems , 2011, ICWSM.

[17]  C. Apte,et al.  Data mining with decision trees and decision rules , 1997, Future Gener. Comput. Syst..

[18]  Kenneth A. De Jong,et al.  Genetic algorithms as a tool for feature selection in machine learning , 1992, Proceedings Fourth International Conference on Tools with Artificial Intelligence TAI '92.

[19]  Chao Wang,et al.  Improving Stock Market Prediction by Integrating Both Market News and Stock Prices , 2011, DEXA.

[20]  Dirk Neumann,et al.  Early Warning of Impending Oil Crises Using the Predictive Power of Online News Stories , 2013, 2013 46th Hawaii International Conference on System Sciences.

[21]  Konstantinos I. Diamantaras,et al.  Market Sentiment and Exchange Rate Directional Forecasting , 2015, Algorithmic Finance.

[22]  Ronen Feldman,et al.  The Stock Sonar - Sentiment Analysis of Stocks Based on a Hybrid Approach , 2011, IAAI.

[23]  Eric D. Brown Will Twitter Make You a Better Investor? A Look at Sentiment, User Reputation and Their Effect on the Stock Market , 2012 .

[24]  Carolyn Penstein Rosé,et al.  Recovering Implicit Thread Structure in Newsgroup Style Conversations , 2021, ICWSM.

[25]  Mike Y. Chen,et al.  Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web , 2001 .

[26]  Jennifer Preece,et al.  Lurker demographics: counting the silent , 2000, CHI.

[27]  Isabell M. Welpe,et al.  Tweets and Trades: The Information Content of Stock Microblogs , 2010 .

[28]  J. Zittrain,et al.  Spam Works: Evidence from Stock Touts and Corresponding Market Activity , 2007 .

[29]  Yiming Yang,et al.  A re-examination of text categorization methods , 1999, SIGIR '99.

[30]  Damien Challet,et al.  Do Google Trend Data Contain More Predictability than Price Returns? , 2014 .

[31]  William Nick Street,et al.  An intelligent system for customer targeting: a data mining approach , 2004, Decis. Support Syst..

[32]  Ron Kohavi,et al.  Targeting Business Users with Decision Table Classifiers , 1998, KDD.

[33]  Bin Gu,et al.  Identifying Information in Stock Message Boards and Its Implications for Stock Market Efficiency , 2006 .

[34]  Arie Ben-David,et al.  About the relationship between ROC curves and Cohen's kappa , 2008, Eng. Appl. Artif. Intell..

[35]  R. Schapire The Strength of Weak Learnability , 1990, Machine Learning.

[36]  Michifumi Yoshioka,et al.  Sentiment Analysis of Stock Market News with Semi-supervised Learning , 2012, 2012 IEEE/ACIS 11th International Conference on Computer and Information Science.

[37]  P. DeMarzo,et al.  Persuasion Bias, Social Influence, and Uni-Dimensional Opinions , 2001 .

[38]  Hsinchun Chen,et al.  A quantitative stock prediction system based on financial news , 2009, Inf. Process. Manag..

[39]  Aristides Gionis,et al.  Correlating financial time series with micro-blogging activity , 2012, WSDM '12.

[40]  Gaurav Jain,et al.  An approach to text classification using dimensionality reduction and combination of classifiers , 2004, Proceedings of the 2004 IEEE International Conference on Information Reuse and Integration, 2004. IRI 2004..

[41]  Padmini Srinivasan,et al.  On the predictive ability of narrative disclosures in annual reports , 2010, Eur. J. Oper. Res..

[42]  Barbara Poblete,et al.  Information credibility on twitter , 2011, WWW.