Exploiting Social Media for Stock Market Prediction with Factorization Machine

When the stock market has become more and more competitive, the stock market prediction has been a hot research topic. Traditional methods are based on historical stock data, which ignore the latest market information. Later although financial news is proposed to access market information, there are some disadvantages for news to predict the stock market. Recently when the micro-blogging service has grown to a popular social media and provides a number of real-time messages for a lot of users, social media is proposed for the stock market prediction. In that case, the high-dimension of textual feature poses a main challenge. In this study, we propose a novel kind of model, Factorization Machine (FM), to predict the trend of stock market. FM not only alleviates the impact of high dimensionality, but also captures some aspects of basic linguistics. Experiments on real-world data show that FM can achieve 81% accuracy and get significantly more profit than state-of-the-art models. In addition, we shed light on how textual representation influences the prediction and find that FM is stable, which is applicable to other social media or prediction application generally.

[1]  Olivia Sheng,et al.  Investigating Predictive Power of Stock Micro Blog Sentiment in Forecasting Future Stock Price Directional Movement , 2011, ICIS.

[2]  Tommi S. Jaakkola,et al.  Weighted Low-Rank Approximations , 2003, ICML.

[3]  Gilad Mishne,et al.  A Study of Blog Search , 2006, ECIR.

[4]  Yehuda Koren,et al.  Factorization meets the neighborhood: a multifaceted collaborative filtering model , 2008, KDD.

[5]  Ling Liu,et al.  Stock Market Volatility Prediction: A Service-Oriented Multi-kernel Learning Approach , 2012, 2012 IEEE Ninth International Conference on Services Computing.

[6]  Pei-Chann Chang,et al.  A novel model by evolving partially connected neural network for stock price trend forecasting , 2012, Expert Syst. Appl..

[7]  Brendan T. O'Connor,et al.  From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series , 2010, ICWSM.

[8]  David M. Blei,et al.  Supervised Topic Models , 2007, NIPS.

[9]  Xiaohui Yu,et al.  ARSA: a sentiment-aware model for predicting sales performance using blogs , 2007, SIGIR.

[10]  Lars Schmidt-Thieme,et al.  Fast context-aware recommendations with factorization machines , 2011, SIGIR.

[11]  Steffen Rendle,et al.  Factorization Machines with libFM , 2012, TIST.

[12]  Eric Gilbert,et al.  Widespread Worry and the Stock Market , 2010, ICWSM.

[13]  Ganapati Panda,et al.  Development and performance evaluation of FLANN based model for forecasting of stock markets , 2009, Expert Syst. Appl..

[14]  Susan T. Dumais,et al.  Hierarchical classification of Web content , 2000, SIGIR '00.

[15]  Bernard J. Jansen,et al.  Twitter power: Tweets as electronic word of mouth , 2009, J. Assoc. Inf. Sci. Technol..

[16]  H. Varian,et al.  Predicting the Present with Google Trends , 2012 .

[17]  Babita Majhi,et al.  Stock Indices Prediction Using Radial Basis Function Neural Network , 2012, SEMCCO.

[18]  Hongjun Lu,et al.  The Predicting Power of Textual Information on Financial Markets , 2005, IEEE Intell. Informatics Bull..

[19]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[20]  Marc-André Mittermayer,et al.  Forecasting Intraday stock price trends with text mining techniques , 2004, 37th Annual Hawaii International Conference on System Sciences, 2004. Proceedings of the.

[21]  Chao Wu,et al.  Forecasting stock indices using radial basis function neural networks optimized by artificial fish swarm algorithm , 2011, Knowl. Based Syst..

[22]  Chao Wang,et al.  Improving Stock Market Prediction by Integrating Both Market News and Stock Prices , 2011, DEXA.

[23]  Mark P. Taylor,et al.  Permanent and Temporary Components of Stock Prices: Evidence from Assessing Macroeconomic Shocks , 2002 .

[24]  John R. Nofsinger Social Mood and Financial Economics , 2005 .

[25]  Bernardo A. Huberman,et al.  Predicting the Future with Social Media , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[26]  Brian D. Davison,et al.  Co-factorization machines: modeling user interests and predicting individual decisions in Twitter , 2013, WSDM.

[27]  Hsinchun Chen,et al.  Textual analysis of stock market prediction using breaking financial news: The AZFin text system , 2009, TOIS.

[28]  H. Varian,et al.  Predicting the Present with Google Trends , 2009 .

[29]  Lars Schmidt-Thieme,et al.  Pairwise interaction tensor factorization for personalized tag recommendation , 2010, WSDM '10.

[30]  E. Fama The Behavior of Stock-Market Prices , 1965 .

[31]  Karolin Baecker,et al.  A Random Walk Down Wall Street , 2016 .

[32]  Isabell M. Welpe,et al.  Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment , 2010, ICWSM.

[33]  Ramanathan V. Guha,et al.  The predictive power of online chatter , 2005, KDD '05.

[34]  Chi-Jie Lu,et al.  Combining nonlinear independent component analysis and neural network for the prediction of Asian stock market indexes , 2012, Expert Syst. Appl..

[35]  Gregoris Mentzas,et al.  Using Social Media to Predict Future Events with Agent-Based Markets , 2010, IEEE Intelligent Systems.

[36]  Feng Liang,et al.  Exploiting ranking factorization machines for microblog retrieval , 2013, CIKM.

[37]  James Allan,et al.  Language models for financial news recommendation , 2000, CIKM '00.

[38]  Khaled Rasheed,et al.  Stock market prediction with multiple classifiers , 2007, Applied Intelligence.

[39]  Ming-Chi Lee,et al.  Using support vector machine with a hybrid feature selection method to the stock trend prediction , 2009, Expert Syst. Appl..

[40]  Steffen Rendle,et al.  Factorization Machines , 2010, 2010 IEEE International Conference on Data Mining.