Financial Keyword Expansion via Continuous Word Vector Representations

This paper proposes to apply the continuous vector representations of words for discovering keywords from a financial sentiment lexicon. In order to capture more keywords, we also incorporate syntactic information into the Continuous Bag-ofWords (CBOW) model. Experimental results on a task of financial risk prediction using the discovered keywords demonstrate that the proposed approach is good at predicting financial risk.

[1]  Yoshua Bengio,et al.  A Neural Probabilistic Language Model , 2003, J. Mach. Learn. Res..

[2]  Jason Weston,et al.  A unified architecture for natural language processing: deep neural networks with multitask learning , 2008, ICML '08.

[3]  Lukás Burget,et al.  Recurrent neural network based language model , 2010, INTERSPEECH.

[4]  Lee-Ing Tong,et al.  Forecasting time series using a methodology based on autoregressive integrated moving average and genetic programming , 2011, Knowl. Based Syst..

[5]  Jerome L. Myers,et al.  Research Design and Statistical Analysis , 1991 .

[6]  F. Diebold,et al.  How Relevant is Volatility Forecasting for Financial Risk Management? , 1997 .

[7]  Jochen L. Leidner,et al.  Hunting for the Black Swan: Risk Mining from Text , 2010, ACL.

[8]  Yoshua Bengio,et al.  Domain Adaptation for Large-Scale Sentiment Classification: A Deep Learning Approach , 2011, ICML.

[9]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[10]  Holger Schwenk,et al.  Continuous space language models , 2007, Comput. Speech Lang..

[11]  Fikret S. Gürgen,et al.  A comparison of global, recurrent and smoothed-piecewise neural models for Istanbul stock exchange (ISE) prediction , 2005, Pattern Recognit. Lett..

[12]  Desheng Dash Wu,et al.  Business intelligence in risk management: Some recent progresses , 2014, Inf. Sci..

[13]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[14]  M. Kendall A NEW MEASURE OF RANK CORRELATION , 1938 .

[15]  Anthony Widjaja,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2003, IEEE Transactions on Neural Networks.

[16]  Noah A. Smith,et al.  Predicting Risk from Financial Reports with Regression , 2009, NAACL.

[17]  Thorsten Joachims,et al.  Training linear SVMs in linear time , 2006, KDD '06.

[18]  Jason Weston,et al.  WSABIE: Scaling Up to Large Vocabulary Image Annotation , 2011, IJCAI.

[19]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.

[20]  Chuan-Ju Wang,et al.  Risk Ranking from Financial Reports , 2013, ECIR.

[21]  Khurshid Ahmad,et al.  Sentiment Polarity Identification in Financial News: A Cohesion-based Approach , 2007, ACL.

[22]  Tim Loughran,et al.  When is a Liability not a Liability? Textual Analysis, Dictionaries, and 10-Ks , 2010 .

[23]  Andrew Y. Ng,et al.  Parsing Natural Scenes and Natural Language with Recursive Neural Networks , 2011, ICML.

[24]  Eric R. Ziegel,et al.  Analysis of Financial Time Series , 2002, Technometrics.

[25]  F. Diebold,et al.  How Relevant is Volatility Forecasting for Financial Risk Management? , 1997, Review of Economics and Statistics.

[26]  Chuan-Ju Wang,et al.  Financial Sentiment Analysis for Risk Prediction , 2013, IJCNLP.