Lexicon Creation for Financial Sentiment Analysis Using Network Embedding

This study aims to construct a polarity dictionary specialized for the analysis of financial policies. Based on the idea that polarity words are likely to lie within second-order proximity of one another in a dependency network, we propose an automatic dictionary construction method that uses second-order LINE (Large-scale Information Network Embedding), a network representation learning method, to quantify the relationships between words. The results suggest that a dictionary can be constructed from the distributed representations learned by LINE. We also confirm that these representations have properties different from those obtained with the CBOW (Continuous Bag-of-Words) model, and we analyze the differences between the two.
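
As a rough illustration of the embedding step, the following is a minimal sketch of second-order LINE trained with negative sampling on a tiny word network. It is not the authors' implementation. The toy edges, their direction (head word to dependent word), the uniform negative sampler (the LINE paper draws negatives in proportion to degree raised to the power 0.75), and all names such as `TOY_EDGES` and `train_line_second_order` are assumptions made for the example. Second-order LINE models the probability of a context vertex given a source vertex, so words that share neighbors are pushed toward similar vectors.

```python
import math
import random

# Hypothetical toy dependency edges: (head word, dependent word, weight).
# The words and weights are made up for illustration only.
TOY_EDGES = [
    ("rise", "profit", 3.0), ("rise", "revenue", 2.0),
    ("increase", "profit", 2.0), ("increase", "revenue", 3.0),
    ("fall", "loss", 3.0), ("fall", "debt", 2.0),
    ("decline", "loss", 2.0), ("decline", "debt", 3.0),
]


def sigmoid(x):
    # Clamp the input so math.exp cannot overflow.
    x = max(-30.0, min(30.0, x))
    return 1.0 / (1.0 + math.exp(-x))


def train_line_second_order(edges, dim=8, negatives=2, epochs=200, lr=0.05, seed=0):
    """Return {word: vertex embedding} learned from weighted directed edges."""
    rng = random.Random(seed)
    nodes = sorted({n for s, t, _w in edges for n in (s, t)})
    # Each word has a "vertex" vector (its own embedding) and a "context"
    # vector (its role as a neighbor of other words).
    vertex = {n: [rng.uniform(-0.5, 0.5) for _ in range(dim)] for n in nodes}
    context = {n: [0.0] * dim for n in nodes}
    weights = [w for _s, _t, w in edges]

    for _ in range(epochs * len(edges)):
        # Sample an edge with probability proportional to its weight.
        s, t, _w = rng.choices(edges, weights=weights)[0]
        # One positive target plus uniformly drawn negatives (simplification).
        samples = [(t, 1.0)]
        for _k in range(negatives):
            neg = rng.choice(nodes)
            if neg != t:
                samples.append((neg, 0.0))

        grad_s = [0.0] * dim
        for node, label in samples:
            score = sum(a * b for a, b in zip(vertex[s], context[node]))
            g = lr * (label - sigmoid(score))
            for d in range(dim):
                grad_s[d] += g * context[node][d]   # uses the pre-update context
                context[node][d] += g * vertex[s][d]
        for d in range(dim):
            vertex[s][d] += grad_s[d]
    return vertex


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    if nu == 0.0 or nv == 0.0:
        return 0.0
    return sum(a * b for a, b in zip(u, v)) / (nu * nv)


if __name__ == "__main__":
    emb = train_line_second_order(TOY_EDGES)
    print("rise vs increase:", cosine(emb["rise"], emb["increase"]))
    print("rise vs fall:", cosine(emb["rise"], emb["fall"]))
```

In a dictionary-construction setting, one plausible use of such vectors is to rank candidate words by their similarity to a small set of seed polarity words. How the paper actually selects dictionary entries is not stated in the abstract.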
