CADEN: A Context-Aware Deep Embedding Network for Financial Opinions Mining

Following the recent advances of artificial intelligence, financial text mining has gained new potential to benefit theoretical research with practice impacts. An essential research question for financial text mining is how to accurately identify the actual financial opinions (e.g., bullish or bearish) behind words in plain text. Traditional methods mainly consider this task as a text classification problem with solutions based on machine learning algorithms. However, most of them rely heavily on the hand-crafted features extracted from the text. Indeed, a critical issue along this line is that the latent global and local contexts of the financial opinions usually cannot be fully captured. To this end, we propose a context-aware deep embedding network for financial text mining, named CADEN, by jointly encoding the global and local contextual information. Especially, we capture and include an attitude-aware user embedding to enhance the performance of our model. We validate our method with extensive experiments based on a real-world dataset and several state-of-the-art baselines for investor sentiment recognition. Our results show a consistently superior performance of our approach for identifying the financial opinions from texts of different formats.

[1]  Yoshua Bengio,et al.  Hierarchical Probabilistic Neural Network Language Model , 2005, AISTATS.

[2]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[3]  Brendan T. O'Connor,et al.  Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments , 2010, ACL.

[4]  Jinlin Chen,et al.  Perception-oriented online news extraction , 2008, JCDL '08.

[5]  Björn Gambäck,et al.  Negation Scope Detection for Twitter Sentiment Analysis , 2015, WASSA@EMNLP.

[6]  Ming Zhou,et al.  Building Large-Scale Twitter-Specific Sentiment Lexicon : A Representation Learning Approach , 2014, COLING.

[7]  Yue Zhang,et al.  Don’t Count, Predict! An Automatic Approach to Learning Sentiment Lexicons for Short Text , 2016, ACL.

[8]  Yue Zhang,et al.  Context-Sensitive Lexicon Features for Neural Sentiment Analysis , 2016, EMNLP.

[9]  Wenpeng Yin,et al.  Multichannel Variable-Size Convolution for Sentence Classification , 2015, CoNLL.

[10]  Ming Zhou,et al.  Learning Sentiment-Specific Word Embedding for Twitter Sentiment Classification , 2014, ACL.

[11]  Yue Zhang,et al.  Tweet Sarcasm Detection Using Deep Neural Network , 2016, COLING.

[12]  Noah A. Smith,et al.  Contrastive Estimation: Training Log-Linear Models on Unlabeled Data , 2005, ACL.

[13]  Ellen Riloff,et al.  Sarcasm as Contrast between a Positive Sentiment and Negative Situation , 2013, EMNLP.

[14]  Philip S. Yu,et al.  BL-MNE: Emerging Heterogeneous Social Network Embedding Through Broad Learning with Aligned Autoencoder , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[15]  Hui Xiong,et al.  To be or Not to be Friends: Exploiting Social Ties for Venture Investments , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[16]  Jun Hong,et al.  Sarcasm Detection on Czech and English Twitter , 2014, COLING.

[17]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[18]  Maite Taboada,et al.  Lexicon-Based Methods for Sentiment Analysis , 2011, CL.

[19]  Enhong Chen,et al.  Mobile App Classification with Enriched Contextual Information , 2014, IEEE Transactions on Mobile Computing.

[20]  Gang Wang,et al.  Crowds on Wall Street: Extracting Value from Collaborative Investing Platforms , 2015, CSCW.

[21]  Lian Duan,et al.  Paradoxical Correlation Pattern Mining , 2018, IEEE Transactions on Knowledge and Data Engineering.

[22]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[23]  Xiaoyan Zhu,et al.  Linguistically Regularized LSTM for Sentiment Classification , 2016, ACL.

[24]  Diyi Yang,et al.  Hierarchical Attention Networks for Document Classification , 2016, NAACL.

[25]  Dong-Hong Ji,et al.  A topic-enhanced word embedding for Twitter sentiment classification , 2016, Inf. Sci..

[26]  Yoram Singer,et al.  Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[27]  Paulo Cortez,et al.  The impact of microblogging data for stock market prediction: Using Twitter to predict returns, volatility, trading volume and survey sentiment indices , 2017 .

[28]  Song Wu,et al.  A Double-Layer Neural Network Framework for High-Frequency Forecasting , 2017, ACM Trans. Manag. Inf. Syst..

[29]  Saif Mohammad,et al.  Sentiment Analysis of Short Informal Texts , 2014, J. Artif. Intell. Res..

[30]  Qi Liu,et al.  Modeling Social Attention for Stock Analysis: An Influence Propagation Perspective , 2015, 2015 IEEE International Conference on Data Mining.

[31]  Yue Zhang,et al.  Context-Sensitive Twitter Sentiment Classification Using Neural Network , 2016, AAAI.

[32]  Enhong Chen,et al.  Mining Mobile User Preferences for Personalized Context-Aware Recommendation , 2014, ACM Trans. Intell. Syst. Technol..

[33]  Yue Zhang,et al.  Target-Dependent Twitter Sentiment Classification with Rich Automatic Features , 2015, IJCAI.

[34]  Qi Liu,et al.  Forecasting price shocks with social attention and sentiment analysis , 2016, 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[35]  Sameena Shah,et al.  Learning Stock Market Sentiment Lexicon and Sentiment-Oriented Word Vector from StockTwits , 2017, CoNLL.

[36]  Jeffrey Pennington,et al.  Semi-Supervised Recursive Autoencoders for Predicting Sentiment Distributions , 2011, EMNLP.

[37]  Hui Xiong,et al.  Price Shock Detection With an Influence-Based Model of Social Attention , 2017, ACM Trans. Manag. Inf. Syst..

[38]  Zhiyuan Liu,et al.  Neural Sentiment Classification with User and Product Attention , 2016, EMNLP.

[39]  David Bamman,et al.  Contextualized Sarcasm Detection on Twitter , 2015, ICWSM.

[40]  André Freitas,et al.  SemEval-2017 Task 5: Fine-Grained Sentiment Analysis on Financial Microblogs and News , 2017, *SEMEVAL.

[41]  Yanjie Fu,et al.  Fake News Detection with Deep Diffusive Network Model , 2018, ArXiv.