Sentiment analysis of financial news articles using performance indicators

Mining financial text documents and understanding the sentiments of individual investors, institutions and markets is an important and challenging problem in the literature. Current approaches to mine sentiments from financial texts largely rely on domain-specific dictionaries. However, dictionary-based methods often fail to accurately predict the polarity of financial texts. This paper aims to improve the state-of-the-art and introduces a novel sentiment analysis approach that employs the concept of financial and non-financial performance indicators. It presents an association rule mining-based hierarchical sentiment classifier model to predict the polarity of financial texts as positive, neutral or negative. The performance of the proposed model is evaluated on a benchmark financial dataset. The model is also compared against other state-of-the-art dictionary and machine learning-based approaches and the results are found to be quite promising. The novel use of performance indicators for financial sentiment analysis offers interesting and useful insights.

[1]  John Blitzer,et al.  Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification , 2007, ACL.

[2]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[3]  R. Kaplan,et al.  Linking the Balanced Scorecard to Strategy , 1996 .

[4]  Véronique Hoste,et al.  Fine-grained analysis of explicit and implicit sentiment in financial news articles , 2015, Expert Syst. Appl..

[5]  Sheung Yin Kevin Mo,et al.  News sentiment to market impact and its feedback effect , 2015, Environment Systems and Decisions.

[6]  Chih-Ping Wei,et al.  Understanding Online Consumer Review Opinions with Sentiment Analysis using Machine Learning , 2010, Pac. Asia J. Assoc. Inf. Syst..

[7]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[8]  Wynne Hsu,et al.  Integrating Classification and Association Rule Mining , 1998, KDD.

[9]  James R. Curran Proceedings of the COLING/ACL on Interactive presentation sessions , 2006 .

[10]  Hsinchun Chen,et al.  A Lexicon-Enhanced Method for Sentiment Classification: An Experiment on Online Product Reviews , 2010, IEEE Intelligent Systems.

[11]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[12]  Claire Cardie,et al.  Annotating Expressions of Opinions and Emotions in Language , 2005, Lang. Resour. Evaluation.

[13]  Sofus A. Macskassy,et al.  More than Words: Quantifying Language to Measure Firms' Fundamentals the Authors Are Grateful for Assiduous Research Assistance from Jie Cao and Shuming Liu. We Appreciate Helpful Comments From , 2007 .

[14]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[15]  Karo Moilanen Packed Feelings and Ordered Sentiments: Sentiment Parsing with Quasi−compositional Polarity Sequencing and Compression , 2010 .

[16]  Pekka Korhonen,et al.  Good debt or bad debt: Detecting semantic orientations in economic texts , 2013, J. Assoc. Inf. Sci. Technol..

[17]  Philip J. Stone,et al.  The general inquirer: A computer system for content analysis and retrieval based on the sentence as a unit of information , 2007 .

[18]  D. Larcker,et al.  Are nonfinancial measures leading indicators of financial performance? An analysis of customer satisfaction , 1998 .

[19]  Daniel Sánchez,et al.  ART: A Hybrid Classification Model , 2004, Machine Learning.

[20]  Dimitris Meretakis,et al.  Extending naïve Bayes classifiers using long itemsets , 1999, KDD '99.

[21]  Tim Loughran,et al.  Textual Analysis in Accounting and Finance: A Survey: TEXTUAL ANALYSIS IN ACCOUNTING AND FINANCE , 2016 .

[22]  Bill McDonald,et al.  The Use of Word Lists in Textual Analysis , 2015 .

[23]  Yuanxin Ouyang,et al.  Investigating association rules for sentiment classification of Web reviews , 2014, J. Intell. Fuzzy Syst..

[24]  Colm Kearney,et al.  Textual Sentiment in Finance: A Survey of Methods and Models , 2013 .

[25]  Mike Thelwall,et al.  Sentiment strength detection for the social web , 2012, J. Assoc. Inf. Sci. Technol..

[26]  Tim Loughran,et al.  When is a Liability not a Liability? Textual Analysis, Dictionaries, and 10-Ks , 2010 .

[27]  Bill McDonald,et al.  Textual Analysis in Accounting and Finance: A Survey , 2016 .

[28]  Li Chen,et al.  News impact on stock price return via sentiment analysis , 2014, Knowl. Based Syst..

[29]  Dennis Philip,et al.  Media Content and Stock Returns: The Predictive Power of Press , 2014 .

[30]  Jian Pei,et al.  CMAR: accurate and efficient classification based on multiple class-association rules , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[31]  Hsinchun Chen,et al.  A Tensor-Based Information Framework for Predicting the Stock Market , 2016, ACM Trans. Inf. Syst..

[32]  Erik Cambria,et al.  SenticNet 3: A Common and Common-Sense Knowledge Base for Cognition-Driven Sentiment Analysis , 2014, AAAI.

[33]  Feng Li The Information Content of Forward-Looking Statements in Corporate Filings—A Naïve Bayesian Machine Learning Approach , 2010 .

[34]  Hsinchun Chen,et al.  Textual analysis of stock market prediction using breaking financial news: The AZFin text system , 2009, TOIS.

[35]  Alan F. Smeaton,et al.  Topic-dependent sentiment analysis of financial blogs , 2009, TSA@CIKM.

[36]  Steven Bird,et al.  NLTK: The Natural Language Toolkit , 2002, ACL.

[37]  Paul C. Tetlock Giving Content to Investor Sentiment: The Role of Media in the Stock Market , 2005, The Journal of Finance.

[38]  Allen H. Huang,et al.  Evidence on the Information Content of Text in Analyst Reports , 2014 .

[39]  Mengchi Liu,et al.  Mining high utility itemsets without candidate generation , 2012, CIKM.

[40]  Ling Liu,et al.  The effect of news and public mood on stock movements , 2014, Inf. Sci..

[41]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[42]  Werner Antweiler,et al.  Is All that Talk Just Noise? The Information Content of Internet Stock Message Boards , 2001 .