Enhancing Sentiment Analysis of Financial News by Detecting Negation Scopes

Sentiment analysis refers to the extraction of the polarity of source materials, such as financial news. However, measuring positive tone requires the correct classification of the parts of sentences that are negated, i.e., the negation scopes. For example, around 4.74% of all sentences in German ad hoc announcements contain negations. To predict the corresponding negation scope, related literature commonly utilizes two approaches, namely rule-based algorithms and machine learning. Nevertheless, a thorough comparison of the two is missing, especially for the sentiment analysis of financial news. To close this gap, this paper uses German ad hoc announcements as a common example of financial news in order to pursue a two-sided evaluation. First, we compare the predictive performance of both approaches using a manually labeled dataset. Second, we examine how detecting negation scopes can improve the accuracy of sentiment analysis. In this setting, rule-based algorithms produce superior results, improving the correlation between news sentiment and stock market returns by up to 9.80%.
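
To illustrate what a rule-based approach to negation scopes can look like, the following minimal Python sketch treats a fixed number of tokens after a negation cue as negated and inverts their polarity. It is not the algorithm evaluated in the paper: the cue list, the two word lists, the window size, and the function names are all hypothetical and chosen only for illustration.

```python
# Hypothetical negation cues and sentiment word lists (illustration only,
# not taken from the paper or from any published dictionary).
NEGATION_CUES = {"not", "no", "never", "without"}
POSITIVE = {"increase", "profit", "growth"}
NEGATIVE = {"loss", "decline", "risk"}


def negation_scope(tokens, window=3):
    """Mark the `window` tokens following each negation cue as negated."""
    negated = [False] * len(tokens)
    for i, tok in enumerate(tokens):
        if tok in NEGATION_CUES:
            for j in range(i + 1, min(i + 1 + window, len(tokens))):
                negated[j] = True
    return negated


def sentiment(tokens, window=3):
    """Sum word polarities, flipping the sign of words inside a negation scope."""
    negated = negation_scope(tokens, window)
    score = 0
    for tok, is_negated in zip(tokens, negated):
        polarity = (tok in POSITIVE) - (tok in NEGATIVE)
        score += -polarity if is_negated else polarity
    return score


tokens = "the company did not report a loss".split()
print(negation_scope(tokens))       # scope covers "report a loss"
print(sentiment(tokens))            # negation handled: score is 1
print(sentiment(tokens, window=0))  # negation ignored: score is -1
```

With the negation scope ignored, the sentence counts as negative because of the word "loss"; with the scope applied, its polarity flips. A fixed window is only one possible rule, and the appropriate scope length is itself an assumption here.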
