Generating Domain-Specific Dictionaries using Bayesian Learning

This paper aims to operationalize subjective information processing in financial news disclosures. To measure news tone, previous research commonly relies on manually selected positive and negative word lists, such as the Harvard-IV psychological dictionary. However, such dictionaries may be ill-suited to financial news because their positive and negative entries can carry different connotations in a financial context. To avoid relying on words selected ex ante, we apply several Bayesian variable selection methods to extract the relevant positive and negative words from financial news disclosures. The resulting domain-specific dictionaries outperform existing dictionaries in both explanatory power and predictive performance, improving the correlation between news sentiment and stock market returns by up to 93.25%. Our findings indicate that the interpretation of words depends strongly on context and that managers need to be cautious when framing negative content using positive words.
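
The abstract does not specify the estimation procedure, so the following is only a minimal sketch of the general idea under stated assumptions. It regresses returns on word counts and keeps the words whose coefficients survive shrinkage. As a stand-in for the Bayesian variable selection methods mentioned above, it uses the lasso, whose solution can be read as a MAP estimate under a Laplace prior; this is not necessarily one of the methods the paper applies. The four-word vocabulary, the synthetic data, the penalty `lam`, and all function names are hypothetical.

```python
import random


def soft_threshold(z, lam):
    """Soft-thresholding operator used in the lasso coordinate update."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0


def lasso_coordinate_descent(X, y, lam, n_iter=200):
    """Minimise (1/2n) * ||y - X beta||^2 + lam * ||beta||_1 (no intercept)."""
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    residual = list(y)  # equals y - X beta, kept up to date below
    col_sq = [sum(row[j] ** 2 for row in X) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue  # word never occurs, coefficient stays at zero
            # Correlation of column j with the partial residual.
            rho = sum(
                X[i][j] * (residual[i] + X[i][j] * beta[j]) for i in range(n)
            ) / n
            new_beta = soft_threshold(rho, lam) / col_sq[j]
            delta = new_beta - beta[j]
            if delta != 0.0:
                for i in range(n):
                    residual[i] -= X[i][j] * delta
                beta[j] = new_beta
    return beta


def build_dictionary(vocabulary, beta):
    """Split the selected words into positive and negative lists by sign."""
    return {
        "positive": [w for w, b in zip(vocabulary, beta) if b > 0],
        "negative": [w for w, b in zip(vocabulary, beta) if b < 0],
    }


# Synthetic data (assumption): only "improve" and "loss" move returns.
vocabulary = ["improve", "loss", "the", "company"]
true_effect = [0.5, -0.7, 0.0, 0.0]
rng = random.Random(0)
X = [[rng.randint(0, 3) for _ in vocabulary] for _ in range(200)]
y = [
    sum(count * effect for count, effect in zip(row, true_effect))
    + rng.gauss(0.0, 0.1)
    for row in X
]

beta = lasso_coordinate_descent(X, y, lam=0.05)
dictionary = build_dictionary(vocabulary, beta)
print(dictionary)
```

The sign of each surviving coefficient assigns a word to the positive or negative list, so word polarity is learned from the market reaction rather than fixed ex ante. A fully Bayesian treatment would instead place explicit priors on the coefficients and report posterior inclusion probabilities.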
