Fine-grained analysis of explicit and implicit sentiment in financial news articles

In the financial domain, news has an impact on the stock markets.Most sentiment analysis methods are coarse-grained and focus on explicit sentiment.Such a method is insufficient to detect topic-specific sentiment in financial news articles.We propose a novel fine-grained method that detects explicit and implicit sentiment.This is a viable method for topic-specific sentiment analysis in financial news text. This paper focuses on topic-specific and more specifically company-specific sentiment analysis in financial newswire text. This application is of great use to researchers in the financial domain who study the impact of news (media) on the stock markets.We investigate the viability of a new fine-grained sentiment annotation scheme. Most of the current approaches to sentiment analysis focus on the detection of explicit sentiment. As news text often contains implicit sentiment, i.e. factual information implying positive or negative sentiment, our approach aims to identify both explicit and implicit sentiment. Furthermore, this sentiment is analyzed on a fine-grained level by detecting the topic of the sentiment, as sentiment is not always expressed towards the topics one is interested in.In order to test our approach, we assembled a corpus of company-specific news articles, which was manually labeled by four annotators to create a gold standard. We compare the results of our method to the performance of two coarse-grained baseline systems: a lexicon-based approach and a supervised machine learning approach that makes use of lexical features. Our fine-grained approach outperforms both baselines, and its output shows substantial to almost perfect agreement with the gold standard sentiment labels. Using our annotation scheme, we are able to filter out irrelevant sentiment expressions and detect explicit and implicit sentiment in a reliable way.

[1]  Paul C. Tetlock,et al.  All the News That's Fit to Reprint: Do Investors React to Stale Information? , 2010 .

[2]  Katja Hofmann,et al.  Generating a Non-English Subjectivity Lexicon: Relations That Matter , 2009, EACL.

[3]  Iryna Gurevych,et al.  Sentence and Expression Level Annotation of Opinions in User-Generated Discourse , 2010, ACL.

[4]  Eduard Hovy,et al.  Extracting Opinions, Opinion Holders, and Topics Expressed in Online News Media Text , 2006 .

[5]  R. Shiller Irrational Exuberance Ed. 2 , 2005 .

[6]  Michael T. Cliff,et al.  Investor Sentiment and the Near-Term Stock Market , 2001 .

[7]  Véronique Hoste,et al.  The good, the bad and the implicit: a comprehensive approach to annotating explicit and implicit sentiment , 2015, Lang. Resour. Evaluation.

[8]  Philip S. Yu,et al.  A holistic lexicon-based approach to opinion mining , 2008, WSDM '08.

[9]  Thomas Schuster,et al.  Meta-Communication and Market Dynamics. Reflexive Interactions of Financial Markets and the Mass Media , 2003 .

[10]  S Sidney,et al.  ALEXANDER, . Price Movements in Speculative Markets: Trends or Random Review, , . , 1961 .

[11]  Srinivasan Sankaraguruswamy,et al.  Investor Sentiment and Stock Market Response to Earnings News , 2012 .

[12]  Bing Liu,et al.  Sentiment Analysis and Opinion Mining , 2012, Synthesis Lectures on Human Language Technologies.

[13]  Johan Bollen,et al.  Modeling Public Mood and Emotion: Twitter Sentiment and Socio-Economic Phenomena , 2009, ICWSM.

[14]  Brett Drury,et al.  Identification of fine grained feature based event and sentiment phrases from business news stories , 2011, WIMS '11.

[15]  Sampo Pyysalo,et al.  brat: a Web-based Tool for NLP-Assisted Text Annotation , 2012, EACL.

[16]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[17]  Ying Wah Teh,et al.  Text mining of news-headlines for FOREX market prediction: A Multi-layer Dimension Reduction Algorithm with semantics and sentiment , 2015, Expert Syst. Appl..

[18]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[19]  Vikas Jain From Efficient Markets Theory to Behavioral Finance , 2003 .

[20]  Claire Cardie,et al.  Annotating Expressions of Opinions and Emotions in Language , 2005, Lang. Resour. Evaluation.

[21]  E. Fama EFFICIENT CAPITAL MARKETS: A REVIEW OF THEORY AND EMPIRICAL WORK* , 1970 .

[22]  Janyce Wiebe,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.

[23]  R. Thaler Advances in behavioral finance , 1995 .

[24]  C. Peake,et al.  Stock Market Volatility, Excess Returns, and the Role of Investor Sentiment , 2003 .

[25]  Jan Svartvik,et al.  A __ comprehensive grammar of the English language , 1988 .

[26]  Alan F. Smeaton,et al.  Exploring the use of paragraph-level annotations for sentiment analysis of financial blogs , 2009 .

[27]  Andrés Montoyo,et al.  EmotiNet: A Knowledge Base for Emotion Detection in Text Built on the Appraisal Theories , 2011, NLDB.

[28]  Hsin-Hsi Chen,et al.  Overview of Opinion Analysis Pilot Task at NTCIR-6 , 2007, NTCIR.

[29]  Klaus Krippendorff,et al.  Estimating the Reliability, Systematic Error and Random Error of Interval Data , 1970 .

[30]  Mohamed M. Mostafa,et al.  More than words: Social networks' text mining for consumer brand sentiments , 2013, Expert Syst. Appl..

[31]  B. Malkiel The Efficient Market Hypothesis and Its Critics , 2003 .

[32]  Isabell M. Welpe,et al.  Tweets and Trades: The Information Content of Stock Microblogs , 2010 .

[33]  Theresa Wilson,et al.  Annotating Subjective Content in Meetings , 2008, LREC.

[34]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[35]  Yejin Choi,et al.  Connotation Lexicon: A Dash of Sentiment Beneath the Surface Meaning , 2013, ACL.

[36]  Janyce Wiebe,et al.  Sentiment Propagation via Implicature Constraints , 2014, EACL.

[37]  R. Thaler,et al.  Financial Decision-Making in Markets and Firms: A Behavioral Perspective , 1994 .

[38]  Ying Wah Teh,et al.  Text mining for market prediction: A systematic review , 2014, Expert Syst. Appl..

[39]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[40]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[41]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[42]  Els Lefever,et al.  LeTs Preprocess: The multilingual LT3 linguistic preprocessing toolkit , 2013, CLIN 2013.

[43]  John R. Nofsinger The Psychology of Investing , 2001 .

[44]  C. J. van Rijsbergen,et al.  Information Retrieval , 1979, Encyclopedia of GIS.

[45]  Suresh Manandhar,et al.  SemEval-2014 Task 4: Aspect Based Sentiment Analysis , 2014, *SEMEVAL.

[46]  Maite Taboada,et al.  Lexicon-Based Methods for Sentiment Analysis , 2011, CL.

[47]  Moshe Koppel,et al.  Good News or Bad News? Let the Market Decide , 2006, Computing Attitude and Affect in Text.

[48]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[49]  Walter Daelemans,et al.  "Vreselijk mooi!" (terribly beautiful): A Subjectivity Lexicon for Dutch Adjectives , 2012, LREC.

[50]  David Zimbra,et al.  Twitter brand sentiment analysis: A hybrid system using n-gram analysis and dynamic artificial neural network , 2013, Expert Syst. Appl..

[51]  Khurshid Ahmad,et al.  Sentiment Polarity Identification in Financial News: A Cohesion-based Approach , 2007, ACL.

[52]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[53]  Chenchuramaiah T. Bathala Giving Content to Investor Sentiment: The Role of Media in the Stock Market , 2007 .

[54]  David M. Pennock,et al.  Mining the peanut gallery: opinion extraction and semantic classification of product reviews , 2003, WWW '03.

[55]  Lei Zhang,et al.  Identifying Noun Product Features that Imply Opinions , 2011, ACL.

[56]  Peter D. Turney,et al.  Emotions Evoked by Common Words and Phrases: Using Mechanical Turk to Create an Emotion Lexicon , 2010, HLT-NAACL 2010.

[57]  Klaus Krippendorff,et al.  Content Analysis: An Introduction to Its Methodology , 1980 .

[58]  L. Summers,et al.  Noise Trader Risk in Financial Markets , 1990, Journal of Political Economy.

[59]  Kathleen R. McKeown,et al.  Predicting the semantic orientation of adjectives , 1997 .

[60]  Stefan Trausan-Matu,et al.  The Impact of Valence Shifters on Mining Implicit Economic Opinions , 2010, AIMSA.

[61]  Andrés Montoyo,et al.  Detecting Emotions in Social Affective Situations Using the EmotiNet Knowledge Base , 2011, ISNN.

[62]  Jingfei Du,et al.  Box office prediction based on microblog , 2014, Expert Syst. Appl..

[63]  R. C. Merton,et al.  Presidential Address: A simple model of capital market equilibrium with incomplete information , 1987 .

[64]  Monireh Ebrahimi Side effects recognition as implicit opinion words in drug reviews , 2013 .

[65]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[66]  Patricio Martínez-Barco,et al.  Using EmotiBlog to annotate and analyse subjectivity in the new textual genres , 2012, Data Mining and Knowledge Discovery.

[67]  Alan F. Smeaton,et al.  Topic-dependent sentiment analysis of financial blogs , 2009, TSA@CIKM.

[68]  E. Fama The Behavior of Stock-Market Prices , 1965 .

[69]  Soo-Min Kim,et al.  Determining the Sentiment of Opinions , 2004, COLING.

[70]  Mingxi Wang,et al.  Investor Sentiment and the Cross-Section of Stock Returns , 2009 .

[71]  Isabell M. Welpe,et al.  Tweets and Trades: The Information Content of Stock Microblogs , 2010 .

[72]  Véronique Hoste,et al.  LT3: Sentiment Classification in User-Generated Content Using a Rich Feature Set , 2014, *SEMEVAL.