Extracting Investor Sentiment from Weblog Texts: A Knowledge-based Approach

Financial web logs contain a large amount of investor sentiments, i.e., expert assessments of financial instruments and market situations. These blogs provide potentially new and relevant information for investment managers. Since humans are not able to process and interpret the large amounts of available web information, an automated solution is required. We present a knowledge-based approach for extracting investor sentiment directly at high frequency. The approach performs a semantic analysis that starts on the word and sentence level. We employ ontology-guided and rule-based web information extraction based on domain expertise and linguistic knowledge. We evaluate our approach against standard machine learning approaches. A portfolio selection test using extracted sentiments provides evidence for the economic utility of investor sentiments from weblogs.

[1]  Bing Liu,et al.  Sentiment Analysis and Subjectivity , 2010, Handbook of Natural Language Processing.

[2]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[3]  Mike Y. Chen,et al.  Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web , 2001 .

[4]  Jan Muntermann,et al.  A Text Mining Approach to Support Intraday Financial Decision-Making , 2008, AMCIS.

[5]  Chunping Li,et al.  Ontology Based Opinion Mining for Movie Reviews , 2009, KSEM.

[6]  B. Lev,et al.  Fundamental Information Analysis , 1993 .

[7]  Ying-Wong Cheung,et al.  International evidence on the stock market and aggregate economic activity , 1998 .

[8]  Vasileios Hatzivassiloglou,et al.  Predicting the Semantic Orientation of Adjectives , 1997, ACL.

[9]  Marc-André Mittermayer,et al.  Text Mining Systems for Market Response to News: A Survey , 2007 .

[10]  Michael L. Littman,et al.  Measuring praise and criticism: Inference of semantic orientation from association , 2003, TOIS.

[11]  Mitsuru Ishizuka,et al.  SENTIMENT ASSESSMENT OF TEXT BY ANALYZING LINGUISTIC FEATURES AND CONTEXTUAL VALENCE ASSIGNMENT , 2008, Appl. Artif. Intell..

[12]  S. Ross,et al.  Economic Forces and the Stock Market , 1986 .

[13]  Thorsten Joachims,et al.  Learning to classify text using support vector machines - methods, theory and algorithms , 2002, The Kluwer international series in engineering and computer science.

[14]  David M. Pennock,et al.  Mining the peanut gallery: opinion extraction and semantic classification of product reviews , 2003, WWW '03.

[15]  Tim Loughran,et al.  When is a Liability not a Liability? Textual Analysis, Dictionaries, and 10-Ks , 2010 .

[16]  Songbo Tan,et al.  An improved centroid classifier for text categorization , 2008, Expert Syst. Appl..

[17]  Eric Gilbert,et al.  Widespread Worry and the Stock Market , 2010, ICWSM.

[18]  L. Summers,et al.  The Noise Trader Approach to Finance , 1990 .

[19]  Sasha Blair-Goldensohn,et al.  Sentiment Summarization: Evaluating and Learning User Preferences , 2009, EACL.

[20]  Michael T. Cliff,et al.  Investor Sentiment and Asset Valuation , 2001 .

[21]  Steven Skiena,et al.  Trading Strategies to Exploit Blog and News Sentiment , 2010, ICWSM.

[22]  Songbo Tan,et al.  A survey on sentiment detection of reviews , 2009, Expert Syst. Appl..

[23]  N. Mark,et al.  International Macroeconomics and Finance: Theory and Empirical Methods , 2000 .

[24]  R. Portes,et al.  International macroéconomics and finance , 2007 .

[25]  Alan F. Smeaton,et al.  Topic-dependent sentiment analysis of financial blogs , 2009, TSA@CIKM.