Using frame-based resources for sentiment analysis within the financial domain

User-generated data in blogs and social networks have recently become a valuable resource for sentiment analysis in the financial domain, since they have been shown to be extremely significant to marketing research companies and public opinion organizations. In order to identify bullish and bearish sentiments associated with companies and stocks, we propose a fine-grained approach that returns a continuous score in the $$[-\,1,+\,1]$$[-1,+1] range. Our supervised approach leverages a frame-based ontological resource which produces feature sets such as lexical features, semantic features and their combination. One of the outcome of our analysis suggests that the frame-based ontological resource we have used might be successfully applied for sentiment analysis within the financial domain achieving better results than traditional sentiment analysis methods that do not embody semantics. We also show the higher performance of a fine-grained approach based solely on the evaluation of specific substrings of the message, rather than on features extracted from the whole text of a financial microblog message through the frame-based ontological resource. We have also compared our system with semi-supervised and unsupervised approaches and results indicate that our approach outperforms the others. Last but not the least, our approach is general and can be applied on top of any existing supervised method of polarity detection.

[1]  Diego Reforgiato Recupero,et al.  Joint Proceedings of the 1st Workshop on Semantic Sentiment Analysis (SSA2014), and the Workshop on Social Media and Linked Data for Emergency Response (SMILE 2014) co-located with 11th European Seman , 2015 .

[2]  Martha Palmer,et al.  Class-Based Construction of a Verb Lexicon , 2000, AAAI/IAAI.

[3]  Emanuele Di Rosa,et al.  Evaluating Industrial and Research Sentiment Analysis Engines on Multiple Sources , 2017, AI*IA.

[4]  John Rothfels,et al.  Unsupervised sentiment classification of English movie reviews using automatic selection of positive and negative sentiment items , 2010 .

[5]  Aldo Gangemi,et al.  The OntoWordNet Project: Extension and Axiomatization of Conceptual Relations in WordNet , 2003, OTM.

[6]  Joint Proceedings of the 2th Workshop on Emotions, Modality, Sentiment Analysis and the Semantic Web and the 1st International Workshop on Extraction and Processing of Rich Semantics from Medical Texts co-located with ESWC 2016, Heraklion, Greece, May 29, 2016 , 2016, EMSA-RMed@ESWC.

[7]  Simone Paolo Ponzetto,et al.  BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network , 2012, Artif. Intell..

[8]  Axel Schulz,et al.  A fine-grained sentiment analysis approach for detecting crisis related microposts , 2013, ISCRAM.

[9]  Tim Loughran,et al.  When is a Liability not a Liability? Textual Analysis, Dictionaries, and 10-Ks , 2010 .

[10]  Erik Cambria,et al.  Sentic Computing: Techniques, Tools, and Applications , 2012 .

[11]  Li Chen,et al.  News impact on stock price return via sentiment analysis , 2014, Knowl. Based Syst..

[12]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[13]  Diego Reforgiato Recupero,et al.  Challenge on Fine-Grained Sentiment Analysis Within ESWC2016 , 2016, SemWebEval@ESWC.

[14]  Diego Reforgiato Recupero,et al.  Exploiting Cognitive Computing and Frame Semantic Features for Biomedical Document Clustering , 2017, SeWeBMeDA@ESWC.

[15]  Prashant Raina,et al.  Sentiment Analysis in News Articles Using Sentic Computing , 2013, 2013 IEEE 13th International Conference on Data Mining Workshops.

[16]  Ankur Sinha,et al.  Gold-standard for Topic-specific Sentiment Analysis of Economic Texts , 2014, LREC.

[17]  Harith Alani,et al.  Contextual semantics for sentiment analysis of Twitter , 2016, Inf. Process. Manag..

[18]  Gilles Kassel,et al.  Towards a General Ontology of Computer Programs , 2007, ICSOFT.

[19]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[20]  Qigang Gao,et al.  An Ensemble Sentiment Classification System of Twitter Data for Airline Services Analysis , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[21]  Diego Reforgiato Recupero,et al.  Framester: A Wide Coverage Linguistic Linked Data Hub , 2016, EKAW.

[22]  Christopher Potts,et al.  Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.

[23]  Diego Reforgiato Recupero,et al.  Frame-Based Detection of Opinion Holders and Topics: A Model and a Tool , 2014, IEEE Computational Intelligence Magazine.

[24]  Gerhard Weikum,et al.  WWW 2007 / Track: Semantic Web Session: Ontologies ABSTRACT YAGO: A Core of Semantic Knowledge , 2022 .

[25]  Diego Reforgiato Recupero,et al.  Bearish-Bullish Sentiment Analysis on Financial Microblogs , 2017, EMSASW@ESWC.

[26]  Ying Wah Teh,et al.  Text mining for market prediction: A systematic review , 2014, Expert Syst. Appl..

[27]  Diego Reforgiato Recupero,et al.  ESWC'14 Challenge on Concept-Level Sentiment Analysis , 2014, SemWebEval@ESWC.

[28]  Eibe Frank,et al.  Introducing Machine Learning Concepts with WEKA , 2016, Statistical Genomics.

[29]  Scott Shenker,et al.  Spark: Cluster Computing with Working Sets , 2010, HotCloud.

[30]  Diego Reforgiato Recupero,et al.  Leveraging semantics for sentiment polarity detection in social media , 2019, Int. J. Mach. Learn. Cybern..

[31]  Diego Reforgiato Recupero,et al.  ESWC 15 Challenge on Concept-Level Sentiment Analysis , 2015, SemWebEval@ESWC.

[32]  Saeedeh Momtazi,et al.  Fine-grained German Sentiment Analysis on Social Media , 2012, LREC.

[33]  Pushpak Bhattacharyya,et al.  WikiSent : Weakly Supervised Sentiment Analysis Through Extractive Summarization With Wikipedia , 2012, ECML/PKDD.

[34]  Mohamed M. Mostafa,et al.  More than words: Social networks' text mining for consumer brand sentiments , 2013, Expert Syst. Appl..

[35]  Isabell M. Welpe,et al.  Tweets and Trades: The Information Content of Stock Microblogs , 2010 .

[36]  Aldo Gangemi,et al.  Ontology and the Lexicon: What's in a schema? , 2010 .

[37]  Isabell M. Welpe,et al.  Tweets and Trades: The Information Content of Stock Microblogs , 2010 .

[38]  Stefan Feuerriegel,et al.  Analysis of How Underlying Topics in Financial News Affect Stock Prices Using Latent Dirichlet Allocation , 2016, 2016 49th Hawaii International Conference on System Sciences (HICSS).

[39]  Diego Reforgiato Recupero,et al.  A Semantic Web Based Core Engine to Efficiently Perform Sentiment Analysis , 2014, ESWC.

[40]  Erik Cambria,et al.  SenticNet: A Publicly Available Semantic Resource for Opinion Mining , 2010, AAAI Fall Symposium: Commonsense Knowledge.

[41]  Diego Reforgiato Recupero,et al.  Fine-Grained Sentiment Analysis on Financial Microblogs and News Headlines , 2017, SemWebEval@ESWC.

[42]  Erik Cambria,et al.  SenticNet 2: A Semantic and Affective Resource for Opinion Mining and Sentiment Analysis , 2012, FLAIRS.

[43]  David B. Dunson,et al.  Probabilistic topic models , 2011, KDD '11 Tutorials.

[44]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[45]  Jingfei Du,et al.  Box office prediction based on microblog , 2014, Expert Syst. Appl..

[46]  Hugo Liu,et al.  ConceptNet — A Practical Commonsense Reasoning Tool-Kit , 2004 .

[47]  Owen Rambow,et al.  Sentiment Analysis of Twitter Data , 2011 .

[48]  Hadi Pouransari,et al.  Deep learning for sentiment analysis of movie reviews , 2015 .

[49]  Alan F. Smeaton,et al.  Topic-dependent sentiment analysis of financial blogs , 2009, TSA@CIKM.

[50]  Diego Reforgiato Recupero,et al.  FineNews: fine-grained semantic sentiment analysis on financial microblogs and news , 2019, Int. J. Mach. Learn. Cybern..

[51]  Johanna D. Moore,et al.  Twitter Sentiment Analysis: The Good the Bad and the OMG! , 2011, ICWSM.

[52]  Harith Alani,et al.  SentiCircles: A Platform for Contextual and Conceptual Sentiment Analysis , 2016, ESWC.

[53]  David Sun,et al.  A Large-Scale Linear Regression Sentiment Model , 2012 .

[54]  John Carroll,et al.  Automatic Seed Word Selection for Unsupervised Sentiment Classification of Chinese Text , 2008, COLING.

[55]  Alan F. Smeaton,et al.  Exploring the use of paragraph-level annotations for sentiment analysis of financial blogs , 2009 .

[56]  H. N. Bharathi,et al.  Stock trend prediction using news sentiment analysis , 2016, ArXiv.

[57]  Véronique Hoste,et al.  Fine-grained analysis of explicit and implicit sentiment in financial news articles , 2015, Expert Syst. Appl..

[58]  Diego Reforgiato Recupero,et al.  Semantic Sentiment Analysis Challenge at ESWC2017 , 2017, SemWebEval@ESWC.

[59]  André Freitas,et al.  The SSIX Corpora: Three Gold Standard Corpora for Sentiment Analysis in English, Spanish and German Financial Microblogs , 2018, LREC.

[60]  Estevam R. Hruschka,et al.  Tweet sentiment analysis with classifier ensembles , 2014, Decis. Support Syst..

[61]  Marco Guerini,et al.  SentiWords: Deriving a High Precision and High Coverage Lexicon for Sentiment Analysis , 2015, IEEE Transactions on Affective Computing.

[62]  Keith Allan Natural Language Semantics , 2001 .

[63]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[64]  Junlan Feng,et al.  Robust Sentiment Detection on Twitter from Biased and Noisy Data , 2010, COLING.

[65]  David Zimbra,et al.  Twitter brand sentiment analysis: A hybrid system using n-gram analysis and dynamic artificial neural network , 2013, Expert Syst. Appl..

[66]  John B. Lowe,et al.  The Berkeley FrameNet Project , 1998, ACL.

[67]  Khurshid Ahmad,et al.  Sentiment Polarity Identification in Financial News: A Cohesion-based Approach , 2007, ACL.

[68]  Mauro Dragoni,et al.  A Knowledge-Based Approach for Aspect-Based Opinion Mining , 2016, SemWebEval@ESWC.

[69]  Ammar Belatreche,et al.  Pre-processing online financial text for sentiment classification: A natural language processing approach , 2014, 2014 IEEE Conference on Computational Intelligence for Financial Engineering & Economics (CIFEr).

[70]  C. Fillmore FRAME SEMANTICS AND THE NATURE OF LANGUAGE * , 1976 .

[71]  Christopher Potts,et al.  Learning Word Vectors for Sentiment Analysis , 2011, ACL.

[72]  Dan Ventura,et al.  Sentiment Regression: Using Real-Valued Scores to Summarize Overall Document Sentiment , 2008, 2008 IEEE International Conference on Semantic Computing.

[73]  Harith Alani,et al.  Semantic Patterns for Sentiment Analysis of Twitter , 2014, SEMWEB.

[74]  Diego Reforgiato Recupero,et al.  Sentilo: Frame-Based Sentiment Analysis , 2014, Cognitive Computation.

[75]  R. Goonatilake The Volatility of the Stock Market and News , 2007 .

[76]  Haoran Xie,et al.  Does Summarization Help Stock Prediction? A News Impact Analysis , 2015, IEEE Intelligent Systems.

[77]  Vaibhavi N Patodkar,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2016 .

[78]  Diana Maynard,et al.  Automatic Detection of Political Opinions in Tweets , 2011, #MSM.

[79]  Ying Wah Teh,et al.  Text mining of news-headlines for FOREX market prediction: A Multi-layer Dimension Reduction Algorithm with semantics and sentiment , 2015, Expert Syst. Appl..

[80]  Paolo Rosso,et al.  SemEval-2015 Task 11: Sentiment Analysis of Figurative Language in Twitter , 2015, *SEMEVAL.

[81]  André Freitas,et al.  SemEval-2017 Task 5: Fine-Grained Sentiment Analysis on Financial Microblogs and News , 2017, *SEMEVAL.