Sentiment Analysis of French Tweets based on Subjective Lexicon Approach: Evaluation of the use of OpenNLP and CoreNLP Tools

Corresponding Author: Abdelkader Rhouati Team SIQL, Laboratory LSEII, ENSAO Mohammed First University, Oujda, Morocco Email: abdelkader.rhouati@gmail.com Abstract: Nowadays, sentiment analysis is becoming a very important issue of research. This paper present experimentation on sentiment analysis based on subjective lexicon method. This experimentation is tested over French tweets using "Public Opinion Knowledge (POK)" platform. POK is a platform consists in getting public opinion orientation from text extracted from social network and blogs, which we have developed and presented in previous papers. There are three algorithms as classifiers, which are based on Natural Language Processing Tools. The first is based on OpenNLP, the second on CoreNLP and the third on dependency analysis implemented by CoreNLP. Each classifier consists of three steps, which are Part of Speech Tagging (POS), word polarity classification and sentiment classification algorithm. On the one hand, the results are used to evaluate the use of OpenNLP and CoreNLP, on other, they draw to make a comparison between lexicon and machine-learning approaches. So, experimentation leads us to conclude that tools of sentiment analysis based on lexicon are much performant than those based on machine learning and they can reach a rate of precision of 70% and F-measure of 0.7. Also, we conclude that CoreNLP is more efficient than OpenNLP by 3% of precision, this fact is due to the efficiency of Part of Speech tagging algorithms.

[1]  Bing Liu,et al.  Sentiment Analysis and Subjectivity , 2010, Handbook of Natural Language Processing.

[2]  Christopher D. Manning,et al.  Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger , 2000, EMNLP.

[3]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[4]  Shabina Dhuria Analysis : An approach in Natural Language Processing for Data Extraction , 2015 .

[5]  Mohammed Ghaouth Belkasmi,et al.  Get the Public Opinion from Content Published on the Web/CSM: New Approach Based on Big Data , 2016 .

[6]  Vasileios Hatzivassiloglou,et al.  Predicting the Semantic Orientation of Adjectives , 1997, ACL.

[7]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[8]  Slav Petrov,et al.  A Universal Part-of-Speech Tagset , 2011, LREC.

[9]  Suad Alhojely,et al.  Sentiment Analysis and Opinion Mining: A Survey , 2016 .

[10]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[11]  Ralph Weischedel,et al.  PERFORMANCE MEASURES FOR INFORMATION EXTRACTION , 2007 .

[12]  Claire Cardie,et al.  OpinionFinder: A System for Subjectivity Analysis , 2005, HLT.

[13]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[14]  R. Bodin A Sentimental Education , 2011 .

[15]  Rudy Prabowo,et al.  Sentiment analysis: A combined approach , 2009, J. Informetrics.

[16]  Mohammed Ghaouth Belkasmi,et al.  Public Opinion Knowledge (POK) platform based on apache hadoop: To get public opinion from French content published on the Web/CSM , 2016, 2016 2nd International Conference on Cloud Computing Technologies and Applications (CloudTech).

[17]  Soo-Min Kim,et al.  Determining the Sentiment of Opinions , 2004, COLING.

[18]  Andrea Esuli,et al.  SentiWordNet: A High-Coverage Lexical Resource for Opinion Mining , 2006 .

[19]  Erik Cambria,et al.  Jumping NLP Curves: A Review of Natural Language Processing Research [Review Article] , 2014, IEEE Computational Intelligence Magazine.

[20]  Robert B. Allen,et al.  Several Studies on Natural Language ·and Back-Propagation , 1987 .

[21]  Dan Klein,et al.  Fast Exact Inference with a Factored Model for Natural Language Parsing , 2002, NIPS.

[22]  Prem Melville,et al.  Sentiment analysis of blogs by combining lexical knowledge with text classification , 2009, KDD.

[23]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[24]  Razvan C. Bunescu,et al.  Sentiment analyzer: extracting sentiments about a given topic using natural language processing techniques , 2003, Third IEEE International Conference on Data Mining.

[25]  Doaa Mohey El Din Mohamed Hussein,et al.  A survey on sentiment analysis challenges , 2016, Journal of King Saud University - Engineering Sciences.