Dynamic Construction of Dictionaries for Sentiment Classification

The sentiment classification is one of the new challenges emerged with the advence of social networks. Our purpose is to determine the sentimental orientation of a Facebook comment (positive or negative) by using the linguistic approach. In most of the sentiment analysis applications using this approach, the sentiment lexicon plays a key role. Thus, it is very important to create a lexicon covering several sentiment words. For this reason, we address in this paper the problem how to group and list words present in the corpus into two dictionaries. We proposed a new automatic technique to create the positive and negative dictionaries that exploits the emotions symbols (emoticons, acronyms and exclamation words) present in comments. More importantly, our idea allows to enlarge these dictionaries with an enrichment step. Finally, by using these prepared dictionaries, we predict the positive and negative polarities of the comment. We evaluate our approach by comparison to human classification. Our results are also effective and consistent.

[1]  Jeonghee Yi,et al.  Sentiment analysis: capturing favorability using natural language processing , 2003, K-CAP '03.

[2]  Hiroshi Kanayama,et al.  Fully Automatic Lexicon Expansion for Domain-oriented Sentiment Analysis , 2006, EMNLP.

[3]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[4]  Janyce Wiebe,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.

[5]  Marshall S. Smith,et al.  The general inquirer: A computer approach to content analysis. , 1967 .

[6]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[7]  Bing Liu,et al.  The utility of linguistic rules in opinion mining , 2007, SIGIR.

[8]  Bing Liu,et al.  Mining Opinion Features in Customer Reviews , 2004, AAAI.

[9]  Francesco Piazza,et al.  Sentic Web: A New Paradigm for Managing Social Media Affective Information , 2011, Cognitive Computation.

[10]  Kenneth Ward Church,et al.  Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.

[11]  Jonathon Read,et al.  Using Emoticons to Reduce Dependency in Machine Learning Techniques for Sentiment Classification , 2005, ACL.

[12]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[13]  J. Kamps,et al.  Words with attitude , 2002 .

[14]  Erik Cambria,et al.  Application of multi-dimensional scaling and artificial neural networks for biologically inspired opinion mining , 2013, BICA 2013.

[15]  Michael L. Littman,et al.  Measuring praise and criticism: Inference of semantic orientation from association , 2003, TOIS.

[16]  Diego Reforgiato Recupero,et al.  Sentiment Analysis: Adjectives and Adverbs are Better than Adjectives Alone , 2007, ICWSM.

[17]  Maite Taboada,et al.  Methods for Creating Semantic Orientation Dictionaries , 2006, LREC.

[18]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[19]  Cécile Bothorel,et al.  Approches Statistique et Linguistique Pour la Classification de Textes d'Opinion Portant sur les Films , 2009, Fouille de Données d'Opinions.

[20]  Björn W. Schuller,et al.  New Avenues in Opinion Mining and Sentiment Analysis , 2013, IEEE Intelligent Systems.