PerSent 2.0: Persian Sentiment Lexicon Enriched with Domain-Specific Words

Sentiment analysis is probably the most actively growing area of natural language processing nowadays, which leverages huge amount of user-contributed data on Internet to improve income of businesses and quality of life of consumer. The majority of existent sentiment-analysis systems is focused on English, due to lack of resources and tools for other languages. To fill this gap for Persian language, in our previous work we have compiled the first version of PerSent Persian sentiment lexicon, which was small and included only words and phrases from general domain. In this paper, we present its extension with words from three different domains and evaluate its performance on polarity classification task using various machine learning-based classifiers. We use a multi-domain dataset to evaluate the performance of our new lexicon on various domains. Our results demonstrate usefulness of the new lexicon for analysis of product and movie reviews and especially of political news in Persian language.

[1]  Christopher M. Danforth,et al.  Temporal Patterns of Happiness and Information in a Global Social Network: Hedonometrics and Twitter , 2011, PloS one.

[2]  Hadi Larijani,et al.  Exploiting Deep Learning for Persian Sentiment Analysis , 2018, BICS.

[3]  Amir Hussain,et al.  A novel brain-inspired compression-based optimised multimodal fusion for emotion recognition , 2017, 2017 IEEE Symposium Series on Computational Intelligence (SSCI).

[4]  Behrouz Minaei-Bidgoli,et al.  An Empirical Study on the Effect of Morphological and Lexical Features in Persian Dependency Parsing , 2013, SPMRL@EMNLP.

[5]  Hadi Larijani,et al.  Statistical Analysis Driven Optimized Deep Learning System for Intrusion Detection , 2018, BICS.

[6]  Francesco Carlo Morabito,et al.  Toward an Automatic Classification of SEM Images of Nanomaterials via a Deep Learning Approach , 2020, Neural Approaches to Dynamics of Signal Exchanges.

[7]  Erik Cambria,et al.  SenticNet: A Publicly Available Semantic Resource for Opinion Mining , 2010, AAAI Fall Symposium: Commonsense Knowledge.

[8]  Hsin-Hsi Chen,et al.  Building Emotion Lexicon from Weblog Corpora , 2007, ACL.

[9]  Qiang Zhou,et al.  Multilingual Sentiment Analysis: State of the Art and Independent Comparison of Techniques , 2016, Cognitive Computation.

[10]  Björn W. Schuller,et al.  SenticNet 4: A Semantic Resource for Sentiment Analysis Based on Conceptual Primitives , 2016, COLING.

[11]  Amir Hussain,et al.  Persian Named Entity Recognition , 2017, 2017 IEEE 16th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC).

[12]  Tariq S. Durrani,et al.  A Comparative Study of Persian Sentiment Analysis Based on Different Feature Combinations , 2017, CSPS.

[13]  Erik Cambria,et al.  SenticNet 2: A Semantic and Affective Resource for Opinion Mining and Sentiment Analysis , 2012, FLAIRS.

[14]  Mirna Adriani,et al.  A Comparative Study on Twitter Sentiment Analysis: Which Features are Good? , 2015, NLDB.

[15]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[16]  Alexander F. Gelbukh,et al.  Adaptation of Sentiment Analysis Techniques to Persian Language , 2017, CICLing.

[17]  Victoria Bobicev,et al.  Emotions in Words: Developing a Multilingual WordNet-Affect , 2010, CICLing.

[18]  Mohammad Ehsan Basiri,et al.  A Framework for Sentiment Analysis in Persian , 2014 .

[19]  Amir Hussain,et al.  Deep learning driven multimodal fusion for automated deception detection , 2017, 2017 IEEE Symposium Series on Computational Intelligence (SSCI).

[20]  Qiang Zhou,et al.  PerSent: A Freely Available Persian Sentiment Lexicon , 2016, BICS.

[21]  Mahmoud Al-Ayyoub,et al.  Automatic Lexicon Construction for Arabic Sentiment Analysis , 2014, 2014 International Conference on Future Internet of Things and Cloud.

[22]  Salwani Abdullah,et al.  Arabic senti-lexicon: Constructing publicly available language resources for Arabic sentiment analysis , 2018, J. Inf. Sci..

[23]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[24]  Ana María Martínez Enríquez,et al.  Lexicon Based Sentiment Analysis of Urdu Text Using SentiUnits , 2010, MICAI.

[25]  Pablo Gervás,et al.  SentiSense: An easily scalable concept-based affective lexicon for sentiment analysis , 2012, LREC.

[26]  Erik Cambria,et al.  SenticNet 5: Discovering Conceptual Primitives for Sentiment Analysis by Means of Context Embeddings , 2018, AAAI.

[27]  Francesco Carlo Morabito,et al.  A Convolutional Neural Network approach for classification of dementia stages based on 2D-spectral representation of EEG recordings , 2019, Neurocomputing.

[28]  Pushpak Bhattacharyya,et al.  A Sentiment Analyzer for Hindi Using Hindi Senti Lexicon , 2014, ICON.