Neural Network-Based Architecture for Sentiment Analysis in Indian Languages

Abstract Sentiment analysis refers to determining the polarity of the opinions represented by text. The paper proposes an approach to determine the sentiments of tweets in one of the Indian languages (Hindi, Bengali, and Tamil). Thirty-nine sequential models have been created using three different neural network layers [recurrent neural networks (RNNs), long short-term memory (LSTM), convolutional neural network (CNN)] with optimum parameter settings (to avoid over-fitting and error accumulation). These sequential models have been investigated for each of the three languages. The proposed sequential models are experimented to identify how the hidden layers affect the overall performance of the approach. A comparison has also been performed with existing approaches to find out if neural networks have an added advantage over traditional machine learning techniques.

[1]  Aidong Zhang,et al.  A Correlated Topic Model Using Word Embeddings , 2017, IJCAI.

[2]  Gökhan Tür,et al.  Multi-Domain Joint Semantic Frame Parsing Using Bi-Directional RNN-LSTM , 2016, INTERSPEECH.

[3]  K. P. Soman,et al.  Predicting the Sentimental Reviews in Tamil Movie using Machine Learning Algorithms , 2016 .

[4]  K. P. Soman,et al.  AMRITA-CEN@SAIL2015: Sentiment Analysis in Indian Languages , 2015, MIKE.

[5]  Kuldip K. Paliwal,et al.  Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..

[6]  Gaurav Harit,et al.  Topographic Feature Extraction for Bengali and Hindi Character Images , 2011, ArXiv.

[7]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[8]  Daniel Dajun Zeng,et al.  Twitter Sentiment Analysis: A Bootstrap Ensemble Framework , 2013, 2013 International Conference on Social Computing.

[9]  Estevam R. Hruschka,et al.  Tweet sentiment analysis with classifier ensembles , 2014, Decis. Support Syst..

[10]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[11]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[12]  Christian Biemann,et al.  IIT-TUDA: System for Sentiment Analysis in Indian Languages Using Lexical Acquisition , 2015, MIKE.

[13]  Regina Barzilay,et al.  Molding CNNs for text: non-linear, non-consecutive convolutions , 2015, EMNLP.

[14]  Shibamouli Lahiri,et al.  Sentiment Analysis of Tweets in Three Indian Languages , 2016, WSSANLP@COLING.

[15]  Rudolf Kadlec,et al.  Improved Deep Learning Baselines for Ubuntu Corpus Dialogs , 2015, ArXiv.

[16]  J. Fernando Sánchez-Rada,et al.  Enhancing deep learning sentiment analysis with ensemble techniques in social applications , 2020 .

[17]  Xiao Sun,et al.  Sentiment analysis for Chinese microblog based on deep neural networks with convolutional extension features , 2016, Neurocomputing.

[18]  Guy Lapalme,et al.  A systematic analysis of performance measures for classification tasks , 2009, Inf. Process. Manag..

[19]  K. Robert Lai,et al.  Dimensional Sentiment Analysis Using a Regional CNN-LSTM Model , 2016, ACL.

[20]  S. K. Mohanty The Formulation of Parameters for Type Design of Indian Script Based on Calligraphic Studies , 1998, EP.

[21]  Cícero Nogueira dos Santos,et al.  Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts , 2014, COLING.

[22]  Alex Graves,et al.  Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.

[23]  Christopher Potts,et al.  Learning Word Vectors for Sentiment Analysis , 2011, ACL.

[24]  Fabio Crestani,et al.  Like It or Not , 2016, ACM Comput. Surv..

[25]  Thomas Wiatowski,et al.  A Mathematical Theory of Deep Convolutional Neural Networks for Feature Extraction , 2015, IEEE Transactions on Information Theory.

[26]  Sabir Ismail,et al.  Detecting Sentiment from Bangla Text using Machine Learning Technique and Feature Analysis , 2016 .

[27]  Tej Prasad Dhamala,et al.  Sentiment Analysis of English and Tamil Tweets using Path Length Similarity based Word Sense Disambiguation , 2016 .

[28]  Yoshua Bengio,et al.  Scaling learning algorithms towards AI , 2007 .

[29]  Mikio L. Braun,et al.  Fast cross-validation via sequential testing , 2012, J. Mach. Learn. Res..

[30]  Braja Gopal Patra,et al.  Shared Task on Sentiment Analysis in Indian Languages (SAIL) Tweets - An Overview , 2015, MIKE.

[31]  M. Anand Kumar,et al.  Analyzing sentiment in Indian languages micro text using recurrent neural network , 2016 .

[32]  B. Premjith,et al.  AMRITA_CEN-NLP@SAIL2015: Sentiment Analysis in Indian Language Using Regularized Least Square Approach with Randomized Feature Learning , 2015, MIKE.

[33]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..