Deep Learning Performance In Sentiment Analysis

A problem of sentiment analysis is the text representation, that encodes text into a continuous vector by arranging projections from semantics to the points in high dimensional space. Deep learning methods have been widely used to solve various sentiment analysis problems. To improve the performance of deep learning in sentiment analysis requires a good method of text representation to be used as an embedding layer. In this study, we analyzed deep learning with the Recurrent Neural Network (RNN) method with Long Short-Term Memory (LSTM) variants in sentiment classification. We compare the performance of the LSTM network with Word2Vec to the network without Word2Vec as word embedding. Sentiment data used is derived from user-provided reviews of the applications in Google Play. The training process involves the Dropout layer and Early Stopping points to prevent overfitting. The results showed that the LSTM network using a word embedding Word2Vec is better than without Word2Vec. LSTM with Word2Vec 300 words dimension got a low error value of 0.3287 with an accuracy of 86.76%. While the LSTM testing results without Word2Vec get the lowest error of 0.3751 with an accuracy of 84.14%.

[1]  Antonio Jimeno-Yepes,et al.  Word embeddings and recurrent neural networks based on Long-Short Term Memory nodes in supervised biomedical word sense disambiguation , 2017, J. Biomed. Informatics.

[2]  Zhenfang Zhu,et al.  Lexicon-Enhanced Attention Network Based on Text Representation for Sentiment Classification , 2019, Applied Sciences.

[3]  Hugh E. Williams,et al.  Stemming Indonesian: A confix-stripping approach , 2007, TALIP.

[4]  Lei Zhang,et al.  Sentiment Analysis and Opinion Mining , 2017, Encyclopedia of Machine Learning and Data Mining.

[5]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[6]  Ali H. Mirza,et al.  Computer network intrusion detection using sequential LSTM Neural Networks autoencoders , 2018, 2018 26th Signal Processing and Communications Applications Conference (SIU).

[7]  W. Marsden I and J , 2012 .

[8]  Ausif Mahmood,et al.  Deep learning for sentence classification , 2017, 2017 IEEE Long Island Systems, Applications and Technology Conference (LISAT).

[9]  Tong Zhang,et al.  Text Mining: Predictive Methods for Analyzing Unstructured Information , 2004 .

[10]  Edi Winarko,et al.  Sentimen Analisis Tweet Berbahasa Indonesia Dengan Deep Belief Network , 2017 .

[11]  Mehmet Serkan Apaydin,et al.  A Deep Learning Approach to Sentiment Analysis in Turkish , 2018, 2018 International Conference on Artificial Intelligence and Data Processing (IDAP).

[12]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[13]  Henda Hajjami Ben Ghézala,et al.  Comparative study of word embedding methods in topic segmentation , 2017, KES.

[14]  Bin Ge,et al.  Classification Algorithmof Chinese SentimentOrientation Based on Dictionary and LSTM , 2018, ICBDR.