Sentiment Analysis of Hotel Aspect Using Probabilistic Latent Semantic Analysis, Word Embedding and LSTM

In the industrial era 5.0, product reviews are necessary for the sustainability of a company. Product reviews are a User Generated Content (UGC) feature which describes customer satisfaction. The researcher used five hotel aspects including location, meal, service, comfort, and cleanliness to measure customer satisfaction. Each product review was preprocessed into a term list document. In this context, we proposed the Probabilistic Latent Semantic Analysis (PLSA) method to produce a hidden topic. Semantic Similarity was used to classify topics into five hotel aspects. The Term Frequency-Inverse Corpus Frequency (TF-ICF) method was used for weighting each term list, which had been expanded from each cluster in the document. The researcher used Word embedding to obtain vector values in the deep learning method from Long Short-Term Memory (LSTM) for sentiment classification. The result showed that the combination of the PLSA + TF ICF 100% + Semantic Similarity method was superior are 0.840 in the fifth categorization of the hotel aspects; the Word Embedding + LSTM method outperformed the sentiment classification at value 0.946; the service aspect received positive sentiment value higher are 45.545 than the other aspects; the comfort aspect received negative sentiment value higher are 12.871 than the other aspects. Other results also showed that sentiment was affected by the aspects.

[1]  Okran Jeong,et al.  Social media contents based sentiment analysis and prediction system , 2018, Expert Syst. Appl..

[2]  Parsa Ghaffari,et al.  Opinion Mining and Sentiment Polarity on Twitter and Correlation between Events and Sentiment , 2016, 2016 IEEE Second International Conference on Big Data Computing Service and Applications (BigDataService).

[3]  Yibai Li,et al.  Business intelligence in online customer textual reviews: Understanding consumer perceptions and influential factors , 2017, Int. J. Inf. Manag..

[4]  Christopher D. Manning,et al.  Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks , 2015, ACL.

[5]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[6]  Abhishek Kumar,et al.  Aspect based Sentiment Oriented Summarization of Hotel Reviews , 2017 .

[7]  Vural Özdemir,et al.  Birth of Industry 5.0: Making Sense of Big Data with Artificial Intelligence, "The Internet of Things" and Next-Generation Technology Policy. , 2018, Omics : a journal of integrative biology.

[8]  Peng Wang,et al.  Semantic expansion using word embedding clustering and convolutional neural network for improving short text classification , 2016, Neurocomputing.

[9]  Vassil Alexandrov,et al.  News clustering based on similarity analysis , 2017, ITQM.

[10]  F. Okumus,et al.  Understanding Satisfied and Dissatisfied Hotel Customers: Text Mining of Online Hotel Reviews , 2016 .

[11]  Riyanarto Sarno,et al.  Prediction of Movie Sentiment Based on Reviews and Score on Rotten Tomatoes Using SentiWordnet , 2018, 2018 International Seminar on Application for Technology of Information and Communication.

[12]  Filiberto Pla,et al.  Latent topics-based relevance feedback for video retrieval , 2016, Pattern Recognit..

[13]  Jiangtao Ren,et al.  A topic-driven language model for learning to generate diverse sentences , 2019, Neurocomputing.

[14]  Riyanarto Sarno,et al.  Supervised Probabilistic Latent Semantic Analysis (sPLSA) for estimating technology readiness level , 2017, 2017 11th International Conference on Information & Communication Technology and System (ICTS).

[15]  Riyanarto Sarno,et al.  Sentiment Detection of Comment Titles in Booking.com Using Probabilistic Latent Semantic Analysis , 2018, 2018 6th International Conference on Information and Communication Technology (ICoICT).

[16]  Xin Wang,et al.  Efficient algorithms for graph regularized PLSA for probabilistic topic modeling , 2019, Pattern Recognit..

[17]  J. Fernando Sánchez-Rada,et al.  Enhancing deep learning sentiment analysis with ensemble techniques in social applications , 2020 .

[18]  Dursun Delen,et al.  Predicting hospital readmission for lupus patients: An RNN-LSTM-based deep-learning methodology , 2018, Comput. Biol. Medicine.

[19]  Xin Wang,et al.  Co-Regularized PLSA for Multi-Modal Learning , 2016, AAAI.

[20]  Li Chen,et al.  Augmenting service recommender systems by incorporating contextual opinions from user reviews , 2015, User Modeling and User-Adapted Interaction.

[21]  Abeer Alsadoon,et al.  Deep Learning for Aspect-Based Sentiment Analysis: A Comparative Review , 2019, Expert Syst. Appl..

[22]  Riyanarto Sarno,et al.  Automatic ranking system of university based on technology readiness level using LDA-Adaboost.MH , 2018, 2018 International Conference on Information and Communications Technology (ICOIACT).

[23]  Patrícia Augustin Jaques,et al.  An Analysis of Hierarchical Text Classification Using Word Embeddings , 2018, Inf. Sci..

[24]  Hadi Veisi,et al.  Sentiment analysis based on improved pre-trained word embeddings , 2019, Expert Syst. Appl..

[25]  Robert P. Schumaker,et al.  Predicting wins and spread in the Premier League using a sentiment analysis of twitter , 2016, Decis. Support Syst..

[26]  Laizhong Cui,et al.  Weakly supervised topic sentiment joint model with word embeddings , 2018, Knowl. Based Syst..

[27]  Ekin Ekinci,et al.  An Aspect-Sentiment Pair Extraction Approach Based on Latent Dirichlet Allocation , 2018 .

[28]  Soumya K. Ghosh,et al.  Sentiment Analysis of Customer Reviews Using Robust Hierarchical Bidirectional Recurrent Neural Network , 2016, CSOC.

[29]  Zhiyong Feng,et al.  LSTM with sentence representations for document-level sentiment classification , 2018, Neurocomputing.

[30]  R. Sarno,et al.  Traceability between business process and software component using Probabilistic Latent Semantic Analysis , 2016, 2016 International Conference on Informatics and Computing (ICIC).

[31]  Anh-Cuong Le,et al.  Exploiting multiple word embeddings and one-hot character vectors for aspect-based sentiment analysis , 2018, Int. J. Approx. Reason..

[32]  Z. Schwartz,et al.  What can big data and text analytics tell us about hotel guest experience and satisfaction , 2015 .

[33]  Murtaza Haider,et al.  Beyond the hype: Big data concepts, methods, and analytics , 2015, Int. J. Inf. Manag..

[34]  Aung Win,et al.  Aspect Level Opinion Mining for Hotel Reviews in Myanmar Language , 2018, 2018 IEEE International Conference on Agents (ICA).

[35]  Riyanarto Sarno,et al.  Discovering traceability between business process and software component using Latent Dirichlet Allocation , 2016, 2016 International Conference on Informatics and Computing (ICIC).

[36]  Riyanarto Sarno,et al.  Business Process model similarity analysis using hybrid PLSA and WDAG methods , 2016, 2016 International Conference on Information & Communication Technology and Systems (ICTS).

[37]  Xuejie Zhang,et al.  Using a stacked residual LSTM model for sentiment intensity prediction , 2018, Neurocomputing.

[38]  Zhengxin Chen,et al.  Towards Extended Data Mining: An Examination of Technical Aspects , 2018, ITQM.