Sentiment Analysis of Hotel Reviews Using Latent Dirichlet Allocation, Semantic Similarity and LSTM

Product reviews are usually determined by sentiment of customers; however sentiment analysis based on aspects still need further research. A hotel commonly has five aspects, which are location, meal, service, comfort and cleanliness. This research proposes methods to determine review sentiment according to the hotel aspects. A hotel reviews are preprocessed into a term list. Firstly, Latent Dirichelet Allocation (LDA) determines the hidden topics of a term list; then Semantic Similarity categorizes the term list based on the topic resulted by Latent Dirichelet Allocation (LDA) into the five aspects of a hotel. Then in calculating similarity measurement, the term list is expanded by using the Term Frequency-Inverse Cluster Frequency (TF-ICF) method. Finally, a classification of customer sentiment (satisfied or dissatisfied) is conducted by using the combination of Word Embedding and Long-short Term Memory (LSTM). The results show that the proposed method can classify the reviews into the five hotel aspects. The highest aspect categorization performance is obtained by using LDA + TF-ICF 100% + Semantic Similarity which reaches 85%; the performance sentiment classification for the highest aspect-based sentiment analysis is obtained by using Word Embedding + LSTM which reaches 93%; and the comfort aspect receives more negative sentiments compared to the sentiments of other aspects. Also the results show that a sentiment is influenced by an aspect.

[1]  A. S. Cantallops,et al.  International Journal of Hospitality Management New Consumer Behavior: a Review of Research on Ewom and Hotels , 2022 .

[2]  Endang Wahyu Pamungkas,et al.  B-BabelNet: Business-Specific Lexical Database for Improving Semantic Analysis of Business Process Models , 2017 .

[3]  Riyanarto Sarno,et al.  Prediction of Movie Sentiment Based on Reviews and Score on Rotten Tomatoes Using SentiWordnet , 2018, 2018 International Seminar on Application for Technology of Information and Communication.

[4]  Ayu Purwarianti,et al.  Combination of Latent Dirichlet Allocation (LDA) and Term Frequency-Inverse Cluster Frequency (TFxICF) in Indonesian text clustering with labeling , 2016, 2016 4th International Conference on Information and Communication Technology (ICoICT).

[5]  M. Geetha,et al.  Relationship between customer sentiment and online customer ratings for hotels - An empirical analysis , 2017 .

[6]  Riyanarto Sarno,et al.  Discovering traceability between business process and software component using Latent Dirichlet Allocation , 2016, 2016 International Conference on Informatics and Computing (ICIC).

[7]  Kuanchin Chen,et al.  Predicting hotel review helpfulness: The impact of review visibility, and interaction between hotel stars and review ratings , 2016, Int. J. Inf. Manag..

[8]  Hadi Veisi,et al.  Sentiment analysis based on improved pre-trained word embeddings , 2019, Expert Syst. Appl..

[9]  Yuan Jiang,et al.  Exploring hidden factors behind online food shopping from Amazon reviews: A topic mining approach , 2018 .

[10]  Murtaza Haider,et al.  Beyond the hype: Big data concepts, methods, and analytics , 2015, Int. J. Inf. Manag..

[11]  Ying Fu,et al.  Automated classification of software change messages by semi-supervised Latent Dirichlet Allocation , 2015, Inf. Softw. Technol..

[12]  Alexandros Potamianos,et al.  Similarity computation using semantic networks created from web-harvested data , 2013, Natural Language Engineering.

[13]  Jie Yang,et al.  Short text classification based on LDA topic model , 2016, 2016 International Conference on Audio, Language and Image Processing (ICALIP).

[14]  Jinyoung Han,et al.  Understanding customers' hotel revisiting behaviour: a sentiment analysis of online feedback reviews , 2018, Current Issues in Tourism.

[15]  Yasen Jiao,et al.  Performance measures in evaluating machine learning based bioinformatics predictors for classifications , 2016, Quantitative Biology.

[16]  Li Chen,et al.  Augmenting service recommender systems by incorporating contextual opinions from user reviews , 2015, User Modeling and User-Adapted Interaction.

[17]  Chih-Fong Tsai,et al.  Under-sampling class imbalanced datasets by combining clustering analysis and instance selection , 2019, Inf. Sci..

[18]  MoraesRodrigo,et al.  Document-level sentiment classification , 2013 .

[19]  H. Farsijani,et al.  The Effect of Customer Relationship Management and its Significant Relationship by Customers’ Reactions in LG Company☆ , 2016 .

[20]  Murtadha Talib AL-Sharuee,et al.  Sentiment analysis: An automatic contextual analysis and ensemble clustering approach and comparison , 2018, Data Knowl. Eng..

[21]  Preslav Nakov,et al.  Machine Translation Evaluation with Neural Networks , 2017, Comput. Speech Lang..

[22]  Guergana K. Savova,et al.  Unsupervised Document Classification with Informed Topic Models , 2016, BioNLP@ACL.

[23]  Abhishek Kumar,et al.  Aspect based Sentiment Oriented Summarization of Hotel Reviews , 2017 .

[24]  Riyanarto Sarno,et al.  A comparative study of sentiment analysis using SVM and SentiWordNet , 2019, Indonesian Journal of Electrical Engineering and Computer Science.

[25]  Abeer Alsadoon,et al.  Deep Learning for Aspect-Based Sentiment Analysis: A Comparative Review , 2019, Expert Syst. Appl..

[26]  Riyanarto Sarno,et al.  Automatic ranking system of university based on technology readiness level using LDA-Adaboost.MH , 2018, 2018 International Conference on Information and Communications Technology (ICOIACT).

[27]  Xuejie Zhang,et al.  Using a stacked residual LSTM model for sentiment intensity prediction , 2018, Neurocomputing.

[28]  Nagwa M. El-Makky,et al.  Sentiment Analysis of Arabic Tweets using Deep Learning , 2018, ACLING.

[29]  R. Law,et al.  Hospitality and Tourism Online Reviews: Recent Trends and Future Directions , 2015 .