Deep Learning-based Sentiment Analysis and Topic Modeling on Tourism During Covid-19 Pandemic

The Covid-19 pandemic has disrupted the world economy and significantly influenced the tourism industry. Millions of people have shared their emotions, views, facts, and circumstances on numerous social media platforms, which has resulted in a massive flow of information. The high-density social media data has drawn many researchers to extract valuable information and understand the user’s emotions during the pandemic time. The research looks at the data collected from the micro-blogging site Twitter for the tourism sector, emphasizing sub-domains hospitality and healthcare. The sentiment of approximately 20,000 tweets have been calculated using Valence Aware Dictionary for Sentiment Reasoning (VADER) model. Furthermore, topic modeling was used to reveal certain hidden themes and determine the narrative and direction of the topics related to tourism healthcare, and hospitality. Topic modeling also helped us to identify inter-cluster similar terms and analyzing the flow of information from a group of a similar opinion. Finally, a cutting-edge deep learning classification model was used with different epoch sizes of the dataset to anticipate and classify the people’s feelings. The deep learning model has been tested with multiple parameters such as training set accuracy, test set accuracy, validation loss, validation accuracy, etc., and resulted in more than a 90% in training set accuracy tourism hospitality and healthcare reported 80.9 and 78.7% respectively on test set accuracy.

[1]  M. Tatum Will medical tourism survive covid-19? , 2020, BMJ.

[2]  Changsok Yoo,et al.  An analysis of the utilization of Facebook by local Korean governments for tourism development and the network of smart tourism ecosystem , 2016, Int. J. Inf. Manag..

[3]  A. R. Javed,et al.  Classification and Categorization of COVID-19 Outbreak in Pakistan , 2021 .

[4]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[5]  Praveen Kumar Reddy Maddikunta,et al.  Genetically Optimized Prediction of Remaining Useful Life , 2021, Sustain. Comput. Informatics Syst..

[6]  Naciye Güliz Uğur,et al.  Impacts of COVID-19 on global tourism industry: A cross-regional comparison , 2020, Tourism Management Perspectives.

[7]  Xia Feng,et al.  Latent Dirichlet allocation (LDA) and topic modeling: models, applications, a survey , 2017, Multimedia Tools and Applications.

[8]  Praveen Kumar Reddy Maddikunta,et al.  Early Detection of Diabetic Retinopathy Using PCA-Firefly Based Deep Learning Model , 2020, Electronics.

[9]  C. L. Chowdhary,et al.  Deep learning and medical image processing for coronavirus (COVID-19) pandemic: A survey , 2020, Sustainable Cities and Society.

[10]  K. Phuah,et al.  The impact of COVID-19 on tourism industry in Malaysia , 2020, Current Issues in Tourism.

[11]  Alex Sherstinsky,et al.  Fundamentals of Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) Network , 2018, Physica D: Nonlinear Phenomena.

[12]  Eric Gilbert,et al.  VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text , 2014, ICWSM.

[13]  A. Mishra,et al.  Use of Classification Algorithms in Health Care , 2020 .

[14]  Ali Kashif Bashir,et al.  COVID-19 Patient Health Prediction Using Boosted Random Forest Algorithm , 2020, Frontiers in Public Health.

[15]  Chunyu Kit,et al.  Tokenization as the Initial Phase in NLP , 1992, COLING.

[16]  Praveen Kumar Reddy Maddikunta,et al.  An ensemble machine learning approach through effective feature extraction to classify fake news , 2021, Future Gener. Comput. Syst..

[17]  Douglas Williams,et al.  Deep Learning and its Application for Healthcare Delivery in Low and Middle Income Countries , 2021, Frontiers in Artificial Intelligence.

[18]  M. Thelwall Sentiment Analysis for Tourism , 2019, Big Data and Innovation in Tourism, Travel, and Hospitality.

[19]  Asad A. Aburumman COVID-19 impact and survival strategy in business tourism market: the example of the UAE MICE industry , 2020 .

[20]  Rui Ji,et al.  Prevalence of comorbidities and its effects in patients infected with SARS-CoV-2: a systematic review and meta-analysis , 2020, International Journal of Infectious Diseases.

[21]  Hamid Safi,et al.  Early detection of diabetic retinopathy. , 2018, Survey of ophthalmology.

[22]  Ken Thompson,et al.  Programming Techniques: Regular expression search algorithm , 1968, Commun. ACM.

[23]  UNWTO World Tourism Barometer and Statistical Annex, December 2020 , 2020, UNWTO World Tourism Barometer.

[24]  N. Kumaresh,et al.  A Comprehensive Study on Lexicon Based Approaches for Sentiment Analysis , 2019, Asian Journal of Computer Science and Technology.

[25]  Peter W. Foltz,et al.  An introduction to latent semantic analysis , 1998 .

[26]  Vikrant Kaushal,et al.  Hospitality and tourism industry amid COVID-19 pandemic: Perspectives on challenges and learnings from India , 2020, International Journal of Hospitality Management.

[27]  Amit Kumar Mishra,et al.  Advances in Computational Linguistics and Text Processing Frameworks , 2020 .

[29]  Faiza Manzoor,et al.  The Contribution of Sustainable Tourism to Economic Growth and Employment in Pakistan , 2019, International journal of environmental research and public health.

[30]  S. Becken,et al.  Sentiment Analysis in Tourism: Capitalizing on Big Data , 2019 .

[31]  Byeongki Jeong,et al.  Social media mining for product planning: A product opportunity mining approach based on topic modeling and sentiment analysis , 2017, Int. J. Inf. Manag..

[32]  Yong Yu,et al.  A Review of Recurrent Neural Networks: LSTM Cells and Network Architectures , 2019, Neural Computation.

[33]  D. T. Alamanda,et al.  Sentiment Analysis Using Text Mining of Indonesia Tourism Reviews via Social Media , 2019, International Journal of Humanities, Arts and Social Sciences.

[34]  Amit Kumar Mishra,et al.  Various Aspects of Sentiment Analysis: A Review , 2019, SSRN Electronic Journal.