Mining Tourist's Perception toward Indonesia Tourism Destination Using Sentiment Analysis and Topic Modelling

Indonesia's Tourism industries is the second-best contributor to country's foreign exchange income for years. The growth it produced were in range of 7 to 10 percent per year since 2009. In reality, Bali is always at the top of the mind among majority of international tourists inspite of many hidden gems with spectacular qualities to rival Bali's popularity. Therefore, Indonesia's government has set new tourist destinations to increase their presence thus increasing visitation numbers. Researches on utilizing big data to support industry 4.0 model in tourism businesses are encouraged as part of national research priorities. Data Analytics models such as Sentiment Analysis and Topic Modelling can be used to reveal hidden patterns from abundant user-generated content data available in social media sites, one of which, TripAdvisor. This study aims to mine the visitors' perceptions of 10 most visited sites in Indonesia. Emotions and topics discussed in comments are two features to be extracted. Using data mining framework, five types of emotion and topics related to tourism were discovered. Data collection was done using Parsehub to acquire 3494 comments generated in 4 years. Both Sentiment Analysis and Topic Modelling were processed in Orange 3. Results shows that Joy is the most prominent emotion accompanying visitors' experiences. Topic modeling shows several important keywords toward preferences. Results of this study can be used to improve Indonesia's tourism stakeholder's decision quality especially in terms of marketing and operations.

[1]  A. Alamsyah,et al.  SENTIMENT ANALYSIS BASED ON APPRAISAL THEORY FOR MARKETING INTELLIGENCE IN INDONESIA’S MOBILE PHONE MARKET , 2015 .

[2]  S. Urbina,et al.  Psychological testing, 7th ed. , 1997 .

[3]  J. Holloway,et al.  The Business of Tourism , 1989 .

[4]  Namuk Ko,et al.  Identifying Product Opportunities Using Social Media Mining: Application of Topic Modeling and Chance Discovery Theory , 2018, IEEE Access.

[5]  Geoffrey Wall,et al.  Environmental Management, Environmental Image and the Competitive Tourist Attraction , 2005 .

[6]  C. Ngwira,et al.  What attracts tourists to a destination? Is it attractions? , 2018 .

[7]  Kadir Çakar Experiences of visitors to Gallipoli, a nostalgia-themed dark tourism destination: an insight from TripAdvisor , 2018 .

[8]  Roger G. Stone,et al.  Naive Bayes vs. Decision Trees vs. Neural Networks in the Classification of Training Web Pages , 2009 .

[9]  Ingoo Han,et al.  The Effect of On-Line Consumer Reviews on Consumer Purchasing Intention: The Moderating Role of Involvement , 2007, Int. J. Electron. Commer..

[10]  Babis Theodoulidis,et al.  Analyzing Customer Experience Feedback Using Text Mining , 2014 .

[11]  Kris Shaffer Topic Modeling: A Basic Introduction ― Journal of Digital Humanities , 2016 .

[12]  G. Lohmann,et al.  Tourism Theory: Concepts, Models and Systems , 2016 .

[13]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[14]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[15]  Enrique Herrera-Viedma,et al.  Sentiment analysis: A review and comparative analysis of web services , 2015, Inf. Sci..

[16]  Sergio Toral,et al.  Post-visit and pre-visit tourist destination image through eWOM sentiment analysis and perceived helpfulness , 2016 .

[17]  Laura Ferrari,et al.  A Comparison between Preprocessing Techniques for Sentiment Analysis in Twitter , 2016, KDWeb.

[18]  G. Cetin,et al.  Tourism Destination: Design of Experiences , 2016 .

[19]  Yong Shi,et al.  The Role of Text Pre-processing in Sentiment Analysis , 2013, ITQM.

[20]  S. F. Lopes Destination image: origins, developments and implications. , 2011 .

[21]  Saif M. Mohammad,et al.  Sentiment Analysis: Detecting Valence, Emotions, and Other Affectual States from Text , 2016, ArXiv.