A study on online travel reviews through intelligent data analysis

The purpose of this paper is to show the application of a set of intelligent data analysis techniques to about 7 million of online travel reviews, with the aim of automatically extracting useful information. The reviews, collected from two popular online tourism-related review platforms, are all those posted by reviewers about one specific Italian location, from 2010 to 2017. To carry out the study, the following methodology is applied: a preliminary statistical analysis is performed to acquire general knowledge about the datasets, such as the geographical distribution of reviewers, their activities, and a comparison among the time of visits and the average scores of the reviews. Then, Natural Language Processing techniques are applied to extract and compare the most frequent words used in the two platforms. Finally, an Association Rule Learning algorithm is applied, to extract preferred destinations for distinct groups of reviewers, by mining interesting associations among the countries of origin of the reviewers and the most frequent destinations visited. By elaborating the available data, it is possible to automatically disclose valuable information for consumers and providers. The information automatically extracted can be exploited, for example, to build a recommender system for customers or a market analysis tool for service providers.

[1]  Fabio Stella,et al.  Analyzing user reviews in tourism with topic models , 2015, Information Technology & Tourism.

[2]  Wei Chen,et al.  The influence of user-generated content on traveler behavior: An empirical investigation on the effects of e-word-of-mouth to hotel online bookings , 2011, Comput. Hum. Behav..

[3]  Bin Guo,et al.  The order effect on online review helpfulness: A social influence perspective , 2017, Decis. Support Syst..

[4]  Nikolaos Stylos,et al.  ‘You will like it!’ using open data to predict tourists' response to a tourist attraction , 2017 .

[5]  B. Gu,et al.  The impact of online user reviews on hotel room sales , 2009 .

[6]  S. Becken,et al.  Sentiment Analysis in Tourism: Capitalizing on Big Data , 2019 .

[7]  Mark I. McCarthy,et al.  The South Asian Genome , 2014, PloS one.

[8]  Stuart J. Barnes,et al.  Understanding the Impact of Online Reviews on Hotel Performance , 2017 .

[9]  Dietmar Plenz,et al.  powerlaw: A Python Package for Analysis of Heavy-Tailed Distributions , 2013, PloS one.

[10]  Lin Wang,et al.  The impact research of online reviews' sentiment polarity presentation on consumer purchase decision , 2017, Inf. Technol. People.

[11]  Shanshan Qi,et al.  An Application of Apriori Algorithm Association Rules Mining to Profiling the Heritage Visitors of Macau , 2015, ENTER.

[12]  F. Okumus,et al.  Understanding Satisfied and Dissatisfied Hotel Customers: Text Mining of Online Hotel Reviews , 2016 .

[13]  Huaping Chen,et al.  Sellers versus buyers: differences in user information sharing on social commerce sites , 2016, Inf. Technol. People.

[14]  Claudio Vitari,et al.  The Effect of Brand on the Impact of e-WOM on Hotels’ Financial Performance , 2017, Int. J. Electron. Commer..

[15]  Yen-Liang Chen,et al.  Opinion mining from online hotel reviews - A text summarization approach , 2017, Inf. Process. Manag..

[16]  Paulo Duarte,et al.  Social media use for travel purposes: a cross cultural comparison between Portugal and the UK , 2017, J. Inf. Technol. Tour..

[17]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[18]  Ewan Klein,et al.  Natural Language Processing with Python , 2009 .

[19]  Miriam J. Metzger,et al.  Mitigating risk in ecommerce transactions: perceptions of information credibility and the role of user-generated ratings in product quality and purchase intention , 2014, Electron. Commer. Res..

[20]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[21]  Chunxia Zhang,et al.  Identifying Helpful Online Reviews with Word Embedding Features , 2016, KSEM.

[22]  Mingshu Wang,et al.  From stay to play – A travel planning tool based on crowdsourcing user-generated contents , 2017 .

[23]  Zheng Xiang,et al.  Perceptual mapping of hotel brands using online reviews: a text analytics approach , 2016, J. Inf. Technol. Tour..

[24]  Kyung Hyan Yoo,et al.  Use and Impact of Online Travel Reviews , 2008, ENTER.

[25]  Ingrid Moerman,et al.  Pattern mining in tourist attraction visits through association rule learning on Bluetooth tracking data: A case study of Ghent, Belgium , 2014 .

[26]  Maria Lexhagen,et al.  Topic Detection: Identifying Relevant Topics in Tourism Reviews , 2016, ENTER.

[27]  Montse Cuadros,et al.  Automatic analysis of textual hotel reviews , 2015, Information Technology & Tourism.

[28]  Shu-Hsien Liao,et al.  Mining customer knowledge for tourism new product development and customer relationship management , 2010, Expert Syst. Appl..

[29]  Mohammad Salehan,et al.  Predicting the performance of online consumer reviews: A sentiment mining approach to big data analytics , 2014, Decis. Support Syst..

[30]  Dong Chen,et al.  Identifying places of interest for tourists using knowledge discovery techniques , 2014, 2014 International Conference on Industrial Automation, Information and Communications Technology.

[31]  Lucas C. Parra,et al.  Origins of power-law degree distribution in the heterogeneity of human activity in social networks , 2013, Scientific Reports.

[32]  B. Pan,et al.  A retrospective view of electronic word-of-mouth in hospitality and tourism management , 2017 .

[33]  U. Gretzel,et al.  Role of social media in online travel information search , 2010 .

[34]  M. Shamim Hossain,et al.  Word-of-Mouth Understanding: Entity-Centric Multimodal Aspect-Opinion Mining in Social Media , 2015, IEEE Transactions on Multimedia.

[35]  Alain Yee-Loong Chong,et al.  Predicting consumer product demands via Big Data: the roles of online promotional marketing and online reviews , 2017, Int. J. Prod. Res..

[36]  Rob Law,et al.  Incorporating Both Positive and Negative Association Rules into the Analysis of Outbound Tourism in Hong Kong , 2010 .

[37]  B. Sparks,et al.  The impact of online reviews on hotel booking intentions and perception of trust. , 2011 .

[38]  Alok Mishra,et al.  Factors influencing consumer intention in social commerce adoption , 2017, Inf. Technol. People.