Artificial intelligence for hospitality big data analytics: developing a prediction model of restaurant review helpfulness for customer decision-making

Purpose Big data analytics allows researchers and industry practitioners to extract hidden patterns or discover new information and knowledge from big data. Although artificial intelligence (AI) is one of the emerging big data analytics techniques, the hospitality and tourism literature has made minimal effort to process and analyze big hospitality data through AI. Thus, this study aims to develop and compare prediction models for review helpfulness using machine learning (ML) algorithms to analyze big restaurant data.
Design/methodology/approach The study analyzed 1,483,858 restaurant reviews collected from Yelp.com. After a thorough literature review, the study identified four attributes comprising 11 key determinants of review helpfulness and added them to the prediction model. Four ML algorithms, namely, multivariate linear regression, random forest, support vector machine regression and extreme gradient boosting (XGBoost), were used to find a better prediction model for customer decision-making.
Findings By comparing performance metrics, the current study found that XGBoost was the best model for predicting review helpfulness among the selected popular ML algorithms. Results revealed that attributes regarding a reviewer’s credibility were fundamental factors determining a review’s helpfulness. Credibility even outweighed ratings and linguistic content, such as sentiment and subjectivity, in determining review helpfulness.
Practical implications The current study helps restaurant operators attract customers by predicting review helpfulness through ML-based predictive modeling and presenting potentially helpful reviews based on critical attributes including review, reviewer, restaurant and linguistic content. Using AI, online review platforms and restaurant websites can enhance customers’ attitudes and purchase decision-making by reducing information overload and search costs, highlighting the most crucial review helpfulness features and providing user-friendly automated search results.
Originality/value To the best of the authors’ knowledge, the current study is the first to develop a prediction model of review helpfulness and reveal essential factors for helpful reviews. Furthermore, the study presents a state-of-the-art ML model that surpasses the conventional models’ prediction accuracy. The findings will improve practitioners’ marketing strategies by focusing on factors that influence customers’ decision-making.
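
The model comparison described under Findings can be illustrated with a minimal sketch. The abstract does not name the performance metrics used, so root mean square error (RMSE) and mean absolute error (MAE) are assumed here; the function names, the placeholder model names and the toy numbers are all hypothetical and are not results from the study.

```python
import math


def rmse(y_true, y_pred):
    """Root mean square error: penalises large misses more heavily."""
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(y_true))


def mae(y_true, y_pred):
    """Mean absolute error: the average size of a miss."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def rank_models(y_true, predictions):
    """Return (name, rmse, mae) tuples sorted by ascending RMSE."""
    scores = [
        (name, rmse(y_true, y_pred), mae(y_true, y_pred))
        for name, y_pred in predictions.items()
    ]
    return sorted(scores, key=lambda score: score[1])


# Toy held-out helpful-vote counts and model predictions.
# These values are invented for illustration only (assumption).
y_true = [0, 2, 5, 1]
predictions = {
    "model_a": [1, 2, 3, 1],  # hypothetical candidate model
    "model_b": [0, 2, 4, 1],  # hypothetical candidate model
}

ranking = rank_models(y_true, predictions)
best_name = ranking[0][0]
for name, model_rmse, model_mae in ranking:
    print(f"{name}: RMSE={model_rmse:.3f} MAE={model_mae:.3f}")
```

In practice, each entry in `predictions` would hold the held-out predictions of one fitted algorithm, and the model at the top of the ranking would be the one reported as best under the chosen metric.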
