Disaggregate Hotel Evaluation by Using Diverse Aspects from User Reviews

Experienced opinions about products and services can guide a potential user for a better purchase decision. Fine-grained aspect level opinions embedded within reviews must be explored to discover experienced users' latent opinion about the aspects (i.e. features of products like cost, value for money, etc.) and their relative importance. In this paper, we present an unsupervised approach for discovering coherent hotel aspects based on the user attention. This model effectively integrates techniques like topic modeling and word embeddings along with the frequent noun-adjective co-occurrence statistics to automatically discover coherent hotel aspects. Further supervised methods are used to understand the user's relative emphasis on the aspects and finally rank the hotels. This method does not assume any predefined seed words and discovers coherent level aspects by directly using user attention and word co-occurrence statistics in addition to topic modeling and word embeddings. The performance evaluation of this method was done by collecting various hotel reviews from multiple travel websites. Results show that the proposed methods improved the baseline performance up to 90%. Hence, the results thus obtained are very promising and indicate that the system is simple, scalable and most of all accurate in ranking hotels based on the latent aspects expressed in the user reviews.

[1]  Zhen Lin,et al.  Incorporating appraisal expression patterns into topic modeling for aspect and sentiment word identification , 2014, Knowl. Based Syst..

[2]  Georgiana Dinu,et al.  Don’t count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors , 2014, ACL.

[3]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[4]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[5]  Geoffrey Zweig,et al.  Linguistic Regularities in Continuous Space Word Representations , 2013, NAACL.

[6]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[7]  Ding Xiao,et al.  Coupled matrix factorization and topic modeling for aspect mining , 2018, Inf. Process. Manag..

[8]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[9]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[10]  ChengXiang Zhai,et al.  Opinion-based entity ranking , 2012, Information Retrieval.

[11]  Andrew McCallum,et al.  Optimizing Semantic Coherence in Topic Models , 2011, EMNLP.

[12]  Hwee Tou Ng,et al.  An Unsupervised Neural Attention Model for Aspect Extraction , 2017, ACL.

[13]  Hao Yu,et al.  Structure-Aware Review Mining and Summarization , 2010, COLING.

[14]  Li Chen,et al.  Review mining for estimating users' ratings and weights for product aspects , 2015, Web Intell..

[15]  Temple F. Smith Occam's razor , 1980, Nature.

[16]  Noémie Elhadad,et al.  An Unsupervised Aspect-Sentiment Model for Online Reviews , 2010, NAACL.

[17]  Boi Faltings,et al.  Reporting incentives and biases in online review forums , 2010, TWEB.

[18]  Mohammad Salehan,et al.  Predicting the performance of online consumer reviews: A sentiment mining approach to big data analytics , 2014, Decis. Support Syst..

[19]  S. Weisberg Applied Linear Regression , 1981 .