Finding a needle in a haystack of reviews: cold start context-based hotel recommender system

Online hotel searching is a daunting task due to the wealth of online information. Reviews written by other travelers replace the word-of-mouth, yet turn the search into a time consuming task. Users do not rate enough hotels to enable a collaborative filtering based recommendation. Thus, a cold start recommender system is needed. In this work we design a cold start hotel recommender system, which uses the text of the reviews as its main data. We define context groups based on reviews extracted from TripAdvisor.com and Venere.com. We introduce a novel weighted algorithm for text mining. Our algorithm imitates a user that favors reviews written with the same trip intent and from people of similar background (nationality) and with similar preferences for hotel aspects, which are our defined context groups. Our approach combines numerous elements, including unsupervised clustering to build a vocabulary for hotel aspects, semantic analysis to understand sentiment towards hotel features, and the profiling of intent and nationality groups. We implemented our system which was used by the public to conduct 150 trip planning experiments. We compare our solution to the top suggestions of the mentioned web services and show that users were, on average, 20% more satisfied with our hotel recommendations. We outperform these web services even more in cities where hotel prices are high.

[1]  Siamak Faridani Using canonical correlation analysis for generalized sentiment analysis, product recommendation and search , 2011, RecSys '11.

[2]  Lei Zhang,et al.  Alleviating Cold-Start Problem by Using Implicit Feedback , 2009, ADMA.

[3]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[4]  Anh Duc Duong,et al.  Addressing cold-start problem in recommendation systems , 2008, ICUIMC '08.

[5]  Noémie Elhadad,et al.  An Unsupervised Aspect-Sentiment Model for Online Reviews , 2010, NAACL.

[6]  Wei Chu,et al.  Information Services]: Web-based services , 2022 .

[7]  Alexander Tuzhilin,et al.  The long tail of recommender systems and how to leverage it , 2008, RecSys '08.

[8]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[9]  Sean M. McNee,et al.  Getting to know you: learning new user preferences in recommender systems , 2002, IUI '02.

[10]  Gediminas Adomavicius,et al.  Context-aware recommender systems , 2008, RecSys '08.

[11]  Bamshad Mobasher,et al.  Context-Aware Recommendation Based On Review Mining , 2011, ITWP@IJCAI.

[12]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Kenneth Ward Church,et al.  Using Statistics in Lexical Analysis , 2003, Lexical Acquisition: Exploiting On-Line Resources to Build a Lexicon.

[14]  John K. Debenham,et al.  Informed Recommender: Basing Recommendations on Consumer Product Reviews , 2007, IEEE Intelligent Systems.

[15]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[16]  David M. Pennock,et al.  Categories and Subject Descriptors , 2001 .

[17]  Bing Liu,et al.  Sentiment Analysis and Subjectivity , 2010, Handbook of Natural Language Processing.

[18]  Uri Zernik,et al.  Lexical acquisition: Exploiting on-line resources to build a lexicon. , 1991 .

[19]  Amélie Marian,et al.  Beyond the Stars: Improving Rating Predictions using Review Text Content , 2009, WebDB.

[20]  Philip S. Yu,et al.  A holistic lexicon-based approach to opinion mining , 2008, WSDM '08.

[21]  Tianyi Jiang,et al.  Improving Personalization Solutions through Optimal Segmentation of Customer Bases , 2006, IEEE Transactions on Knowledge and Data Engineering.

[22]  Bing Liu,et al.  Mining Opinion Features in Customer Reviews , 2004, AAAI.

[23]  Wolfgang Woerndl,et al.  Introducing Context into Recommender Systems , 2007 .

[24]  Stefan Bornholdt,et al.  Detecting fuzzy community structures in complex networks with a Potts model. , 2004, Physical review letters.

[25]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[26]  Giorgos Stamou,et al.  Context-sensitive semantic query expansion , 2002, Proceedings 2002 IEEE International Conference on Artificial Intelligence Systems (ICAIS 2002).

[27]  J. Reichardt,et al.  Statistical mechanics of community detection. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[28]  John Riedl,et al.  Learning preferences of new users in recommender systems: an information theoretic approach , 2008, SKDD.

[29]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[30]  Pádraig Cunningham,et al.  An on-line evaluation framework for recommender systems , 2002 .

[31]  Iryna Gurevych,et al.  Beyond the stars: exploiting free-text user reviews to improve the accuracy of movie recommendations , 2009, TSA@CIKM.