Hotel2vec: Learning Attribute-Aware Hotel Embeddings with Self-Supervision

We propose a neural network architecture for learning vector representations of hotels. Unlike previous work, which typically uses only user click information to learn item embeddings, our framework combines several sources of data: user clicks, hotel attributes (e.g., property type, star rating, average user rating), amenity information (e.g., whether the hotel offers free Wi-Fi or free breakfast), and geographic information. During training, a joint embedding is learned from all of these sources. We show that including structured hotel attributes yields better predictions in a downstream task than relying exclusively on click data. We train our embedding model on more than 40 million user click sessions from a leading online travel platform and learn embeddings for more than one million hotels. The final learned embeddings integrate distinct sub-embeddings for user clicks, hotel attributes, and geographic information, providing an interpretable representation that can be used flexibly depending on the application. We show empirically that our model generates high-quality representations that improve the performance of a hotel recommendation system, among other applications. An important advantage of the proposed model is that it addresses the cold-start problem for hotels with insufficient historical click information by incorporating hotel attributes, which are available for all hotels.
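
The idea of a joint embedding built from distinct sub-embeddings can be illustrated with a minimal NumPy sketch. This is an assumption-laden illustration, not the architecture from the paper: the layer sizes, the ReLU projections, the random (untrained) weights, and the names `hotel_embedding`, `click_table`, `W_attr`, `W_geo`, and `W_out` are all hypothetical. It only shows how click, attribute, and geographic sub-embeddings might be concatenated and projected, and how a hotel without click history could still receive a representation from its attributes.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes, chosen only for illustration.
N_HOTELS, N_ATTRS = 5, 4
D_CLICK, D_ATTR, D_GEO, D_OUT = 8, 4, 2, 6

# Randomly initialised parameters; in a real model these would be trained.
click_table = rng.normal(size=(N_HOTELS, D_CLICK))  # one row per hotel id
W_attr = rng.normal(size=(N_ATTRS, D_ATTR))         # attributes -> sub-embedding
W_geo = rng.normal(size=(2, D_GEO))                 # (lat, lon) -> sub-embedding
W_out = rng.normal(size=(D_CLICK + D_ATTR + D_GEO, D_OUT))


def relu(x):
    return np.maximum(x, 0.0)


def hotel_embedding(hotel_id, attrs, latlon, has_clicks=True):
    """Combine three sub-embeddings into one joint vector (illustrative only)."""
    # Assumption: a cold-start hotel gets a zero click sub-embedding.
    click = click_table[hotel_id] if has_clicks else np.zeros(D_CLICK)
    attr = relu(attrs @ W_attr)
    geo = relu(latlon @ W_geo)
    joint = np.concatenate([click, attr, geo])
    return relu(joint @ W_out)


attrs = np.array([1.0, 0.0, 4.0, 4.5])  # e.g. free Wi-Fi, breakfast, stars, rating
latlon = np.array([0.5, -0.1])          # already-normalised coordinates (assumed)

warm = hotel_embedding(0, attrs, latlon)
cold = hotel_embedding(0, attrs, latlon, has_clicks=False)
print(warm.shape, cold.shape)
```

Keeping the three sub-embeddings as separate slices before the final projection is what would let an application inspect or reuse them individually, which is the flexibility the abstract describes.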
