Forecasting Hotel Room Sales within Online Travel Agencies by Combining Multiple Feature Sets

Predicting hotel room sales from previous booking data is a prominent research topic in the online travel agency (OTA) sector. Various approaches have been proposed to predict hotel room sales over different prediction horizons, such as yearly demand or the daily number of reservations. An OTA website lists offers from many companies for the same hotel, and the position of a company’s offer on the website depends on the amount the company bids per click. Accurate prediction of the sales amount for a given bid is therefore crucial for revenue and cost management for companies in the sector. In this paper, we forecast the next day’s sales amount in order to estimate the daily revenue generated per hotel. An important contribution of our study is an enriched dataset constructed by combining the most informative features proposed in various related studies on hotel sales prediction. Moreover, we enrich this dataset with a set of OTA-specific features that capture the position of the company’s offers relative to those of its competitors on a travel metasearch engine website. We present a real application on the hotel room sales data of a large OTA in Turkey. The comparative results show that enriching the input representation with the additional OTA-specific features improves the generalization ability of the prediction models, and that tree-based boosting algorithms achieve the best results on this task.
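As a rough illustration of what combining feature sets for next-day forecasting could look like, the sketch below builds training pairs from lagged sales plus one OTA-style relative-position feature. It is not the feature set used in the study: the record fields (`our_price`, `competitor_prices`), the price-based ranking, and the helper names are all hypothetical, chosen only to make the idea concrete.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class DailyRecord:
    sales: float                    # sales amount for one hotel on one day
    our_price: float                # hypothetical: our offer's price on the metasearch site
    competitor_prices: List[float]  # hypothetical: competitors' prices for the same hotel


def relative_rank(our_price: float, competitor_prices: List[float]) -> int:
    """1-based position of our offer among all offers sorted by price (1 = cheapest)."""
    return 1 + sum(1 for p in competitor_prices if p < our_price)


def build_examples(history: List[DailyRecord], n_lags: int = 3) -> List[Tuple[List[float], float]]:
    """Turn a chronological history into (features, next_day_sales) pairs.

    Features for day t: sales on days t, t-1, ..., t-n_lags+1, followed by
    the relative rank of our offer on day t. The target is sales on day t+1.
    """
    examples = []
    for t in range(n_lags - 1, len(history) - 1):
        lags = [history[t - k].sales for k in range(n_lags)]
        rank = relative_rank(history[t].our_price, history[t].competitor_prices)
        examples.append((lags + [float(rank)], history[t + 1].sales))
    return examples
```

The resulting feature vectors could then be passed to any regressor, including a tree-based boosting model; the choice of lags and of the ranking signal here is an assumption for illustration only.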
