Foreign arrivals nowcasting in Italy with Google Trends data

The development of the ICT has deeply transformed the tourism industry. ICT has become a key determinant for competitiveness that deeply impacts on marketing and communication strategies. Online Travel Agency is accumulating a huge mass of valuable information. Web Data (Big Data) can actually represent an up-to-date information, which can be used as a support to improve statistical information, especially for monitoring current phenomena, as arrivals, spent nights, or the average length of stay. In this respect, an interesting issue is the assessment of the contribution of Web data for forecasting tourism flows. Specifically, nowcasting is a special case of forecasting as it deals with the knowledge of the present, immediate past and very near future. The aim of the paper is to assess the effective advantage of Google Trends (GT) data in forecasting tourist arrivals in Italy. The analysis is related to monthly foreign arrivals in tourist accommodations facilities. Google Trends data are used to predict the monthly number of foreign arrivals released by the Italian national statistical office, which is the dependent variable. Specifically, we have assessed the contribution of lagged GT variables in a standard ARIMA model and in a time series regression model with seasonal dummies and autoregressive components.

[1]  Yan Carrière-Swallow,et al.  Nowcasting With Google Trends in an Emerging Market , 2013 .

[2]  Bing Pan,et al.  Online information search: vacation planning process. , 2006 .

[3]  L. Magee A note on Cochrane−Orcutt estimation , 1987 .

[4]  C. Artola,et al.  Can internet searches forecast tourism inflows , 2015 .

[5]  Shirley Almon The Distributed Lag Between Capital Appropriations and Expenditures , 1965 .

[6]  Yang Yang,et al.  Spatial effects in regional tourism growth , 2014 .

[7]  Torsten Schmidt,et al.  Forecasting Private Consumption: Survey-Based Indicators vs. Google Trends , 2009 .

[8]  Declan Butler,et al.  When Google got flu wrong , 2013, Nature.

[9]  Konstantinos Nikolopoulos,et al.  The Tourism Forecasting Competition , 2011 .

[10]  Rob J Hyndman,et al.  Automatic Time Series Forecasting: The forecast Package for R , 2008 .

[11]  Haiyan Song,et al.  Recent Developments in Econometric Modeling and Forecasting , 2005 .

[12]  H. Varian,et al.  Predicting the Present with Google Trends , 2009 .

[13]  C. Granger,et al.  Co-integration and error correction: representation, estimation and testing , 1987 .

[14]  Jiekuan Zhang,et al.  Comparative Advantage: Explaining tourism flows , 2007 .

[15]  Rein Ahas,et al.  Measuring tourism destinations using mobile tracking data , 2016 .

[16]  P. Phillips,et al.  Asymptotic Properties of Residual Based Tests for Cointegration , 1990 .

[17]  M. Fuchs,et al.  Big data analytics for knowledge generation in tourism destinations – A case from Sweden , 2014 .

[18]  H. Varian,et al.  Predicting the Present with Google Trends , 2012 .

[19]  A. Zeileis Econometric Computing with HC and HAC Covariance Matrix Estimators , 2004 .

[20]  Prosper F. Bangwayo-Skeete,et al.  Can Google data improve the forecasting performance of tourist arrivals? Mixed-data sampling approach , 2015 .

[21]  Wolfgang Lehner,et al.  Forecasting the data cube: A model configuration advisor for multi-dimensional data sets , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[22]  D. Lazer,et al.  The Parable of Google Flu: Traps in Big Data Analysis , 2014, Science.

[23]  G. Crouch Destination Competitiveness: An Analysis of Determinant Attributes , 2011 .

[24]  Mahalia Jackman,et al.  Research Note: Nowcasting Tourist Arrivals in Barbados – Just Google it! , 2015 .

[25]  Emmanuel Sirimal Silva,et al.  Forecasting with Big Data: A Review , 2015, Annals of Data Science.

[26]  Chaang-Iuan Ho,et al.  Web users' behavioural patterns of tourism information search: from online to offline. , 2012 .

[27]  Josep Blat,et al.  An Analysis of Visitors' Behavior in the Louvre Museum: A Study Using Bluetooth Data , 2014, ArXiv.

[28]  Klaus F. Zimmermann,et al.  Google Econometrics and Unemployment Forecasting , 2009 .

[29]  T. Chai,et al.  Root mean square error (RMSE) or mean absolute error (MAE)? – Arguments against avoiding RMSE in the literature , 2014 .

[30]  Allan M. Williams,et al.  Tourism: Conceptualizations, Institutions, and Issues , 2008 .

[31]  Irem Önder,et al.  Forecasting city arrivals with Google Analytics , 2016 .

[32]  D. Cochrane,et al.  Application of Least Squares Regression to Relationships Containing Auto-Correlated Error Terms , 1949 .

[33]  Haiyan Song,et al.  Tourism demand modelling and forecasting—A review of recent research , 2008 .

[34]  David M. Pennock,et al.  Predicting consumer behavior with Web search , 2010, Proceedings of the National Academy of Sciences.

[35]  Peter Nijkamp,et al.  Quantitative methods in tourism economics , 2013 .

[36]  Jean-Marie Dufour,et al.  The Cochrane-Orcutt Procedure: Numerical Examples of Multiple Admissible Minima , 1980 .

[37]  Roberto Rivera,et al.  A dynamic linear model to forecast hotel registrations in Puerto Rico using Google Trends data , 2015, 1512.08097.

[38]  Richard A. Ashley,et al.  Statistically significant forecasting improvements: how much out-of-sample data is likely necessary? ☆ , 2003 .

[39]  Noam Shoval,et al.  Tracking tourists in the digital age , 2007 .