A MIDAS modelling framework for Chinese inflation index forecast incorporating Google search data

We propose a MIDAS framework which incorporates Google search data to predict Chinese inflation index.We extract five groups of keywords, which include both subject terms and trends terms to construct search data.The high frequency part of Google search data is directly used in MIDAS model to predict monthly CPI.The MIDAS model outperforms other benchmark models in CPI forecast. Increased internet penetration makes it possible for user generated content (UGC) to reflect people's insights and expectations on economic activities. As representative and easily accessible UGC data that reflect public opinions on economic issues, Google search data have been used to forecast macroeconomic indicators in existing literatures. However, very little empirical research has directly used Google search data to improve the forecast accuracy. This paper proposes an integrated framework, which constructs keywords base and extracts search data accordingly, and then incorporates the search data into a mixed data sampling (MIDAS) model. Five groups of search data are extracted based on the constructed keywords and are then used in MIDAS model to forecast Chinese consumer price index (CPI) from 2004 to 2012. The empirical results indicate that the search data are strongly correlated with CPI, which is officially released by the Statistic Bureau of China; the MIDAS model including the search data outperforms the benchmark models, with the average reduction of root mean square error (RMSE) being 32.9%. This research provides a rigorous and generalizable framework for macroeconomic trend prediction using Google search data, and would have great potential in supporting business decisions by eliciting relevant information from UGC data in the Internet.

[1]  David F. Findley,et al.  New Capabilities and Methods of the X-12-ARIMA Seasonal-Adjustment Program , 1998 .

[2]  Jeffrey A. Frankel,et al.  Can Leading Indicators Assess Country Vulnerability? Evidence from the 2008-09 Global Financial Crisis , 2011 .

[3]  Yan Carrière-Swallow,et al.  Nowcasting With Google Trends in an Emerging Market , 2013 .

[4]  John Randolph Norsworthy,et al.  Macroeconomic Policy as Implicit Industrial Policy: Its Industry and Enterprise Effects , 1997 .

[5]  Werner Antweiler,et al.  Is All that Talk Just Noise? The Information Content of Internet Stock Message Boards , 2001 .

[6]  H. Varian,et al.  Predicting the Present with Google Trends , 2012 .

[7]  Bin Gu,et al.  Confirmation Bias, Overconfidence, and Investment Performance: Evidence from Stock Message Boards , 2010 .

[8]  D. Dickey,et al.  Testing for unit roots in autoregressive-moving average models of unknown order , 1984 .

[9]  P. Young,et al.  Time series analysis, forecasting and control , 1972, IEEE Transactions on Automatic Control.

[10]  Torsten Schmidt,et al.  A monthly consumption indicator for Germany based on Internet search query data , 2010 .

[11]  Min Wei,et al.  Do macro variables, asset markets, or surveys forecast inflation better? , 2006 .

[12]  Simon van Norden,et al.  The reliability of inflation forecasts based on output gap estimates in real time , 2004 .

[13]  Lars E. O. Svensson,et al.  Open-Economy Inflation Targeting , 1997 .

[14]  J. Stock,et al.  Macroeconomic Forecasting Using Diffusion Indexes , 2002 .

[15]  Ying Liu,et al.  A Study on Correlation between Web Search Data and CPI , 2012 .

[16]  Raphael N. Markellos,et al.  Information Demand and Stock Market Volatility , 2012 .

[17]  E. Ghysels,et al.  MIDAS Regressions: Further Results and New Directions , 2006 .

[18]  C. Sims MACROECONOMICS AND REALITY , 1977 .

[19]  James D. Hamilton Specification testing in Markov-switching time-series models , 1996 .

[20]  Scott D. Schuh An evaluation of recent macroeconomic forecast errors , 2001 .

[21]  Hsinchun Chen,et al.  A Lexicon-Enhanced Method for Sentiment Classification: An Experiment on Online Product Reviews , 2010, IEEE Intelligent Systems.

[22]  David M. Pennock,et al.  Predicting consumer behavior with Web search , 2010, Proceedings of the National Academy of Sciences.

[23]  Shouyang Wang,et al.  Webpage Mining for Inflation Emergency Early Warning , 2013, WAIM Workshops.

[24]  Kin Keung Lai,et al.  CRUDE OIL PRICE FORECASTING WITH TEI@I METHODOLOGY ∗ , 2005 .

[25]  Adnan Haider,et al.  Neural network models for inflation forecasting: an appraisal , 2012 .

[26]  P. Phillips Testing for a Unit Root in Time Series Regression , 1988 .

[27]  Michael P. Clements,et al.  Macroeconomic Forecasting With Mixed-Frequency Data , 2008 .

[28]  Matthias Bank,et al.  Google search volume and its influence on liquidity and returns of German stocks , 2010 .

[29]  Zhi Da,et al.  In Search of Attention , 2009 .

[30]  E. Brynjolfsson,et al.  The Future of Prediction: How Google Searches Foreshadow Housing Prices and Sales , 2013, ICIS 2013.

[31]  Eric Ghysels,et al.  Série Scientifique Scientific Series the Midas Touch: Mixed Data Sampling Regression Models the Midas Touch: Mixed Data Sampling Regression Models* , 2022 .

[32]  Chenchuramaiah T. Bathala Giving Content to Investor Sentiment: The Role of Media in the Stock Market , 2007 .

[33]  P. Wachtel,et al.  Inflation Thresholds and the Finance-Growth Nexus , 2002 .

[34]  Olivia Sheng,et al.  Investigating Predictive Power of Stock Micro Blog Sentiment in Forecasting Future Stock Price Directional Movement , 2011, ICIS.

[35]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1971 .

[36]  Joel Peress,et al.  Media Coverage and the Cross-Section of Stock Returns , 2008 .

[37]  Terence C. Mills,et al.  Time series techniques for economists , 1990 .

[38]  Emi Nakamura,et al.  Inflation forecasting using a neural network , 2005 .

[39]  Qun Liu,et al.  HHMM-based Chinese Lexical Analyzer ICTCLAS , 2003, SIGHAN.

[40]  Maximilian Podstawski,et al.  Google Searches as a Means of Improving the Nowcasts of Key Macroeconomic Variables , 2009 .

[41]  Giselle C. Guzman,et al.  Internet Search Behavior as an Economic Forecasting Tool: The Case of Inflation Expectations , 2011 .

[42]  Rachana Shanbhogue,et al.  Using Internet Search Data as Economic Indicators , 2011 .

[43]  Francesco Ravazzolo,et al.  Real-Time Inflation Forecasting in a Changing World , 2013 .

[44]  Jeffrey M. Woodbridge Econometric Analysis of Cross Section and Panel Data , 2002 .

[45]  Marco Lippi,et al.  The Generalized Dynamic Factor Model , 2002 .

[46]  C. Emre Alper,et al.  Forecasting Stock Market Volatilities Using MIDAS Regressions: An Application to the Emerging Markets , 2008 .

[47]  Torsten Schmidt,et al.  Forecasting Private Consumption: Survey-Based Indicators vs. Google Trends , 2009 .

[48]  H. Varian,et al.  Predicting the Present with Google Trends , 2009 .

[49]  Chien Chin Chen,et al.  Business Cycle Indication Using Query Logs of Search Engines , 2010, 2010 International Conference on P2P, Parallel, Grid, Cloud and Internet Computing.

[50]  Eric Ghysels,et al.  Regression Models with Mixed Sampling Frequencies , 2010 .

[51]  E. Britton,et al.  The Inflation Report projections: understanding the fan chart , 1997 .

[52]  N. Askitas,et al.  Google Econometrics and Unemployment Forecasting , 2009, SSRN Electronic Journal.

[53]  C. Granger,et al.  Co-integration and error correction: representation, estimation and testing , 1987 .

[54]  John H. Gerdes,et al.  Using web-based search data to predict macroeconomic statistics , 2005, CACM.