Can Web Search Queries Predict Prices Change on the Real Estate Market?

This study aims to explore whether the intensity of internet searches, according to the Google Trends search volume index (SVI), is a predictor of changes in real estate prices. The motivation of this study is the possibility to extend the understanding of the extra predictive power of Google search engine query volume of future housing price change (shift direction) by (i) the introduction of a research approach that combines the advantages of the complementary use of cross-correlation analysis and machine learning classification algorithms; (ii) applying the multi-class HPI values classifier which allows predicting the housing price increase, decrease or relative stability; (iii) exploiting the SVI that relates to interests in both ‘real estate’ and ‘credit to buy real estate’; (iv) evaluation of the introduced approach in the context of the Polish real estate market. The main theoretical contribution of our work is a confirmation that the freely available information regarding Google user searches can provide an in-depth insight into enriching the generally accepted statistics on supply and demand in the real estate market. From the practical perspective, this research confirms that SVI can be associated as a sole determinant to anticipate the housing price change with time-lag sufficient for making decisions regarding the purchase (sale) of individual property or the real estate market control. Such findings can be also helpful for researchers who intend to use Google Trends data as an extra variable from demand side to improve the prediction accuracy if it is included in the model which is based on the existing housing prices determinants.

[1]  Hrvoje Jerkovic,et al.  Real Estate Market Price Prediction Framework Based on Public Data Sources with Case Study from Croatia , 2020, ACIIDS.

[2]  Rui Tao,et al.  Forecasting Crude Oil Prices Based on An Internet Search Driven Model , 2018, 2018 IEEE International Conference on Big Data (Big Data).

[3]  Irina Matijosaitiene,et al.  Predicting Safe Parking Spaces: A Machine Learning Approach to Geospatial Urban and Crime Data , 2019, Sustainability.

[4]  Weiguo Fan,et al.  Crowd characteristics and crowd wisdom: Evidence from an online investment community , 2019, J. Assoc. Inf. Sci. Technol..

[5]  Eli Beracha,et al.  Forecasting Residential Real Estate Price Changes from Online Search Activity , 2014 .

[6]  Hieu Pham,et al.  Optimizing Ensemble Weights for Machine Learning Models: A Case Study for Housing Price Prediction , 2019 .

[7]  Terry Harris,et al.  Credit scoring using the clustered support vector machine , 2015, Expert Syst. Appl..

[8]  Saurabh Jain,et al.  Comprehensive Analysis of Housing Price Prediction in Pune Using Multi-Featured Random Forest Approach , 2018, 2018 Fourth International Conference on Computing Communication Control and Automation (ICCUBEA).

[9]  Ankur Chaturvedi,et al.  Parameterized Comparison of Regularized Regression Models to Develop Models for Real Estate , 2021 .

[10]  Kuriakose Athappilly,et al.  A comparative predictive analysis of neural networks (NNs), nonlinear regression and classification and regression tree (CART) models , 2005, Expert Syst. Appl..

[11]  S. Masrom,et al.  An Overview of Real Estate Modelling Techniques for House Price Prediction , 2020 .

[12]  I. Ajzen,et al.  Belief, Attitude, Intention, and Behavior: An Introduction to Theory and Research , 1977 .

[13]  Hoang Nguyen,et al.  A new technique to predict fly-rock in bench blasting based on an ensemble of support vector regression and GLMNET , 2019, Engineering with Computers.

[14]  Wiwik Anggraeni,et al.  Using Google Trend data in forecasting number of dengue fever cases with ARIMAX method case study: Surabaya, Indonesia , 2016, 2016 International Conference on Information & Communication Technology and Systems (ICTS).

[15]  Geonwoo Kim,et al.  Forecasting the Volatility of Stock Market Index Using the Hybrid Models with Google Domestic Trends , 2019, Fluctuation and Noise Letters.

[16]  Scott Loveridge,et al.  Forecasting state-level premature deaths from alcohol, drugs, and suicides using Google Trends data. , 2017, Journal of affective disorders.

[17]  Manfred Kochen,et al.  On the economics of information , 1972, J. Am. Soc. Inf. Sci..

[18]  Rajendra Kulkarni,et al.  Forecasting Housing Prices with Google Econometrics , 2009 .

[19]  Suraya Masrom,et al.  Machine Learning Price Prediction on Green Building Prices , 2020, 2020 IEEE Symposium on Industrial Electronics & Applications (ISIEA).

[20]  M. Dilmaghani,et al.  Workopolis or The Pirate Bay: what does Google Trends say about the unemployment rate? , 2019, Journal of Economic Studies.

[21]  Rachana Shanbhogue,et al.  Using Internet Search Data as Economic Indicators , 2011 .

[22]  G. Kent Webb,et al.  Internet Search Statistics as a Source of Business Intelligence: Searches on Foreclosure as an Estimate of Actual Home Foreclosures , 2009 .

[23]  Thulasyammal Ramiah Pillai,et al.  Accuracy Comparison of Machine Learning Algorithms for Predictive Analytics in Higher Education , 2019 .

[24]  P. Molnár,et al.  Fear of the coronavirus and the stock markets , 2020, Finance Research Letters.

[25]  José David Martín-Guerrero,et al.  Regularized extreme learning machine for regression problems , 2011, Neurocomputing.

[26]  A. E. Hoerl,et al.  Ridge Regression: Applications to Nonorthogonal Problems , 1970 .

[27]  Qibo Sun,et al.  Improve the House Price Prediction Accuracy with a Stacked Generalization Ensemble Model , 2019, IOV.

[28]  Anka Lisec,et al.  Estimating the Performance of Random Forest versus Multiple Regression for Predicting Prices of the Apartments , 2018, ISPRS Int. J. Geo Inf..

[29]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[30]  Enrico Foscolo,et al.  High spatial and temporal detail in timely prediction of tourism demand , 2020 .

[31]  Irene Cara,et al.  THE POTENTIAL OF APPLYING MACHINE LEARNING FOR PREDICTING CUT-IN BEHAVIOUR OF SURROUNDING TRAFFIC FOR TRUCK-PLATOONING SAFETY , 2017 .

[32]  Seow Eng Ong,et al.  Determinants of House Price: A Decision Tree Approach , 2006 .

[33]  Venkatesh Panchapagesan,et al.  WORKING PAPER NO : 537 Does Internet Search Intensity Predict House Prices in Emerging Markets ? A Case of India , 2017 .

[34]  H. Varian,et al.  Predicting the Present with Google Trends , 2009 .

[35]  You-Shan Feng,et al.  Forecasting tuberculosis using diabetes-related google trends data , 2020, Pathogens and global health.

[36]  G. Caldarelli,et al.  Credit Default Swaps networks and systemic risk , 2014, Scientific Reports.

[37]  Sulin Pang,et al.  C5.0 Classification Algorithm and Application on Individual Credit Evaluation of Banks , 2009 .

[38]  Keunje Yoo,et al.  Classification and Regression Tree Approach for Prediction of Potential Hazards of Urban Airborne Bacteria during Asian Dust Events , 2018, Scientific Reports.

[39]  Sahil Shah,et al.  Predicting stock and stock price index movement using Trend Deterministic Data Preparation and machine learning techniques , 2015, Expert Syst. Appl..

[40]  M. Vani Pujitha,et al.  House Resale Price Prediction Using Classification Algorithms , 2019, 2019 International Conference on Smart Structures and Systems (ICSSS).

[41]  E. Brynjolfsson,et al.  The Future of Prediction: How Google Searches Foreshadow Housing Prices and Sales , 2013, ICIS 2013.

[42]  I. Ahn,et al.  Forecasting seasonal influenza-like illness in South Korea after 2 and 30 weeks using Google Trends and influenza data from Argentina , 2020, PloS one.

[43]  Hieu Pham,et al.  Bagged ensembles with tunable parameters , 2018, Comput. Intell..

[44]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[45]  Jae Kwon Bae,et al.  Using machine learning algorithms for housing price prediction: The case of Fairfax County, Virginia housing data , 2015, Expert Syst. Appl..

[46]  Afees A. Salisu,et al.  Google trends and the predictability of precious metals , 2020 .

[47]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[48]  Erik Christian Montes Schütte,et al.  In Search of a Job: Forecasting Employment Growth Using Google Trends , 2019, Journal of Business & Economic Statistics.

[49]  J. R. Quinlan,et al.  Data Mining Tools See5 and C5.0 , 2004 .

[50]  Ralf Hohenstatt,et al.  "Geco" and its potential for real estate research: Evidence from the US housing market , 2011 .

[51]  Guido Caldarelli,et al.  Web Search Queries Can Predict Stock Market Volumes , 2011, PloS one.

[52]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[53]  Ali Etem Gürel,et al.  Prediction of daily global solar radiation using different machine learning algorithms: Evaluation and comparison , 2021, Renewable and Sustainable Energy Reviews.

[54]  Abhijit Sarma,et al.  House Price Prediction Using Machine Learning and Neural Networks , 2018, 2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT).

[55]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[56]  Yihao Zhang,et al.  Real estate price forecasting based on SVM optimized by PSO , 2014 .

[57]  Krzysztof Wołk,et al.  Advanced social media sentiment analysis for short‐term cryptocurrency price prediction , 2020, Expert Syst. J. Knowl. Eng..

[58]  Seung-Pyo Jun,et al.  Ten years of research change using Google Trends: From the perspective of big data utilizations and applications , 2017 .

[59]  Stefan Gindl,et al.  Exploring the predictive ability of LIKES of posts on the Facebook pages of four major city DMOs in Austria , 2018, Tourism Economics.

[60]  Simionescu Mihaela Improving unemployment rate forecasts at regional level in Romania using Google Trends , 2020 .

[61]  Hyun-young Choi,et al.  Predicting Initial Claims for Unemployment Benefits , 2009 .

[62]  Charles Elkan,et al.  Predictive analytics and data mining , 2010 .

[63]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[64]  P. Yip,et al.  Developing an early warning system of suicide using Google Trends and media reporting. , 2019, Journal of affective disorders.

[65]  Kelvin K. W. Yau,et al.  Predicting electricity energy consumption: A comparison of regression analysis, decision tree and neural networks , 2007 .

[66]  Theophilos Papadimitriou,et al.  Forecasting the U.S. Real House Price Index , 2014 .

[67]  Fumiko Takeda,et al.  Google Search Intensity and Its Relationship with Returns and Trading Volume of Japanese Stocks , 2013 .

[68]  N. S. Narahari,et al.  Machine Learning based Predicting House Prices using Regression Techniques , 2020, 2020 2nd International Conference on Innovative Mechanisms for Industry Applications (ICIMIA).

[69]  Md Nasir Sulaiman,et al.  Improved method of classification algorithms for crime prediction , 2014, 2014 International Symposium on Biometrics and Security Technologies (ISBAST).

[70]  Wolfgang Schäfers,et al.  Sentiment-Based Commercial Real Estate Forecasting with Google Search Volume Data , 2014 .

[71]  Hoang Nguyen,et al.  A Lasso and Elastic-Net Regularized Generalized Linear Model for Predicting Blast-Induced Air Over-pressure in Open-Pit Mines , 2022, Inżynieria Mineralna.

[72]  G. Daqi,et al.  Support vector machine classifiers using RBF kernels with clustering-based centers and widths , 2007, 2007 International Joint Conference on Neural Networks.

[73]  B. Grzegorz Zastosowanie Google Trends w prognozowaniu zmian na rynku nieruchomości , 2014 .

[74]  Yufang Wang,et al.  Using networks and partial differential equations to forecast bitcoin price movement. , 2020, Chaos.

[75]  Paul P Schneider,et al.  Using web search queries to monitor influenza-like illness: an exploratory retrospective analysis, Netherlands, 2017/18 influenza season , 2020, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[76]  M Balamurugan,et al.  Performance analysis of cart and C5.0 using sampling techniques , 2016, 2016 IEEE International Conference on Advances in Computer Applications (ICACA).

[77]  Adi Wibowo,et al.  Predicting house sale price using fuzzy logic, Artificial Neural Network and K-Nearest Neighbor , 2017, 2017 1st International Conference on Informatics and Computational Sciences (ICICoS).

[78]  Mahmoud Qadan,et al.  Investor sentiment and the price of oil , 2018 .

[79]  Vishnuprasad Business APPLICATION OF HYBRID METHODOLOGY TO PREDICT HOUSING LOAN DEFAULTS IN INDIA , 2015 .

[80]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[81]  Hossein Hassani,et al.  Googling Fashion: Forecasting Fashion Consumer Behaviour Using Google Trends , 2019, Social Sciences.

[82]  Rinkle Rani,et al.  Churn prediction in telecommunication using machine learning , 2017, 2017 International Conference on Energy, Communication, Data Analytics and Soft Computing (ICECDS).

[83]  J. Meulman,et al.  Prediction accuracy and stability of regression optimal scaling transformations , 1996 .

[84]  G. Moisen,et al.  Random forests and stochastic gradient boosting for predicting tree canopy cover: Comparing tuning processes and model performance , 2016 .

[85]  Joana M. Barros,et al.  The Validity of Google Trends Search Volumes for Behavioral Forecasting of National Suicide Rates in Ireland , 2019, International journal of environmental research and public health.

[86]  Li Tang,et al.  Predicting the direction of stock markets using optimized neural networks with Google Trends , 2018, Neurocomputing.