Revisiting the use of web search data for stock market movements

Advances in Big Data make it possible to make short-term forecasts for market trends from previously unexplored sources. Trading strategies were recently developed by exploiting a link between the online search activity of certain terms semantically related to finance and market movements. Here we build on these earlier results by exploring a data-driven strategy which adaptively leverages the Google Correlate service and automatically chooses a new set of search terms for every trading decision. In a backtesting experiment run from 2008 to 2017 we obtained a 499% cumulative return which compares favourably with benchmark strategies. A crowdsourcing exercise reveals that the term selection process preferentially selects highly specific terms semantically related to finance (e.g. Wells Fargo Bank), which may capture the transient interests of investors, but at the cost of a shorter span of validity. The adaptive strategy quickly updates the set of search terms when a better combination is found, leading to more consistent predictability. We anticipate that this adaptive decision framework can be of value not only for financial applications, but also in other areas of computational social science, where linkages between facets of collective human behavior and online searches can be inferred from digital footprint data.

[1]  Raphael H. Heiberger,et al.  Collective Attention and Stock Prices: Evidence from Google Trends Data on Standard and Poor's 100 , 2015, PloS one.

[2]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[3]  H. Simon,et al.  A Behavioral Model of Rational Choice , 1955 .

[4]  N. Askitas,et al.  Google Econometrics and Unemployment Forecasting , 2009, SSRN Electronic Journal.

[5]  David M. Pennock,et al.  Predicting consumer behavior with Web search , 2010, Proceedings of the National Academy of Sciences.

[6]  Peter Molnár,et al.  Google Searches and Stock Returns , 2016 .

[7]  Matthias Bank,et al.  Google search volume and its influence on liquidity and returns of German stocks , 2010 .

[8]  M. B. Wintoki,et al.  Forecasting Abnormal Stock Returns and Trading Volume Using Investor Sentiment: Evidence from Online Search ? , 2011 .

[9]  Aixia Guo,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2014 .

[10]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[11]  D. Lazer,et al.  The Parable of Google Flu: Traps in Big Data Analysis , 2014, Science.

[12]  Agostino Di Ciaccio,et al.  Computational Statistics and Data Analysis Measuring the Prediction Error. a Comparison of Cross-validation, Bootstrap and Covariance Penalty Methods , 2022 .

[13]  Ladislav Kristoufek,et al.  Can Google Trends search queries contribute to risk diversification? , 2013, Scientific Reports.

[14]  Zhi Da,et al.  In Search of Attention , 2009 .

[15]  Thomas Dimpfl,et al.  Can Internet Search Queries Help to Predict Stock Market Volatility? , 2016 .

[16]  H. Stanley,et al.  Quantifying Trading Behavior in Financial Markets Using Google Trends , 2013, Scientific Reports.

[17]  Mauricio Santillana,et al.  Accurate estimation of influenza epidemics using Google search data via ARGO , 2015, Proceedings of the National Academy of Sciences.

[18]  Ladislav Kristoufek,et al.  Nowcasting Unemployment Rates with Google Searches: Evidence from the Visegrad Group Countries , 2014, PloS one.

[19]  H Eugene Stanley,et al.  Quantifying the semantics of search behavior before stock market moves , 2014, Proceedings of the National Academy of Sciences.

[20]  Tobias Preis,et al.  Early Signs of Financial Market Moves Reflected by Google Searches , 2015 .

[21]  H Eugene Stanley,et al.  Complex dynamics of our economic life on different scales: insights from search engine query data , 2010, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[22]  Ladislav Kristoufek,et al.  BitCoin meets Google Trends and Wikipedia: Quantifying the relationship between phenomena of the Internet era , 2013, Scientific Reports.

[23]  Tomaso Aste,et al.  When Can Social Media Lead Financial Markets? , 2014, Scientific Reports.

[24]  H. Eugene Stanley,et al.  Quantifying Wikipedia Usage Patterns Before Stock Market Moves , 2013, Scientific Reports.

[25]  James M. Hyman,et al.  Forecasting the 2013–2014 Influenza Season Using Wikipedia , 2014, PLoS Comput. Biol..

[26]  Tobias Preis,et al.  Adaptive nowcasting of influenza outbreaks using Google searches , 2014, Royal Society Open Science.