Yule's "Nonsense Correlation" Solved!

In this paper, we resolve a longstanding open statistical problem. The problem is to mathematically confirm Yule's 1926 empirical finding of "nonsense correlation" (\cite{Yule}). We do so by analytically determining the second moment of the empirical correlation coefficient \beqn \theta := \frac{\int_0^1W_1(t)W_2(t) dt - \int_0^1W_1(t) dt \int_0^1 W_2(t) dt}{\sqrt{\int_0^1 W^2_1(t) dt - \parens{\int_0^1W_1(t) dt}^2} \sqrt{\int_0^1 W^2_2(t) dt - \parens{\int_0^1W_2(t) dt}^2}}, \eeqn of two {\em independent} Wiener processes, $W_1,W_2$. Using tools from Fred- holm integral equation theory, we successfully calculate the second moment of $\theta$ to obtain a value for the standard deviation of $\theta$ of nearly .5. The "nonsense" correlation, which we call "volatile" correlation, is volatile in the sense that its distribution is heavily dispersed and is frequently large in absolute value. It is induced because each Wiener process is "self-correlated" in time. This is because a Wiener process is an integral of pure noise and thus its values at different time points are correlated. In addition to providing an explicit formula for the second moment of $\theta$, we offer implicit formulas for higher moments of $\theta$.

[1]  M. Hughes,et al.  Proxy-based reconstructions of hemispheric and global surface temperature variations over the past two millennia , 2008, Proceedings of the National Academy of Sciences.

[2]  B. McShane,et al.  A statistical analysis of multiple temperature proxies: Are reconstructions of surface temperatures over the last 1000 years reliable? , 2011, 1104.4002.

[3]  P. Phillips Understanding spurious regressions in econometrics , 1986 .

[4]  Real Zeros of Random Polynomials , 1968 .

[5]  D. Hendry Econometric Modelling with Cointegrated Variables: An Overview , 2009 .

[6]  J. Magnus The Exact Moments of a Ratio of Quadratic Forms in Normal Variables , 1986 .

[7]  L. Shepp,et al.  The correlation of the maxima of correlated Brownian motions , 2006, Journal of Applied Probability.

[8]  C. Mallows,et al.  Limit Distributions of Self-normalized Sums , 1973 .

[9]  G. Yule Why do we Sometimes get Nonsense-Correlations between Time-Series?--A Study in Sampling and the Nature of Time-Series , 1926 .

[10]  Ralph P. Boas,et al.  OF ENTIRE FUNCTIONS , 2016 .

[11]  P. Erdös,et al.  On certain limit theorems of the theory of probability , 1946 .

[12]  L. Shepp The joint density of the maximum and its location for a Wiener process with drift , 1979, Journal of Applied Probability.

[13]  Feller William,et al.  An Introduction To Probability Theory And Its Applications , 1950 .

[14]  J. Aldrich Correlations Genuine and Spurious in Pearson and Yule , 1995 .

[15]  Peter C. B. Phillips,et al.  New Tools for Understanding Spurious Regressions , 1998 .