Leveraging information storage to select forecast-optimal parameters for delay-coordinate reconstructions.

Delay-coordinate reconstruction is a proven modeling strategy for building effective forecasts of nonlinear time series. The first step in this process is the estimation of good values for two parameters, the time delay and the embedding dimension. Many heuristics and strategies have been proposed in the literature for estimating these values. Few, if any, of these methods were developed with forecasting in mind, however, and their results are not optimal for that purpose. Even so, these heuristics-intended for other applications-are routinely used when building delay coordinate reconstruction-based forecast models. In this paper, we propose an alternate strategy for choosing optimal parameter values for forecast methods that are based on delay-coordinate reconstructions. The basic calculation involves maximizing the shared information between each delay vector and the future state of the system. We illustrate the effectiveness of this method on several synthetic and experimental systems, showing that this metric can be calculated quickly and reliably from a relatively short time series, and that it provides a direct indication of how well a near-neighbor based forecasting method will work on a given delay reconstruction of that time series. This allows a practitioner to choose reconstruction parameters that avoid any pathologies, regardless of the underlying mechanism, and maximize the predictive information contained in the reconstruction.

[1]  Schreiber,et al.  Measuring information transfer , 2000, Physical review letters.

[2]  E. Lorenz Atmospheric Predictability as Revealed by Naturally Occurring Analogues , 1969 .

[3]  Y. Pesin CHARACTERISTIC LYAPUNOV EXPONENTS AND SMOOTH ERGODIC THEORY , 1977 .

[4]  Andreas S. Weigend,et al.  Time Series Prediction: Forecasting the Future and Understanding the Past , 1994 .

[5]  Rob J Hyndman,et al.  Another look at measures of forecast accuracy , 2006 .

[6]  E. Lorenz Deterministic nonperiodic flow , 1963 .

[7]  M. Rosenstein,et al.  Reconstruction expansion as a geometry-based framework for choosing proper delay times , 1994 .

[8]  H. Schuster,et al.  Proper choice of the time delay for the analysis of chaotic time series , 1989 .

[9]  A. Kraskov,et al.  Estimating mutual information. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  James P. Crutchfield,et al.  Bayesian Structural Inference for Hidden Processes , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  E. Lorenz Predictability of Weather and Climate: Predictability – a problem partly solved , 2006 .

[12]  D. Kugiumtzis State space reconstruction parameters in the analysis of chaotic time series—the role of the time window length , 1996, comp-gas/9602002.

[13]  Amer Diwan,et al.  Supporting experiments in computer systems research , 2010 .

[14]  K. Pawelzik,et al.  Optimal Embeddings of Chaotic Attractors from Topological Considerations , 1991 .

[15]  Amer Diwan,et al.  Measurement and Dynamical Analysis of Computer Performance Data , 2010, IDA.

[16]  George Sugihara,et al.  Nonlinear forecasting as a way of distinguishing chaos from measurement error in time series , 1990, Nature.

[17]  John L. Henning SPEC CPU2006 benchmark descriptions , 2006, CARN.

[18]  Konstantine P. Georgakakos,et al.  Estimating the Dimension of Weather and Climate Attractors: Important Issues about the Procedure and Interpretation , 1993 .

[19]  S. Frenzel,et al.  Partial mutual information for coupling analysis of multivariate time series. , 2007, Physical review letters.

[20]  L. Cao Practical method for determining the minimum embedding dimension of a scalar time series , 1997 .

[21]  James P. Crutchfield,et al.  Geometry from a Time Series , 1980 .

[22]  James P. Crutchfield,et al.  Many Roads to Synchrony: Natural Time Scales and Their Algorithms , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[23]  H. Kantz,et al.  Nonlinear time series analysis , 1997 .

[24]  Joshua Garland,et al.  Prediction in projection. , 2015, Chaos.

[25]  Gerd Pfister,et al.  Comparison of algorithms calculating optimal embedding parameters for delay time coordinates , 1992 .

[26]  Fraser,et al.  Independent coordinates for strange attractors from mutual information. , 1986, Physical review. A, General physics.

[27]  Pfister,et al.  Optimal delay time and embedding dimension for delay-time coordinates by analysis of the global static and local dynamical behavior of strange attractors. , 1992, Physical review. A, Atomic, molecular, and optical physics.