Clustering economic and financial time series : Exploring the existence of stable correlation conditions

Clustering plays an important role in extracting information from the noise in economic and financial time series; it is one way to perform a coarse-graining of the empirical data sets to extract stable dependence information from the surrounding noise. This discussion paper explores clustering as an econometric tool for studying dependencies between variables. It places clustering in the context of the classical statistical tools for dependence analysis (i.e., correlation, regression, cointegration and structural change); discusses similarity, the key concept in time-series clustering; and reviews the principal approaches to quantifying similarity with the distance function. It argues that concepts of similarity, such as dynamic time-warping, used in a wide range of application domains (e.g., speech recognition, medicine, biomathematics, seismology) might be useful in the context of finance and economics.

[1]  Eamonn J. Keogh,et al.  Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases , 2001, Knowledge and Information Systems.

[2]  Christos Faloutsos,et al.  FastMap: a fast algorithm for indexing, data-mining and visualization of traditional and multimedia datasets , 1995, SIGMOD '95.

[3]  R. Mantegna Hierarchical structure in financial markets , 1998, cond-mat/9802256.

[4]  J. Bouchaud,et al.  RANDOM MATRIX THEORY AND FINANCIAL CORRELATIONS , 2000 .

[5]  Dimitrios Gunopulos,et al.  Finding Similar Time Series , 1997, PKDD.

[6]  Eamonn J. Keogh,et al.  A Simple Dimensionality Reduction Technique for Fast Similarity Search in Large Time Series Databases , 2000, PAKDD.

[7]  Hannu Toivonen,et al.  Mining for similarities in aligned time series using wavelets , 1999, Defense, Security, and Sensing.

[8]  Dimitrios Gunopulos,et al.  Time-series similarity problems and well-separated geometric sets , 1997, SCG '97.

[9]  J. Bouchaud,et al.  Noise Dressing of Financial Correlation Matrices , 1998, cond-mat/9810255.

[10]  Haixun Wang,et al.  Landmarks: a new model for similarity-based pattern querying in time series databases , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[11]  Mantegna,et al.  Taxonomy of stock market indices , 2000, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[12]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.

[13]  T. Crone Using State Indexes to Define Economic Regions in the U.S. , 1999 .

[14]  G. Yule Why do we Sometimes get Nonsense-Correlations between Time-Series?--A Study in Sampling and the Nature of Time-Series , 1926 .