Efficient Retrieval of Similar Time Sequences Using DFT

We propose an improvement of the known DFT-based indexing technique for fast retrieval of similar time sequences. We use the last few Fourier coefficients in the distance computation without storing them in the index since every coefficient at the end is the complex conjugate of a coefficient at the beginning and as strong as its counterpart. We show analytically that this observation can accelerate the search time of the index by more than a factor of two. This result was confirmed by our experiments, which were carried out on real stock prices and synthetic data.

[1]  Clu-istos Foutsos,et al.  Fast subsequence matching in time-series databases , 1994, SIGMOD '94.

[2]  Christos Faloutsos,et al.  Efficient retrieval of similar time sequences under time warping , 1998, Proceedings 14th International Conference on Data Engineering.

[3]  C. Faloutsos Eecient Similarity Search in Sequence Databases , 1993 .

[4]  Hans-Peter Kriegel,et al.  The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.

[5]  Stephen A. Dyer,et al.  Digital signal processing , 2018, 8th International Multitopic Conference, 2004. Proceedings of INMIC 2004..

[6]  Benoit B. Mandelbrot,et al.  Fractal Geometry of Nature , 1984 .

[7]  Raghu Ramakrishnan,et al.  MIMSY: A System for Analyzing Time Series Data in the Stock Market Domain , 1993, Workshop on Programming with Logic Databases , ILPS.

[8]  Giuseppe Psaila,et al.  Querying Shapes of Histories , 1995, VLDB.

[9]  Divesh Srivastava,et al.  CORAL - Control, Relations and Logic , 1992, VLDB.

[10]  Jürg Nievergelt,et al.  The Grid File: An Adaptable, Symmetric Multikey File Structure , 1984, TODS.

[11]  Chris Chatfield,et al.  The Analysis of Time Series: An Introduction , 1981 .

[12]  Antonin Guttman,et al.  R-trees: a dynamic index structure for spatial searching , 1984, SIGMOD '84.

[13]  Dina Q. Goldin,et al.  On Similarity Queries for Time-Series Data: Constraint Specification and Implementation , 1995, CP.

[14]  Davood Rafiei,et al.  On similarity-based queries for time series data , 1997, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[15]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.

[16]  Manfred Schroeder,et al.  Fractals, Chaos, Power Laws: Minutes From an Infinite Paradise , 1992 .

[17]  P. A. Blight The Analysis of Time Series: An Introduction , 1991 .

[18]  Miron Livny,et al.  Sequence query processing , 1994, SIGMOD '94.

[19]  Kyuseok Shim,et al.  Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases , 1995, VLDB.

[20]  Alberto O. Mendelzon,et al.  Similarity-based queries , 1995, PODS '95.