In this paper, we address efficient processing of subsequence matching in time-series databases. We first point out the performance problems occurring in the index searching of a prior method for subsequence matching. Then, we propose a new method that resolves these problems. Our method starts with viewing the index searching of subsequence matching from a new angle, thereby regarding it as a kind of a spatial-join called a window-join. For speeding up the window-join, our method builds an R*-tree in main memory for f query sequence at starting of sub-sequence matching. Our method also includes a novel algorithm for joining effectively one R*-tree in disk, which is for data sequences, and another R*-tree in main memory, which is for a query sequence. This algorithm accesses each R*-tree page built on data sequences exactly cure without incurring any index-level false alarms. Therefore, in terms of the number of disk accesses, the proposed algorithm proves to be optimal. Also, performance evaluation through extensive experiments shows the superiority of our method quantitatively.
[1]
Young-Koo Lee,et al.
Spatial Join Processing Using Corner Transformation
,
1999,
IEEE Trans. Knowl. Data Eng..
[2]
Alberto O. Mendelzon,et al.
Similarity-based queries for time series data
,
1997,
SIGMOD '97.
[3]
Ada Wai-Chee Fu,et al.
Efficient time series matching by wavelets
,
1999,
Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).
[4]
Yang-Sae Moon,et al.
Duality-based subsequence matching in time-series databases
,
2001,
Proceedings 17th International Conference on Data Engineering.
[5]
Wesley W. Chu,et al.
An index-based approach for similarity search supporting time warping in large sequence databases
,
2001,
Proceedings 17th International Conference on Data Engineering.