Detecting Approximate Periodic Patterns

Given e∈[0, 1), the e-Relative Error Periodic Pattern Problem (REPP) is the following: INPUT: An n-long sequence S of numbers si∈ℕ in increasing order. OUTPUT: The longest e-relative error periodic pattern, i.e., the longest subsequence $s_{i_1}, s_{i_2},\ldots, s_{i_k}$ of S, for which there exists a number p such that the absolute difference between any two consecutive numbers in the subsequence is at least p and at most p(1+e). The best known algorithm for this problem has O(n3) time complexity. This bound is too high for large inputs in practice. In this paper we give a new algorithm for finding the longest e-relative error periodic pattern (the REPP problem). Our method is based on a transformation of the input sequence into a different representation: the e-active maximal intervals listL, defined in this paper. We show that the transformation of S to the list L can be done efficiently (quadratic in n and linear in the size of L) and prove that our algorithm is linear in the size of L. This enables us to prove that our algorithm works in sub-cubic time on inputs for which the best known algorithm works in O(n3) time. Moreover, though it may happen that our algorithm would still be cubic, it is never worse than the known O(n3)-algorithm and in many situations its complexity is O(n2) time.

[1]  Jiawei Han,et al.  Efficient mining of partial periodic patterns in time series database , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[2]  Arun Iyengar,et al.  Capacity planning tools for web and grid environments , 2006, valuetools '06.

[3]  Beat Gfeller Finding Longest Approximate Periodic Patterns , 2011, WADS.

[4]  Vincent W. Freeh,et al.  Instantaneous offloading of web server load , 2002 .

[5]  Mohammed Al-Shalalfa,et al.  Efficient Periodicity Mining in Time Series Databases Using Suffix Trees , 2011, IEEE Transactions on Knowledge and Data Engineering.

[6]  Philip S. Yu,et al.  Mining Asynchronous Periodic Patterns in Time Series Data , 2003, IEEE Trans. Knowl. Data Eng..

[7]  Young-Koo Lee,et al.  Discovering Periodic-Frequent Patterns in Transactional Databases , 2009, PAKDD.

[8]  Joseph L. Hellerstein,et al.  Mining partially periodic event patterns with unknown periods , 2001, Proceedings 17th International Conference on Data Engineering.