Mining event periodicity from incomplete observations

Advanced technology in GPS and sensors enables us to track physical events, such as human movements and facility usage. Periodicity analysis from the recorded data is an important data mining task which provides useful insights into the physical events and enables us to report outliers and predict future behaviors. To mine periodicity in an event, we have to face real-world challenges of inherently complicated periodic behaviors and imperfect data collection problem. Specifically, the hidden temporal periodic behaviors could be oscillating and noisy, and the observations of the event could be incomplete. In this paper, we propose a novel probabilistic measure for periodicity and design a practical method to detect periods. Our method has thoroughly considered the uncertainties and noises in periodic behaviors and is provably robust to incomplete observations. Comprehensive experiments on both synthetic and real datasets demonstrate the effectiveness of our method.

[1]  Jiawei Han,et al.  Mining periodic behaviors for moving objects , 2010, KDD.

[2]  N. Lomb Least-squares frequency analysis of unequally spaced data , 1976 .

[3]  Jie Chen,et al.  Bioinformatics Original Paper Detecting Periodic Patterns in Unevenly Spaced Gene Expression Time Series Using Lomb–scargle Periodograms , 2022 .

[4]  J. Scargle Studies in astronomical time series analysis. II - Statistical aspects of spectral analysis of unevenly spaced data , 1982 .

[5]  Walid G. Aref,et al.  WARP: time warping for periodicity detection , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[6]  Ilya Shmulevich,et al.  Robust regression for periodicity detection in non-uniformly sampled time-course gene expression data , 2007, BMC Bioinformatics.

[7]  Marta C. González,et al.  Understanding individual human mobility patterns , 2008, Nature.

[8]  D. B. Preston Spectral Analysis and Time Series , 1983 .

[9]  Gwenn Englebienne,et al.  Accurate activity recognition in a home setting , 2008, UbiComp.

[10]  Joseph L. Hellerstein,et al.  Mining partially periodic event patterns with unknown periods , 2001, Proceedings 17th International Conference on Data Engineering.

[11]  Walid G. Aref,et al.  Periodicity detection in time series databases , 2005, IEEE Transactions on Knowledge and Data Engineering.

[12]  Philip S. Yu,et al.  On Periodicity Detection and Structural Periodic Similarity , 2005, SDM.

[13]  Philip S. Yu,et al.  Mining Asynchronous Periodic Patterns in Time Series Data , 2003, IEEE Trans. Knowl. Data Eng..

[14]  Piotr Indyk,et al.  Identifying Representative Trends in Massive Time Series Data Sets Using Sketches , 2000, VLDB.

[15]  M Schimmel,et al.  Emphasizing Difficulties in the Detection of Rhythms with Lomb-Scargle Periodograms , 2001, Biological rhythm research.

[16]  Ivan Junier,et al.  Periodic pattern detection in sparse boolean sequences , 2010, Algorithms for Molecular Biology.