论文信息 - Finding K Most Significant Motifs in Big Time Series Data

Finding K Most Significant Motifs in Big Time Series Data

Abstract An efficient discovery algorithm of frequently occurring patterns, called motifs, in a time series would be useful as a tool for summarizing and visualizing big time series databases. In this paper, we propose an efficient approximate algorithm, called DiscMotifs, to discover the K most significant (KMS) motifs from time series. First, the proposed algorithm transforms the time series into a SAX representation and then the algorithm divides the SAX representation into subsequences. Next, these subsequences are linearized by projecting them into a one-dimensional space based on their distances form a randomly selected reference point, or a subsequence. By utilizing the linear ordering of subsequences, DiscMotifs efficiently discovers the KMS motifs. DiscMotifs algorithm requires a storage space linear to the number of subsequences. We demonstrate the feasibility of this approach on several synthetic and real application datasets.

Zaher Al Aghbari | Ayoub Al-Hamadi

[1] Bernhard Sick,et al. Performing event detection in time series with SwiftEvent: an algorithm with supervised learning of detection criteria , 2018, Pattern Analysis and Applications.

[2] Karsten M. Borgwardt,et al. Association mapping in biomedical time series via statistically significant shapelet mining , 2018, Bioinform..

[3] Tim Oates,et al. GrammarViz 3.0 , 2018, ACM Trans. Knowl. Discov. Data.

[4] Sergey V. Kovalchuk,et al. Motif identification in vital signs of chronic patients , 2019 .

[5] Ibrahim Kamel,et al. On clustering large number of data streams , 2012, Intell. Data Anal..

[6] G. P. Chuiko,et al. Trends and seasonality extracting from Home Blood Pressure Monitoring readings , 2018 .

[7] Zaher Al Aghbari,et al. Array-index: a plug&search K nearest neighbors method for high-dimensional data , 2005, Data Knowl. Eng..

[8] Tijl De Bie,et al. SIMIT: Subjectively Interesting Motifs in Time Series , 2019, Entropy.

[9] Clara E Yoon,et al. Earthquake detection through computationally efficient similarity search , 2015, Science Advances.

[10] Catherine Garbay,et al. Knowledge construction from time series data using a collaborative exploration system , 2007, J. Biomed. Informatics.

[11] Kuniaki Uehara,et al. Discovery of Time-Series Motif from Multi-Dimensional Data Based on MDL Principle , 2005, Machine Learning.