Related papers

Anomaly Detection and Characterization in Spatial Time Series Data: A Cluster-Centric Approach

Abstract: Anomaly detection in spatial time series (spatiotemporal data) is a challenging problem with numerous potential applications. A comprehensive anomaly detection approach should not only detect and identify emerging anomalies but also characterize them by visualizing the structures revealed within the data in a way that the end user can understand. In this paper, we consider fuzzy c-means (FCM) as a conceptual and algorithmic setting for anomaly detection. Using a sliding window, the time series are divided into a number of subsequences, and the spatiotemporal structure within each time window is discovered using the FCM method. Next, an anomaly score is assigned to each cluster, and, using a fuzzy relation formed between the revealed structures, the propagation of anomalies across consecutive time intervals is visualized. To illustrate the proposed method, several datasets (synthetic data, a simulated disease outbreak scenario, and Alberta temperature data) are investigated.
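The pipeline outlined in the abstract (sliding window, FCM per window, one score per cluster) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the paper's implementation. The abstract does not say how spatial and temporal features are combined or how the anomaly score is defined, so the concatenation of coordinates with the subsequence, the `cluster_scores` stand-in (small clusters score high), and all function names are hypothetical. The fuzzy relation between consecutive windows is not covered.

```python
import math
import random

def sliding_windows(series, width, step):
    """Split one time series into (possibly overlapping) subsequences."""
    return [series[i:i + width] for i in range(0, len(series) - width + 1, step)]

def fcm(points, c, m=2.0, iters=100, seed=0):
    """Standard fuzzy c-means; returns (centers, u) with u[i][k] the
    membership of point k in cluster i."""
    rng = random.Random(seed)
    n, dim = len(points), len(points[0])
    # Random initial partition matrix; memberships of each point sum to 1.
    u = [[rng.random() for _ in range(n)] for _ in range(c)]
    for k in range(n):
        s = sum(u[i][k] for i in range(c))
        for i in range(c):
            u[i][k] /= s
    centers = []
    for _ in range(iters):
        # Prototype update: v_i = sum_k u_ik^m x_k / sum_k u_ik^m
        centers = []
        for i in range(c):
            w = [u[i][k] ** m for k in range(n)]
            tot = sum(w)
            centers.append([sum(w[k] * points[k][d] for k in range(n)) / tot
                            for d in range(dim)])
        # Membership update: u_ik = 1 / sum_j (d_ik / d_jk)^(2/(m-1))
        for k in range(n):
            dist = [max(math.dist(points[k], centers[i]), 1e-12) for i in range(c)]
            for i in range(c):
                u[i][k] = 1.0 / sum((dist[i] / dist[j]) ** (2.0 / (m - 1.0))
                                    for j in range(c))
    return centers, u

def cluster_scores(u):
    """ASSUMED stand-in score (not from the paper): one minus the share of
    total membership held by each cluster, so small clusters score high."""
    n = len(u[0])
    return [1.0 - sum(row) / n for row in u]

def score_windows(coords, series, width, step, c=2):
    """Cluster every time window and score its clusters.
    ASSUMPTION: each data point is (x, y) concatenated with its subsequence."""
    per_station = [sliding_windows(s, width, step) for s in series]
    results = []
    for w in range(len(per_station[0])):
        points = [list(coords[k]) + per_station[k][w] for k in range(len(series))]
        centers, u = fcm(points, c)
        results.append((centers, cluster_scores(u)))
    return results

# Toy data: four similar stations and one whose series spikes mid-way.
coords = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1), (0.9, 0.9)]
series = [[0.0, 0.1, 0.0, 0.1, 0.0, 0.1] for _ in range(4)]
series.append([0.0, 0.1, 5.0, 5.2, 0.0, 0.1])
for centers, scores in score_windows(coords, series, width=3, step=1):
    print([round(s, 2) for s in scores])
```

Each printed list holds one score per cluster for one time window. In practice the spatial and temporal parts would likely need separate weighting, which this sketch omits.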

References

[1]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control, 1972.

[2]  Vipin Kumar,et al.  Anomaly Detection for Discrete Sequences: A Survey , 2012, IEEE Transactions on Knowledge and Data Engineering.

[3]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters, 1973.

[4]  M. Nanni,et al.  Spatio-Temporal Clustering: a Survey, 2010.

[5]  Eyal Amir,et al.  Real-time Bayesian Anomaly Detection for Environmental Sensor Data, 2007.

[6]  Witold Pedrycz.  Proximity-Based Clustering: A Search for Structural Consistency in Data With Semantic Blocks of Features , 2013, IEEE Transactions on Fuzzy Systems.

[7]  P. Protopapas,et al.  Finding outlier light curves in catalogues of periodic variable stars , 2005, astro-ph/0505495.

[8]  Witold Pedrycz,et al.  Anomaly detection in time series data using a fuzzy c-means clustering , 2013, 2013 Joint IFSA World Congress and NAFIPS Annual Meeting (IFSA/NAFIPS).

[9]  Kenji Yamanishi,et al.  A unifying framework for detecting outliers and change points from time series , 2006, IEEE Transactions on Knowledge and Data Engineering.

[10]  Ricardo J. G. B. Campello,et al.  A fuzzy extension of the Rand index and other related indexes for clustering and classification assessment , 2007, Pattern Recognit. Lett.

[11]  Eyke Hüllermeier,et al.  Comparing Fuzzy Partitions: A Generalization of the Rand Index and Related Measures , 2012, IEEE Transactions on Fuzzy Systems.

[12]  Maoguo Gong,et al.  Image change detection based on an improved rough fuzzy c-means clustering algorithm , 2013, International Journal of Machine Learning and Cybernetics.

[13]  Miguel A. Sanz-Bobi,et al.  Auto-Regressive Processes Explained by Self-Organized Maps. Application to the Detection of Abnormal Behavior in Industrial Processes , 2011, IEEE Transactions on Neural Networks.

[14]  Witold Pedrycz,et al.  Agreement-based fuzzy C-means for clustering data with blocks of features , 2014, Neurocomputing.

[15]  James M. Keller,et al.  Comparing Fuzzy, Probabilistic, and Possibilistic Partitions , 2010, IEEE Transactions on Fuzzy Systems.

[16]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[17]  Han-Xiong Li,et al.  Spatially Constrained Fuzzy-Clustering-Based Sensor Placement for Spatiotemporal Fuzzy-Control System , 2010, IEEE Transactions on Fuzzy Systems.

[18]  M. Kulldorff,et al.  A Space–Time Permutation Scan Statistic for Disease Outbreak Detection , 2005, PLoS medicine.

[19]  A. Khatkhate,et al.  Symbolic time-series analysis for anomaly detection in mechanical systems , 2006, IEEE/ASME Transactions on Mechatronics.

[20]  J. Ma,et al.  Time-series novelty detection using one-class support vector machines , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003.

[21]  Pierpaolo D'Urso,et al.  Fuzzy Clustering for Data Time Arrays With Inlier and Outlier Time Trajectories , 2005, IEEE Transactions on Fuzzy Systems.

[22]  Lior Rokach,et al.  Data Mining And Knowledge Discovery Handbook, 2005.

[23]  Andrzej Bargiela,et al.  Fuzzy clustering with semantically distinct families of variables: Descriptive and predictive aspects , 2010, Pattern Recognit. Lett.

[24]  Thomas G. Dietterich,et al.  Spatiotemporal Models for Data-Anomaly Detection in Dynamic Environmental Monitoring Campaigns , 2011, TOSN.

[25]  Slava Kisilevich,et al.  Spatio-temporal clustering , 2010, Data Mining and Knowledge Discovery Handbook.

[26]  Derek Anderson,et al.  Comparing Fuzzy, Probabilistic, and Possibilistic Partitions Using the Earth Mover’s Distance , 2013, IEEE Transactions on Fuzzy Systems.

[27]  Daniel B. Neill,et al.  Expectation-based scan statistics for monitoring spatial time series data, 2009.

[28]  Eamonn J. Keogh,et al.  Towards parameter-free data mining , 2004, KDD.

[29]  Witold Pedrycz,et al.  A Development of Fuzzy Encoding and Decoding Through Fuzzy Clustering , 2008, IEEE Transactions on Instrumentation and Measurement.

[30]  A. Hill,et al.  The North American Animal Disease Spread Model: a simulation model to assist decision making in evaluating animal disease incursions. , 2007, Preventive veterinary medicine.

[31]  Dipankar Dasgupta,et al.  Novelty detection in time series data using ideas from immunology, 1996.

[32]  Weina Wang,et al.  On fuzzy cluster validity indices , 2007, Fuzzy Sets Syst.

[33]  Eamonn J. Keogh,et al.  HOT SAX: efficiently finding the most unusual time series subsequence , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[34]  Vipin Kumar,et al.  Comparative Evaluation of Anomaly Detection Techniques for Sequence Data , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[35]  Pang-Ning Tan,et al.  A Robust Graph-Based Algorithm for Detection and Characterization of Anomalies in Noisy Multivariate Time Series , 2008, 2008 IEEE International Conference on Data Mining Workshops.

[36]  Srinivasan Parthasarathy,et al.  Anomaly detection and spatio-temporal analysis of global climate system , 2009, SensorKDD '09.

[37]  Yohsuke Kinouchi,et al.  Neural networks for event extraction from time series: a back propagation algorithm approach , 2005, Future Gener. Comput. Syst.

[38]  Witold Pedrycz,et al.  Clustering Spatiotemporal Data: An Augmented Fuzzy C-Means , 2013, IEEE Transactions on Fuzzy Systems.

Cited by
A Pre-clustering Method To Improve Anomaly Detection
SECRYPT
2016
Towards Visual Exploration of Large Temporal Datasets
2018 International Symposium on Big Data Visual and Immersive Analytics (BDVA)
2018
Time Series Data Mining Methods: A Review
2015
Bound smoothing based time series anomaly detection using multiple similarity measures
J. Intell. Manuf.
2020
A Survey on Data Mining Methods for Clustering Complex Spatiotemporal Data
BDAS
2017
Takagi–Sugeno Fuzzy Modeling Using Mixed Fuzzy Clustering
IEEE Transactions on Fuzzy Systems
2017
A fixed-charge transportation problem in two-stage supply chain network in Gaussian type-2 fuzzy environments
Inf. Sci.
2015
A survey on novelty detection using level set methods
2017 International Conference on Inventive Communication and Computational Technologies (ICICCT)
2017
A heuristic approach to detect novelty data using improved level set methods
2017 International Conference on Computing Methodologies and Communication (ICCMC)
2017
A conceptual paper on novelty detection for temporal data using level set methods
2017 International conference of Electronics, Communication and Aerospace Technology (ICECA)
2017
A Self-Learning and Online Algorithm for Time Series Anomaly Detection, with Application in CPU Manufacturing
CIKM
2016
Exact variable-length anomaly detection algorithm for univariate and multivariate time series
Data Mining and Knowledge Discovery
2018
UK-Means Clustering for Uncertain Time Series Based on ULDTW Distance
IDEAL
2017
Entropic One-Class Classifiers
IEEE Transactions on Neural Networks and Learning Systems
2014
ADARC: An anomaly detection algorithm based on relative outlier distance and biseries correlation
Softw. Pract. Exp.
2020
From Rocks to Pebbles
ACM Trans. Spatial Algorithms Syst.
2019
COPE: Interactive Exploration of Co-Occurrence Patterns in Spatial Time Series
IEEE Transactions on Visualization and Computer Graphics
2019
A Geometric Approach to Clustering Based Anomaly Detection for Industrial Applications
IECON 2018 - 44th Annual Conference of the IEEE Industrial Electronics Society
2018
Anomaly Detection Guidelines for Data Streams in Big Data
2016 3rd International Conference on Soft Computing & Machine Intelligence (ISCMI)
2016
Root-cause Analysis for Time-series Anomalies via Spatiotemporal Graphical Modeling in Distributed Complex Systems
Knowl. Based Syst.
2018