Group behavior time series anomaly detection in specific network space based on separation degree

Specific network space, including virtual space and practical space, is a space for executing group behavior on specified regions via network. Due to the variability and unpredictability of time series in group behavior in special network space, the detection of normal and abnormal borders faces significant challenges. The parameters in traditional time series mode need to be predefined such as clustering method and anomaly detection methods science the results influentially depend on the selection of parameters. According to the characteristics of data, this paper proposes an efficient method called separation degree algorithm that can construct the self-adaptive interval based on the separation degree model to filter out anomaly data in virtual and practical spaces. The advantage allows us to automatically find the self-adaptive interval to improve the accuracy and applicability of anomaly detection based on the characteristics of the data instead of set parameters of traditional methods in network space. The extensive experimental result shows that the proposed method can effectively detect anomaly data from different spaces.

[1]  Hao Tu,et al.  An Efficient Clustering Algorithm for Microblogging Hot Topic Detection , 2012, 2012 International Conference on Computer Science and Service System.

[2]  ChenKuan-Yu,et al.  Hot Topic Extraction Based on Timeline Analysis and Multidimensional Sentence Modeling , 2007 .

[3]  Kuan-Yu Chen,et al.  Hot Topic Extraction Based on Timeline Analysis and Multidimensional Sentence Modeling , 2007, IEEE Transactions on Knowledge and Data Engineering.

[4]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[5]  Jia-Jin Le,et al.  An Efficient and Effective Clustering Algorithm for Time Series of Hot Topics , 2012 .

[6]  Sean Hughes,et al.  Clustering by Fast Search and Find of Density Peaks , 2016 .

[7]  Daniel G. Sbarbaro-Hofer,et al.  Outliers detection in environmental monitoring databases , 2011, Eng. Appl. Artif. Intell..

[8]  Hans-Peter Kriegel,et al.  LOF: identifying density-based local outliers , 2000, SIGMOD 2000.

[9]  Xiaolong Wang,et al.  Online topic detection and tracking of financial news based on hierarchical clustering , 2010, 2010 International Conference on Machine Learning and Cybernetics.

[10]  Anil K. Jain Data clustering: 50 years beyond K-means , 2008, Pattern Recognit. Lett..

[11]  Meir Kalech,et al.  Online data-driven anomaly detection in autonomous robots , 2014, Knowledge and Information Systems.

[12]  Alessandro Laio,et al.  Clustering by fast search and find of density peaks , 2014, Science.

[13]  Carlos Alberto Ochoa Ortíz Zezzatti,et al.  Outlier Analysis for Plastic Card Fraud Detection a Hybridized and Multi-Objective Approach , 2011, HAIS.

[14]  Hiroyuki Kitagawa,et al.  Top-k Outlier Detection from Uncertain Data , 2014, Int. J. Autom. Comput..

[15]  Joshua M. Dudik,et al.  A comparative analysis of DBSCAN, K-means, and quadratic variation algorithms for automatic identification of swallows from swallowing accelerometry signals , 2015, Comput. Biol. Medicine.

[16]  Anastasios Tefas,et al.  A distributed framework for trimmed Kernel k-Means clustering , 2015, Pattern Recognit..

[17]  Rui Xu,et al.  Survey of clustering algorithms , 2005, IEEE Transactions on Neural Networks.

[18]  Sune Lehmann,et al.  Link communities reveal multiscale complexity in networks , 2009, Nature.