论文信息 - A review on outlier detection techniques on data stream by using different approaches of K-Means algorithm

A review on outlier detection techniques on data stream by using different approaches of K-Means algorithm

Data Stream mining has gained attraction from many researchers as there is need to mine large dataset which pose different challenges for researchers. Stream data is different compared to normal data as they are continuously produced from different applications which impose different challenges like massive, infinite, concept drift for processing. An object that does not obey the behavior of normal data object is called outliers. Outlier detection is used in different applications like fraud detection, intrusion detection, track environmental changes, medical diagnosis so there is need to detect outliers from data streams. Various approaches are used for outlier detection. Some of them use K-Means algorithm for outlier detection in data streams which help to create a similar group or cluster of data points. Data stream clustering techniques are highly helpful to cluster similar data items in data streams and also to detect the outliers from them, so they are called cluster based outlier detection. K-means algorithm is partition based algorithm which is used for clustering datasets into number of clusters. It is most common and popular algorithm for clustering due to its simplicity and efficiency. Purpose of this paper is to review of different approaches of outlier detection which is used for K-Means algorithm for clustering dataset with some other methods. Different application areas of outlier detection are discussed in this paper.

Madhu Shukla | Prashant Chauhan | M. Shukla | Prashant Chauhan

[1] Ian Witten,et al. Data Mining , 2000 .

[2] Durga Toshniwal,et al. A Framework for Outlier Detection in Evolving Data Streams by Weighting Attributes in Clustering , 2012 .

[3] Suhaimi Ibrahim,et al. Outlier Detection in Stream Data by Clustering Method , 2014 .

[4] Deepika Pahuja,et al. A Critical Review on Outlier Detection Techniques , 2014 .

[5] J. Gerberding,et al. From the Fifth International Conference on the , 1998 .

[6] Kun Li,et al. Efficient Clustering-Based Outlier Detection Algorithm for Dynamic Data Stream , 2008, 2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery.

[7] Le Gruenwald,et al. Research issues in outlier detection for data streams , 2014, SKDD.

[8] Rohini Balkrishna Gurav,et al. Hybrid Approach for Outlier Detection in High Dimensional Dataset , 2014 .

[9] Shian-Shyong Tseng,et al. Two-phase clustering process for outliers detection , 2001, Pattern Recognit. Lett..

[10] Suhaimi Ibrahim,et al. Outlier Detection in Stream Data by Machine Learning and Feature Selection Methods , 2014 .

[11] Durga Toshniwal,et al. Unsupervised outlier detection in streaming data using weighted clustering , 2012, 2012 12th International Conference on Intelligent Systems Design and Applications (ISDA).