A survey of outlier detection algorithms for data streams

Data mining is characterized as the process of examining hidden patterns and outlining into some useful information. It is an exciting field of research for researchers. Data streams are continuous instance of records and mining interesting knowledge from this instance is known as data stream mining. Outlier detection is currently an important research problem in many fields and is also involved in many of the applications. Outlier detection in streaming data is a challenging task as only one scan is possible and they need huge amount of storage which is practically infeasible. There are many existing methods for outlier detection based on distance measure but are not efficient for data stream as they are dynamic in nature. This paper discusses on various algorithms for outlier detection on data streams.

[1]  Madhu Shukla,et al.  Analysis and evaluation of outlier detection algorithms in data streams , 2015, 2015 International Conference on Computer, Communication and Control (IC4).

[2]  Tian Zhang,et al.  BIRCH: an efficient data clustering method for very large databases , 1996, SIGMOD '96.

[3]  Madhu Shukla,et al.  A review on outlier detection techniques on data stream by using different approaches of K-Means algorithm , 2015, 2015 International Conference on Advances in Computer Engineering and Applications.

[4]  Kun Li,et al.  Efficient Clustering-Based Outlier Detection Algorithm for Dynamic Data Stream , 2008, 2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery.

[5]  S. Vijayarani Ms. P. Jothi Detecting Outliers in Data streams usingClustering Algorithms , 2013 .

[6]  Matthew O. Ward,et al.  Neighbor-based pattern detection for windows over streaming data , 2009, EDBT '09.

[7]  Sudipto Guha,et al.  CURE: an efficient clustering algorithm for large databases , 1998, SIGMOD '98.

[8]  Jiawei Han,et al.  CLARANS: A Method for Clustering Objects for Spatial Data Mining , 2002, IEEE Trans. Knowl. Data Eng..

[9]  Yannis Manolopoulos,et al.  Continuous outlier detection in data streams: an extensible framework and state-of-the-art algorithms , 2013, SIGMOD '13.

[10]  Madhu Shukla,et al.  A novel approach for clustering data streams using granularity technique , 2015, 2015 International Conference on Advances in Computer Engineering and Applications.

[11]  Darshali Thoriya,et al.  Study of Density Based Clustering Techniques on Data Streams , 2015 .

[12]  Fabrizio Angiulli,et al.  Distance-based outlier queries in data streams: the novel task and algorithms , 2010, Data Mining and Knowledge Discovery.

[13]  Le Gruenwald,et al.  Research issues in outlier detection for data streams , 2014, SKDD.

[14]  Yannis Manolopoulos,et al.  Continuous monitoring of distance-based outliers over data streams , 2011, 2011 IEEE 27th International Conference on Data Engineering.