A Survey: Approaches for Handling Evolving Data Streams

The increasing use of technology in diverse field has caused generation of huge volumes of information streams. Data streams contains bulk of data points generated at high speed continuously from various applications like log records, web clicks etc. With recent advancement in technology need for analysis of such unbounded streams is increasing day by day. Data mining process helps to excavate useful knowledge from rapidly generated raw data streams. In context with the continuously generated data, mining data streams is emerging challenging task in which several issues like limited space, limited time, accuracy, handling evolving data need to be considered. This paper provides an overview of various approaches for handling changing and evolving data streams.

[1]  Kapil Keshao Wankhade,et al.  A fast and light classifier for data streams , 2010, Evol. Syst..

[2]  Philip S. Yu,et al.  Mining concept-drifting data streams using ensemble classifiers , 2003, KDD '03.

[3]  T. Soni Madhulatha Overview of streaming-data algorithms , 2012, ArXiv.

[4]  Hai Huang,et al.  rDenStream, A Clustering Algorithm over an Evolving Data Stream , 2009, 2009 International Conference on Information Engineering and Computer Science.

[5]  Bhavani M. Thuraisingham,et al.  A Multi-partition Multi-chunk Ensemble Technique to Classify Concept-Drifting Data Streams , 2009, PAKDD.

[6]  K. Prasanna Lakshmi,et al.  A survey on different trends in data streams , 2010, 2010 International Conference on Networking and Information Technology.

[7]  Sudipto Guha,et al.  Streaming-data algorithms for high-quality clustering , 2002, Proceedings 18th International Conference on Data Engineering.

[8]  William Nick Street,et al.  A streaming ensemble algorithm (SEA) for large-scale classification , 2001, KDD '01.

[9]  Quanyuan Wu,et al.  Mining Concept-Drifting and Noisy Data Streams Using Ensemble Classifiers , 2009, 2009 International Conference on Artificial Intelligence and Computational Intelligence.

[10]  Aoying Zhou,et al.  Density-Based Clustering over an Evolving Data Stream with Noise , 2006, SDM.

[11]  Geoff Hulten,et al.  Mining time-changing data streams , 2001, KDD '01.

[12]  Daniel Barbará,et al.  Requirements for clustering data streams , 2002, SKDD.

[13]  Edwin Lughofer Dynamic Evolving Cluster Models Using On-line Split-and-Merge Operations , 2011, 2011 10th International Conference on Machine Learning and Applications and Workshops.

[14]  Geoff Hulten,et al.  Mining high-speed data streams , 2000, KDD '00.

[15]  Jia-Jin Le,et al.  Dynamic Mining Hierarchical Topic from Web News Stream Data Using Divisive-Agglomerative Clustering Method , 2005, PAKDD.

[16]  Chen Jia,et al.  A Grid and Density-Based Clustering Algorithm for Processing Data Stream , 2008, 2008 Second International Conference on Genetic and Evolutionary Computing.

[17]  Sudipto Guha,et al.  Clustering Data Streams , 2000, FOCS.

[18]  Amr Magdy,et al.  Discovering Clusters with Arbitrary Shapes and Densities in Data Streams , 2011, 2011 10th International Conference on Machine Learning and Applications and Workshops.

[19]  Yi-Hong Lu,et al.  Mining data streams using clustering , 2005, 2005 International Conference on Machine Learning and Cybernetics.

[20]  Philip S. Yu,et al.  A Framework for Clustering Evolving Data Streams , 2003, VLDB.