An Efficient Continuous Attributes Handling Method for Mining Concept-Drifting Data Streams Based on Skip List

This paper focuses on continuous attributes handling for mining data stream with concept drift. CVFDT is one of the most successful methods for handling concept drift efficiently. In this paper, we revisit this problem and present an algorithm named SL_CVFDT on top of CVFDT. It is fast as hash table when inserting, seeking or deleting attribute value, and it also can sort the attribute value. The average time cost of search, insertion and deletion is O(log2n),and average memory cost of point is O(n).At the same time, it can get best split point just traverse the skip list once.

[1]  Pedro M. Domingos,et al.  Mining massive data streams , 2005 .

[2]  Geoff Hulten,et al.  Mining time-changing data streams , 2001, KDD '01.

[3]  Ludmila I. Kuncheva,et al.  Classifier Ensembles for Detecting Concept Change in Streaming Data: Overview and Perspectives , 2008 .

[4]  Gerhard Widmer,et al.  Adapting to Drift in Continuous Domains , 2007 .

[5]  Philip S. Yu,et al.  Mining concept-drifting data streams using ensemble classifiers , 2003, KDD '03.

[6]  Richard Granger,et al.  Incremental Learning from Noisy Data , 1986, Machine Learning.

[7]  Yoav Freund,et al.  A new Hedging algorithm and its application to inferring latent random variables , 2008, ArXiv.

[8]  Ralf Klinkenberg,et al.  Learning drifting concepts: Example selection vs. example weighting , 2004, Intell. Data Anal..

[9]  Claude Sammut,et al.  Extracting Hidden Context , 1998, Machine Learning.

[10]  Geoff Hulten,et al.  Mining high-speed data streams , 2000, KDD '00.

[11]  Wu Quan-yuan An Ensemble Classifier Framework for Mining Imbalanced Data Streams , 2010 .

[12]  Mads Haahr,et al.  A Case-Based Approach to Spam Filtering that Can Track Concept Drift , 2003 .

[13]  Marcos Salganicoff,et al.  Tolerating Concept and Sampling Shift in Lazy Learning Using Prediction Error Context Switching , 1997, Artificial Intelligence Review.

[14]  Marcus A. Maloof,et al.  Dynamic Weighted Majority: An Ensemble Method for Drifting Concepts , 2007, J. Mach. Learn. Res..

[15]  Gerhard Widmer,et al.  Learning in the Presence of Concept Drift and Hidden Contexts , 1996, Machine Learning.