A novel concept drift detection method in data streams using ensemble classifiers

Concept drift, change in the underlying distribution that data points come from, is an inevitable phenomenon in data streams. Due to increase in the number of data streams’ applications such as network intrusion detection, weather forecasting, and detection of unconventional behavior in financial transactions; numerous researches have recently been conducted in the area of concept drift detection. An ideal method for concept drift detection should be able to rapidly and correctly identify changes in the underlying distribution of data points and adapt its model as quickly as possible while the memory and processing time is limited. In this paper, we propose a novel explicit method based on ensemble classifiers for detecting concept drift. The method processes samples one by one, and monitors the distribution of ensemble’s error in order to detect probable drifts. After detection of a drift, a new classifier will be trained on the new concept in order to keep the model up-to-date. The proposed method has been evaluated on some artificial and real benchmark data sets. The experiments’ results show that the proposed method is capable of detecting and adjusting to concept drifts from different types, and it has outperformed well-known stateof-the-art methods. Especially, in the case of high-speed concept drifts.

[1]  Shai Ben-David,et al.  Detecting Change in Data Streams , 2004, VLDB.

[2]  Peter Secretan Learning , 1965, Mental Health.

[3]  Hamid Beigy,et al.  Using a classifier pool in accuracy based tracking of recurring concepts in data stream classification , 2013, Evol. Syst..

[4]  Ning Lu,et al.  Concept drift detection via competence models , 2014, Artif. Intell..

[5]  Hamid Beigy,et al.  New Drift Detection Method for Data Streams , 2011, ICAIS.

[6]  Quanyuan Wu,et al.  Mining Concept-Drifting and Noisy Data Streams Using Ensemble Classifiers , 2009, 2009 International Conference on Artificial Intelligence and Computational Intelligence.

[7]  Jie Lu,et al.  Concept Drift Detection Based on Anomaly Analysis , 2014, ICONIP.

[8]  Indre Zliobaite,et al.  Learning under Concept Drift: an Overview , 2010, ArXiv.

[9]  João Gama,et al.  Learning with Drift Detection , 2004, SBIA.

[10]  Xin Yao,et al.  The Impact of Diversity on Online Ensemble Learning in the Presence of Concept Drift , 2010, IEEE Transactions on Knowledge and Data Engineering.

[11]  Hamid Beigy,et al.  Semi-supervised Ensemble Learning of Data Streams in the Presence of Concept Drift , 2012, HAIS.

[12]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[13]  Lei Du,et al.  A Selective Detector Ensemble for Concept Drift Detection , 2015, Comput. J..

[14]  Philip S. Yu,et al.  A framework for on-demand classification of evolving data streams , 2006, IEEE Transactions on Knowledge and Data Engineering.

[15]  Koichiro Yamauchi,et al.  Detecting Concept Drift Using Statistical Testing , 2007, Discovery Science.

[16]  Sameer Singh,et al.  Novelty detection: a review - part 1: statistical approaches , 2003, Signal Process..

[17]  Xin Yao,et al.  DDD: A New Ensemble Approach for Dealing with Concept Drift , 2012, IEEE Transactions on Knowledge and Data Engineering.

[18]  Marcus A. Maloof,et al.  Dynamic Weighted Majority: An Ensemble Method for Drifting Concepts , 2007, J. Mach. Learn. Res..

[19]  Geoff Holmes,et al.  New ensemble methods for evolving data streams , 2009, KDD.

[20]  D. Brzezinski MINING DATA STREAMS WITH CONCEPT DRIFT , 2010 .

[21]  Yunjun Gao,et al.  Concept Drifting Detection on Noisy Streaming Data in Random Ensemble Decision Trees , 2009, MLDM.

[22]  A. Bifet,et al.  Early Drift Detection Method , 2005 .

[23]  Koichiro Yamauchi,et al.  Learning, detecting, understanding, and predicting concept changes , 2009, 2009 International Joint Conference on Neural Networks.

[24]  Kyosuke Nishida,et al.  Learning and Detecting Concept Drift , 2008 .

[25]  Sameer Singh,et al.  Novelty detection: a review - part 2: : neural network based approaches , 2003, Signal Process..

[26]  Lei Du,et al.  Detecting concept drift: An information entropy based method using an adaptive sliding window , 2014, Intell. Data Anal..

[27]  Philip S. Yu,et al.  Mining concept-drifting data streams using ensemble classifiers , 2003, KDD '03.

[28]  Takashi Omori,et al.  ACE: Adaptive Classifiers-Ensemble System for Concept-Drifting Environments , 2005, Multiple Classifier Systems.

[29]  Christopher M. Bishop,et al.  Novelty detection and neural network validation , 1994 .

[30]  Hamid Beigy,et al.  Novel class detection in data streams using local patterns and neighborhood graph , 2015, Neurocomputing.

[31]  Geoff Holmes,et al.  MOA: Massive Online Analysis , 2010, J. Mach. Learn. Res..

[32]  Hamid Beigy,et al.  A new method of mining data streams using harmony search , 2012, Journal of Intelligent Information Systems.