Adapting dynamic classifier selection for concept drift

Abstract One popular approach employed to tackle classification problems in a static environment consists in using a Dynamic Classifier Selection (DCS)-based method to select a custom classifier/ensemble for each test instance according to its neighborhood in a validation set, where the selection can be considered region-dependent. This idea can be extended to concept drift scenarios, where the distribution or the a posteriori probabilities may change over time. Nevertheless, in these scenarios, the classifier selection becomes not only region but also time-dependent. By adding a time dependency, in this work, we hypothesize that any DCS-based approach can be used to handle concept drift problems. Since some regions may not be affected by a concept drift, we introduce the idea of concept diversity, which shows that a pool containing classifiers trained under different concepts may be beneficial when dealing with concept drift problems through a DCS approach. The impacts of pruning mechanisms are discussed and seven well-known DCS methods are evaluated in the proposed framework, using a robust experimental protocol based on 12 common concept drift problems with different properties, and the PKLot dataset considering an experimental protocol specially designed in this work to test concept drift methods. The experimental results have shown that the DCS approach comes out ahead in terms of stability, i.e., it performs well in most cases requiring almost no parameter tuning.

[1]  Sameer Singh,et al.  Novelty detection: a review - part 1: statistical approaches , 2003, Signal Process..

[2]  Ludmila I. Kuncheva,et al.  Classifier Ensembles for Changing Environments , 2004, Multiple Classifier Systems.

[3]  Talel Abdessalem,et al.  Adaptive random forests for evolving data stream classification , 2017, Machine Learning.

[4]  Xindong Wu,et al.  Dynamic classifier selection for effective mining from noisy data streams , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[5]  Xue Li,et al.  Classifier Ensemble for Uncertain Data Stream Classification , 2010, PAKDD.

[6]  Sun Jie,et al.  Dynamic financial distress prediction modeling based on slip time window and multiple classifiers , 2010, 2010 International Conference on Management Science & Engineering 17th Annual Conference Proceedings.

[7]  José Ramón Quevedo,et al.  Using ensembles for problems with characterizable changes in data distribution: A case study on quantification , 2017, Inf. Fusion.

[8]  Hamid Beigy,et al.  Detection of evolving concepts in non-stationary data streams: A multiple kernel learning approach , 2018, Expert Syst. Appl..

[9]  Dmitry O. Gorodnichy,et al.  Skew-sensitive boolean combination for adaptive ensembles - An application to face recognition in video surveillance , 2014, Inf. Fusion.

[10]  Yun Sing Koh,et al.  Proactive drift detection: Predicting concept drifts in data streams using probabilistic networks , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[11]  Jerzy Stefanowski,et al.  Accuracy Updated Ensemble for Data Streams with Concept Drift , 2011, HAIS.

[12]  Robi Polikar,et al.  Learning concept drift in nonstationary environments using an ensemble of classifiers based approach , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[13]  Robert Sabourin,et al.  K-Nearest Oracle for Dynamic Ensemble Selection , 2007 .

[14]  Fabio Roli,et al.  Methods for dynamic classifier selection , 1999, Proceedings 10th International Conference on Image Analysis and Processing.

[15]  Mykola Pechenizkiy,et al.  Handling Local Concept Drift with Dynamic Integration of Classifiers: Domain of Antibiotic Resistance in Nosocomial Infections , 2006, 19th IEEE Symposium on Computer-Based Medical Systems (CBMS'06).

[16]  Geoff Holmes,et al.  Leveraging Bagging for Evolving Data Streams , 2010, ECML/PKDD.

[17]  Richard Granger,et al.  Incremental Learning from Noisy Data , 1986, Machine Learning.

[18]  Ilia Nouretdinov,et al.  Transcend: Detecting Concept Drift in Malware Classification Models , 2017, USENIX Security Symposium.

[19]  Gian Luca Marcialis,et al.  A study on the performances of dynamic classifier selection based on local accuracy estimation , 2005, Pattern Recognit..

[20]  Robert Sabourin,et al.  From dynamic classifier selection to dynamic ensemble selection , 2008, Pattern Recognit..

[21]  Bartosz Krawczyk,et al.  One-class classifiers with incremental learning and forgetting for data streams with concept drift , 2015, Soft Comput..

[22]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[23]  Cesare Alippi,et al.  Just-In-Time Classifiers for Recurrent Concepts , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[24]  George D. C. Cavalcanti,et al.  Dynamic classifier selection: Recent advances and perspectives , 2018, Inf. Fusion.

[25]  Philip S. Yu,et al.  Mining concept-drifting data streams using ensemble classifiers , 2003, KDD '03.

[26]  Robi Polikar,et al.  Incremental Learning of Concept Drift in Nonstationary Environments , 2011, IEEE Transactions on Neural Networks.

[27]  C. Kaynak Cascading Classiiers , 1998 .

[28]  William Nick Street,et al.  A streaming ensemble algorithm (SEA) for large-scale classification , 2001, KDD '01.

[29]  Geoff Holmes,et al.  MOA: Massive Online Analysis , 2010, J. Mach. Learn. Res..

[30]  Xin Yao,et al.  DDD: A New Ensemble Approach for Dealing with Concept Drift , 2012, IEEE Transactions on Knowledge and Data Engineering.

[31]  Kevin W. Bowyer,et al.  Combination of multiple classifiers using local accuracy estimates , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[32]  Marcus A. Maloof,et al.  Dynamic Weighted Majority: An Ensemble Method for Drifting Concepts , 2007, J. Mach. Learn. Res..

[33]  Luiz Eduardo Soares de Oliveira,et al.  Dynamic selection of classifiers - A comprehensive review , 2014, Pattern Recognit..

[34]  João Gama,et al.  Ensemble learning for data stream analysis: A survey , 2017, Inf. Fusion.

[35]  Robert Sabourin,et al.  A dynamic optimization approach for adaptive incremental learning , 2011, Int. J. Intell. Syst..

[36]  Albert Bifet Classifier Concept Drift Detection and the Illusion of Progress , 2017, ICAISC.

[37]  Ludmila I. Kuncheva,et al.  On the window size for classification in changing environments , 2009, Intell. Data Anal..

[38]  Gregory Ditzler,et al.  Learning in Nonstationary Environments: A Survey , 2015, IEEE Computational Intelligence Magazine.

[39]  Luiz Eduardo Soares de Oliveira,et al.  Handling Concept Drifts Using Dynamic Selection of Classifiers , 2016, 2016 IEEE 28th International Conference on Tools with Artificial Intelligence (ICTAI).

[40]  Marley M. B. R. Vellasco,et al.  Learning under Concept Drift using a Neuro-Evolutionary Ensemble , 2013, Int. J. Comput. Intell. Appl..

[41]  G. Hommel,et al.  Improvements of General Multiple Test Procedures for Redundant Systems of Hypotheses , 1988 .

[42]  Ricard Gavaldà,et al.  Adaptive Learning from Evolving Data Streams , 2009, IDA.

[43]  Hui Li,et al.  Dynamic financial distress prediction using instance selection for the disposal of concept drift , 2011, Expert Syst. Appl..

[44]  Jean Paul Barddal,et al.  A Survey on Ensemble Learning for Data Stream Classification , 2017, ACM Comput. Surv..

[45]  Antoine Cornuéjols,et al.  A New On-Line Learning Method for Coping with Recurring Concepts: The ADACC System , 2013, ICONIP.

[46]  Geoff Hulten,et al.  Mining high-speed data streams , 2000, KDD '00.

[47]  Enrique Alegre,et al.  Class distribution estimation based on the Hellinger distance , 2013, Inf. Sci..

[48]  Ricard Gavaldà,et al.  Learning from Time-Changing Data with Adaptive Windowing , 2007, SDM.

[49]  Francesca Mangili,et al.  Should We Really Use Post-Hoc Tests Based on Mean-Ranks? , 2015, J. Mach. Learn. Res..

[50]  Nojun Kwak,et al.  Feature extraction for classification problems and its application to face recognition , 2008, Pattern Recognit..

[51]  João Gama,et al.  A survey on concept drift adaptation , 2014, ACM Comput. Surv..

[52]  Nitesh V. Chawla,et al.  Noname manuscript No. (will be inserted by the editor) Learning from Streaming Data with Concept Drift and Imbalance: An Overview , 2022 .

[53]  Girijesh Prasad,et al.  A combination of transductive and inductive learning for handling non-stationarities in motor imagery classification , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[54]  Mykola Pechenizkiy,et al.  Dynamic integration of classifiers for handling concept drift , 2008, Inf. Fusion.

[55]  João Gama,et al.  Learning with Drift Detection , 2004, SBIA.

[56]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[57]  Xin Yao,et al.  The Impact of Diversity on Online Ensemble Learning in the Presence of Concept Drift , 2010, IEEE Transactions on Knowledge and Data Engineering.

[58]  Andries Petrus Engelbrecht,et al.  Training feedforward neural networks with dynamic particle swarm optimisation , 2012, Swarm Intelligence.

[59]  Mehmed M. Kantardzic,et al.  A grid density based framework for classifying streaming data in the presence of concept drift , 2015, Journal of Intelligent Information Systems.

[60]  Robi Polikar,et al.  Ensemble of classifiers based incremental learning with dynamic voting weight update , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003..

[61]  Luiz Eduardo Soares de Oliveira,et al.  PKLot - A robust dataset for parking lot classification , 2015, Expert Syst. Appl..

[62]  Gerhard Widmer,et al.  Learning in the Presence of Concept Drift and Hidden Contexts , 1996, Machine Learning.

[63]  Patrick P. K. Chan,et al.  Dynamic base classifier pool for classifier selection in Multiple Classifier Systems , 2011, 2011 International Conference on Machine Learning and Cybernetics.

[64]  Leandro L. Minku,et al.  FEDD: Feature Extraction for Explicit Concept Drift Detection in time series , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[65]  A. Bifet,et al.  Early Drift Detection Method , 2005 .

[66]  S. García,et al.  An Extension on "Statistical Comparisons of Classifiers over Multiple Data Sets" for all Pairwise Comparisons , 2008 .

[67]  Geoff Holmes,et al.  New ensemble methods for evolving data streams , 2009, KDD.

[68]  Xue Li,et al.  Dynamic classifier ensemble for positive unlabeled text stream classification , 2012, Knowledge and Information Systems.

[69]  Roberto Souto Maior de Barros,et al.  A comparative study on concept drift detectors , 2014, Expert Syst. Appl..