A Critical Review on Adverse Effects of Concept Drift over Machine Learning Classification Models

Big Data (BD) is participating in the current computing revolution in a big way. Industries and organizations are utilizing their insights for Business Intelligence using Machine Learning Models (ML-Models). Deep Learning Models (DL-Models) have been proven to be a better selection than Shallow Learning Models (SL-Models). However, the dynamic characteristics of BD introduce many critical issues for DL-Models, Concept Drift (CD) is one of them. CD issue frequently appears in Online Supervised Learning environments in which data trends change over time. The problem may even worsen in the BD environment due to veracity and variability factors. Due to the CD issue, the accuracy of classification results degrades in ML-Models, which may make ML-Models not applicable. Therefore, ML-Models need to adapt quickly to changes to maintain the accuracy level of the results. In current solutions, a substantial improvement in accuracy and adaptability is needed to make ML-Models robust in a non-stationary environment. In the existing literature, the consolidated information on this issue is not available. Therefore, in this study, we have carried out a systematic critical literature review to discuss the Concept Drift taxonomy and identify the adverse effects and existing approaches to mitigate CD.

[1]  Gregory Ditzler,et al.  Incremental Learning of Concept Drift from Streaming Imbalanced Data , 2013, IEEE Transactions on Knowledge and Data Engineering.

[2]  Xin Zhang,et al.  Classification of Uncertain Data Streams Based on Extreme Learning Machine , 2014, Cognitive Computation.

[3]  Mobashar Rehman,et al.  Accuracy Performance Degradation in Image Classification Models due to Concept Drift , 2019, International Journal of Advanced Computer Science and Applications.

[4]  Jerzy Stefanowski,et al.  Reacting to Different Types of Concept Drift: The Accuracy Updated Ensemble Algorithm , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[5]  Narasimhan Sundararajan,et al.  A Fast and Accurate Online Sequential Learning Algorithm for Feedforward Networks , 2006, IEEE Transactions on Neural Networks.

[6]  Mykola Pechenizkiy,et al.  Dynamic integration of classifiers for handling concept drift , 2008, Inf. Fusion.

[7]  Taghi M. Khoshgoftaar,et al.  Deep learning applications and challenges in big data analytics , 2015, Journal of Big Data.

[8]  Han Wang,et al.  Ensemble Based Extreme Learning Machine , 2010, IEEE Signal Processing Letters.

[9]  Xizhao Wang,et al.  Ensemble online sequential extreme learning machine for large data set classification , 2014, 2014 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[10]  Indre Zliobaite,et al.  Learning under Concept Drift: an Overview , 2010, ArXiv.

[11]  João Gama,et al.  A survey on concept drift adaptation , 2014, ACM Comput. Surv..

[12]  Khaled Ghédira,et al.  Self-Adaptive Windowing Approach for Handling Complex Concept Drift , 2015, Cognitive Computation.

[13]  Hongming Zhou,et al.  Extreme Learning Machine for Regression and Multiclass Classification , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[14]  Katarzyna Musial,et al.  Next challenges for adaptive learning systems , 2012, SKDD.

[15]  Junhong Wang,et al.  Dynamic extreme learning machine for data stream classification , 2017, Neurocomputing.

[16]  Bartosz Krawczyk,et al.  Reacting to different types of concept drift with adaptive and incremental one-class classifiers , 2015, 2015 IEEE 2nd International Conference on Cybernetics (CYBCONF).

[17]  Niall M. Adams,et al.  The impact of changing populations on classifier performance , 1999, KDD '99.

[18]  Junhong Wang,et al.  A fast incremental extreme learning machine algorithm for data streams classification , 2016, Expert Syst. Appl..

[19]  Robi Polikar,et al.  Incremental Learning of Concept Drift in Nonstationary Environments , 2011, IEEE Transactions on Neural Networks.

[20]  André van Schaik,et al.  Online and adaptive pseudoinverse solutions for ELM weights , 2015, Neurocomputing.

[21]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[22]  Zhiping Lin,et al.  Meta-cognitive online sequential extreme learning machine for imbalanced and concept-drifting data classification , 2016, Neural Networks.

[23]  Mohamad Ivan Fanany,et al.  Adaptive Online Sequential ELM for Concept Drift Tackling , 2016, Comput. Intell. Neurosci..

[24]  Yuan Lan,et al.  A constructive enhancement for Online Sequential Extreme Learning Machine , 2009, 2009 International Joint Conference on Neural Networks.

[25]  Li Guo,et al.  Comparative study between incremental and ensemble learning on data streams: Case study , 2014, Journal Of Big Data.

[26]  Mohamad Ivan Fanany,et al.  Adaptive Convolutional ELM For Concept Drift Handling in Online Stream Data , 2016, ArXiv.