Detecting concept drift in data streams using model explanation

A novel concept drift detector for data streams is proposed.The drift detector can be combined with an arbitrary classification algorithm.The drift detector uses model explanation to detect concept drift.The approach features good drift detection, accuracy, robustness and sensitivity.Interpretable macro- and micro- visualization of concept drift is proposed. Learning from data streams (incremental learning) is increasingly attracting research focus due to many real-world streaming problems and due to many open challenges, among which is the detection of concept drift a phenomenon when the data distribution changes and makes the current prediction model inaccurate or obsolete. Current state-of-the art detection methods can be roughly split into performance monitoring algorithms and distribution comparing algorithms. In this work we propose a novel concept drift detector that can be combined with an arbitrary classification algorithm. The proposed concept drift detector is based on computing multiple model explanations over time and observing the magnitudes of their changes. The model explanation is computed using a methodology that yields attribute-value contributions for prediction outcomes and thus provides insight into the models decision-making process and enables its transparency. The evaluation has revealed that the methods surpass the baseline methods in terms of concept drift detection, accuracy, robustness and sensitivity. To even further augment interpretability, we visualized the detection of concept drift, enabling macro and micro views of the data.

[1]  João Gama,et al.  Regression Trees from Data Streams with Drift Detection , 2009, Discovery Science.

[2]  Ivan Bruha,et al.  Postprocessing in machine learning and data mining , 2000, SKDD.

[3]  Saso Dzeroski,et al.  Learning model trees from evolving data streams , 2010, Data Mining and Knowledge Discovery.

[4]  Ricard Gavaldà,et al.  Adaptive Learning from Evolving Data Streams , 2009, IDA.

[5]  Georgios Paliouras,et al.  Machine Learning and Its Applications: advanced lectures , 2001 .

[6]  Geoff Holmes,et al.  MOA: Massive Online Analysis , 2010, J. Mach. Learn. Res..

[7]  Harry Wechsler,et al.  On the Detection of Concept Changes in Time-Varying Data Stream by Testing Exchangeability , 2005, UAI.

[8]  Shai Ben-David,et al.  Detecting Change in Data Streams , 2004, VLDB.

[9]  Richard Granger,et al.  Incremental Learning from Noisy Data , 1986, Machine Learning.

[10]  W. Cleveland,et al.  Graphical Perception: Theory, Experimentation, and Application to the Development of Graphical Methods , 1984 .

[11]  Ricard Gavaldà,et al.  Learning from Time-Changing Data with Adaptive Windowing , 2007, SDM.

[12]  Khaled Ghédira,et al.  Discussion and review on evolving data streams and concept drift adapting , 2018, Evol. Syst..

[13]  Edwin Lughofer,et al.  Self-adaptive and local strategies for a smooth treatment of drifts in data streams , 2014, Evol. Syst..

[14]  Francisco Herrera,et al.  Enhancing the effectiveness and interpretability of decision tree and rule induction classifiers with evolutionary training set selection over imbalanced problems , 2009, Appl. Soft Comput..

[15]  Nikola Kasabov,et al.  Evolving Connectionist Systems: The Knowledge Engineering Approach , 2007 .

[16]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[17]  Edwin Lughofer,et al.  Recognizing input space and target concept drifts in data streams with scarcely labeled and unlabelled instances , 2016, Inf. Sci..

[18]  Jerzy Stefanowski,et al.  Reacting to Different Types of Concept Drift: The Accuracy Updated Ensemble Algorithm , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[19]  Lucy T. Nowell,et al.  ThemeRiver: visualizing theme changes over time , 2000, IEEE Symposium on Information Visualization 2000. INFOVIS 2000. Proceedings.

[20]  Edwin Lughofer,et al.  Evolving Fuzzy Systems - Methodologies, Advanced Concepts and Applications , 2011, Studies in Fuzziness and Soft Computing.

[21]  Michel C. A. Klein,et al.  Concept drift and how to identify it , 2011, J. Web Semant..

[22]  João Gama,et al.  A survey on concept drift adaptation , 2014, ACM Comput. Surv..

[23]  Michaela M. Black,et al.  The use of time stamps in handling latency and concept drift in online learning , 2012, Evol. Syst..

[24]  Marcus A. Maloof,et al.  Dynamic Weighted Majority: An Ensemble Method for Drifting Concepts , 2007, J. Mach. Learn. Res..

[25]  Charu C. Aggarwal,et al.  Detecting Recurring and Novel Classes in Concept-Drifting Data Streams , 2011, 2011 IEEE 11th International Conference on Data Mining.

[26]  Dimitrios Gunopulos,et al.  Mining Time Series Data , 2005, Data Mining and Knowledge Discovery Handbook.

[27]  Vipin Kumar,et al.  Chapman & Hall/CRC Data Mining and Knowledge Discovery Series , 2008 .

[28]  Rodrigo Fernandes de Mello,et al.  Multidimensional surrogate stability to detect data stream concept drift , 2017, Expert Syst. Appl..

[29]  Gerhard Widmer,et al.  Learning in the Presence of Concept Drift and Hidden Contexts , 1996, Machine Learning.

[30]  Pedro M. Domingos A few useful things to know about machine learning , 2012, Commun. ACM.

[31]  Edward R. Tufte,et al.  The Visual Display of Quantitative Information , 1986 .

[32]  Plamen Angelov,et al.  Evolving Intelligent Systems: Methodology and Applications , 2010 .

[33]  Benoit B. Mandelbrot,et al.  Fractal Geometry of Nature , 1984 .

[34]  Erik Strumbelj,et al.  An Efficient Explanation of Individual Classifications using Game Theory , 2010, J. Mach. Learn. Res..

[35]  Jude W. Shavlik,et al.  Understanding Time-Series Networks: A Case Study in Rule Extraction , 1997, Int. J. Neural Syst..

[36]  Latesh G. Malik,et al.  A review on real time data stream classification and adapting to various concept drift scenarios , 2014, 2014 IEEE International Advance Computing Conference (IACC).

[37]  William S. Cleveland,et al.  Visualizing Data , 1993 .

[38]  Alexey Tsymbal,et al.  The problem of concept drift: definitions and related work , 2004 .

[39]  W. Cleveland The elements of graphing data , 1986 .

[40]  Erik Strumbelj,et al.  Explaining instance classifications with interactions of subsets of feature values , 2009, Data Knowl. Eng..

[41]  Ryszard S. Michalski,et al.  A Theory and Methodology of Inductive Learning , 1983, Artificial Intelligence.

[42]  M. C. Monard,et al.  Extracting rules from neural networks using symbolic algorithms: preliminary results , 2001, Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001.

[43]  Edwin Lughofer,et al.  Advanced linguistic explanations of classifier decisions for users' annotation support , 2016, 2016 IEEE 8th International Conference on Intelligent Systems (IS).

[44]  A. Goshtasby Similarity and Dissimilarity Measures , 2012 .

[45]  Geoff Hulten,et al.  Mining time-changing data streams , 2001, KDD '01.

[46]  Xin Yao,et al.  The Impact of Diversity on Online Ensemble Learning in the Presence of Concept Drift , 2010, IEEE Transactions on Knowledge and Data Engineering.

[47]  Andrew Vande Moere Time-Varying Data Visualization Using Information Flocking Boids , 2004 .

[48]  Geoffrey Holmes,et al.  Mining data streams using option trees , 2003 .

[49]  Edward R. Tufte,et al.  Envisioning Information , 1990 .

[50]  Ádamo L. de Santana,et al.  Strategies for improving the modeling and interpretability of Bayesian networks , 2007, Data Knowl. Eng..

[51]  Charu C. Aggarwal,et al.  The Internet of Things: A Survey from the Data-Centric Perspective , 2013, Managing and Mining Sensor Data.

[52]  Edwin Lughofer,et al.  On-line assurance of interpretability criteria in evolving fuzzy systems - Achievements, new concepts and open issues , 2013, Inf. Sci..

[53]  Marc Alexa,et al.  Visualizing time-series on spirals , 2001, IEEE Symposium on Information Visualization, 2001. INFOVIS 2001..

[54]  James T. Enns,et al.  Attention and Visual Memory in Visualization and Computer Graphics , 2012, IEEE Transactions on Visualization and Computer Graphics.

[55]  E. S. Page CONTINUOUS INSPECTION SCHEMES , 1954 .

[56]  Igor Kononenko,et al.  Visualization and Concept Drift Detection Using Explanations of Incremental Models , 2014, Informatica.

[57]  Geoff Holmes,et al.  CD-MOA: Change Detection Framework for Massive Online Analysis , 2013, IDA.

[58]  Brian Mac Namee,et al.  Drift detection using uncertainty distribution divergence , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[59]  Raphaël Féraud,et al.  Contact personalization using a score understanding method , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[60]  Nitesh V. Chawla,et al.  Noname manuscript No. (will be inserted by the editor) Learning from Streaming Data with Concept Drift and Imbalance: An Overview , 2022 .

[61]  Jesús S. Aguilar-Ruiz,et al.  Knowledge discovery from data streams , 2009, Intell. Data Anal..

[62]  Mahardhika Pratama,et al.  An Incremental Learning of Concept Drifts Using Evolving Type-2 Recurrent Fuzzy Neural Networks , 2017, IEEE Transactions on Fuzzy Systems.

[63]  Michael L. Pack Wide-area, web-based mobility analysis using probe data , 2012, 2012 15th International IEEE Conference on Intelligent Transportation Systems.

[64]  Stephen Grossberg,et al.  Fuzzy ARTMAP: A neural network architecture for incremental supervised learning of analog multidimensional maps , 1992, IEEE Trans. Neural Networks.

[65]  John Attia,et al.  Using Bayes’ nomogram to help interpret odds ratios , 2003, ACP journal club.

[66]  Rodrigo Fernandes de Mello,et al.  Proposal of a new stability concept to detect changes in unsupervised data streams , 2014, Expert Syst. Appl..

[67]  Marko Robnik-Sikonja,et al.  Explaining Classifications For Individual Instances , 2008, IEEE Transactions on Knowledge and Data Engineering.

[68]  Lucy T. Nowell,et al.  Change blindness in information visualization: a case study , 2001, IEEE Symposium on Information Visualization, 2001. INFOVIS 2001..

[69]  Geoff Hulten,et al.  Mining high-speed data streams , 2000, KDD '00.

[70]  Marie Persson,et al.  Improved concept drift handling in surgery prediction and other applications , 2015, Knowledge and Information Systems.

[71]  Edwin Lughofer,et al.  Learning in Non-Stationary Environments: Methods and Applications , 2012 .