McDiarmid Drift Detection Methods for Evolving Data Streams

Increasingly, Internet of Things (IoT) domains, such as sensor networks, smart cities, and social networks, generate vast amounts of data. Such data are not only unbounded and rapidly evolving; their content also changes dynamically over time, often in unforeseen ways. These variations are due to so-called concept drifts, caused by changes in the underlying data generation mechanisms. In a classification setting, concept drift causes previously learned models to become inaccurate, unsafe, and even unusable. Accordingly, concept drifts need to be detected and handled as soon as possible. In medical applications and emergency response settings, for example, changes in behaviour should be detected in near real time to avoid potential loss of life. To this end, we introduce the McDiarmid Drift Detection Method (MDDM), which uses McDiarmid’s inequality [1] to detect concept drift. The MDDM approach slides a window over prediction results and associates the window entries with weights. Higher weights are assigned to the most recent entries to emphasize their importance. As instances are processed, the detection algorithm compares the weighted mean of the elements inside the sliding window with the maximum weighted mean observed so far. A significant difference between the two weighted means, upper-bounded by McDiarmid’s inequality, implies a concept drift. Our extensive experiments on synthetic and real-world data streams show that our novel method outperforms the state of the art. Specifically, MDDM yields shorter detection delays and lower false negative rates while maintaining high classification accuracy.
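The detection loop described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the class name, the linear weighting scheme, the default values (`window_size=100`, `delta=1e-6`, `step=0.01`), and the reset after a detection are all assumptions made for illustration. The threshold is obtained by solving the McDiarmid bound exp(-2ε²/Σvᵢ²) = δ for ε, where vᵢ are the normalised weights.

```python
import math
from collections import deque


class WeightedWindowDriftDetector:
    """Illustrative sketch of a McDiarmid-bounded drift detector."""

    def __init__(self, window_size=100, delta=1e-6, step=0.01):
        # Assumed weighting scheme: linearly increasing weights, so the
        # newest window entry receives the largest weight.
        raw = [1.0 + i * step for i in range(window_size)]
        total = sum(raw)
        self.weights = [w / total for w in raw]  # normalised to sum to 1

        # McDiarmid's inequality: P(deviation >= eps) <= exp(-2 eps^2 / sum(v_i^2)).
        # Setting the right-hand side to delta and solving for eps gives:
        sum_sq = sum(v * v for v in self.weights)
        self.epsilon = math.sqrt(0.5 * sum_sq * math.log(1.0 / delta))

        self.window = deque(maxlen=window_size)
        self.max_mean = 0.0

    def update(self, correct):
        """Feed one prediction result (True if correct); return True on drift."""
        self.window.append(1.0 if correct else 0.0)
        if len(self.window) < self.window.maxlen:
            return False  # wait until the window is full

        # The deque iterates oldest to newest, matching the increasing weights.
        mean = sum(w * x for w, x in zip(self.weights, self.window))
        self.max_mean = max(self.max_mean, mean)

        if self.max_mean - mean >= self.epsilon:
            # Assumed behaviour: start afresh after signalling a drift.
            self.window.clear()
            self.max_mean = 0.0
            return True
        return False
```

To use the sketch, call `update(prediction == label)` once per processed instance and retrain or replace the classifier whenever it returns `True`. A smaller `delta` raises the threshold, which trades fewer false alarms for a longer detection delay.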

[1] Marcus A. Maloof, et al. Paired Learners for Concept Drift, 2008, 2008 Eighth IEEE International Conference on Data Mining.

[2] Herna L. Viktor, et al. A Framework for Classification in Data Streams Using Multi-strategy Learning, 2016, DS.

[3] Heiko Wersing, et al. KNN Classifier with Self Adjusting Memory for Heterogeneous Concept Drift, 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[4] Roberto Souto Maior de Barros, et al. RDDM: Reactive drift detection method, 2017, Expert Syst. Appl.

[5] Shie Mannor, et al. Concept Drift Detection Through Resampling, 2014, ICML.

[6] Piotr Duda, et al. Convergent Time-Varying Regression Models for Data Streams: Tracking Concept Drift by the Recursive Parzen-Based Generalized Regression Neural Networks, 2017, Int. J. Neural Syst.

[7] Indre Zliobaite, et al. Learning under Concept Drift: an Overview, 2010, ArXiv.

[8] Denis J. Dean, et al. Comparative accuracies of artificial neural networks and discriminant analysis in predicting forest cover types from cartographic variables, 1999.

[9] Eyke Hüllermeier, et al. Open challenges for data stream mining research, 2014, SKDD.

[10] S. W. Roberts. Control chart tests based on geometric moving averages, 2000.

[11] João Gama, et al. Learning with Drift Detection, 2004, SBIA.

[12] Yun Sing Koh, et al. Detecting concept change in dynamic data streams, 2013, Machine Learning.

[13] Indre Zliobaite, et al. How good is the Electricity benchmark for evaluating concept drift adaptation, 2013, ArXiv.

[14] João Gama, et al. Issues in evaluation of stream learning algorithms, 2009, KDD.

[15] Geoff Hulten, et al. Mining high-speed data streams, 2000, KDD '00.

[16] Heiko Wersing, et al. Incremental on-line learning: A review and comparison of state of the art algorithms, 2018, Neurocomputing.

[17] W. Hoeffding. Probability Inequalities for sums of Bounded Random Variables, 1963.

[18] Colin McDiarmid, et al. Surveys in Combinatorics, 1989: On the method of bounded differences, 1989.

[19] Cesare Alippi, et al. Hierarchical Change-Detection Tests, 2017, IEEE Transactions on Neural Networks and Learning Systems.

[20] Pat Langley, et al. An Analysis of Bayesian Classifiers, 1992, AAAI.

[21] Jeffrey Scott Vitter, et al. Random sampling with a reservoir, 1985, TOMS.

[22] Cesare Alippi, et al. Just-In-Time Classifiers for Recurrent Concepts, 2013, IEEE Transactions on Neural Networks and Learning Systems.

[23] Indrė Žliobaitė, et al. Learning under Concept Drift: an Overview, 2010.

[24] Mohamed Medhat Gaber, et al. Learning from Data Streams: Processing Techniques in Sensor Networks, 2007.

[25] Marley M. B. R. Vellasco, et al. DetectA: abrupt concept drift detection in non-stationary environments, 2018, Appl. Soft Comput.

[26] João Gama, et al. A survey on concept drift adaptation, 2014, ACM Comput. Surv.

[27] Geoff Holmes, et al. New ensemble methods for evolving data streams, 2009, KDD.

[28] Dimitris K. Tasoulis, et al. Exponentially weighted moving average charts for detecting concept drift, 2012, Pattern Recognit. Lett.

[29] A. E. Bostwick, et al. The Theory of Probabilities, 1896, Science.

[30] Ralf Klinkenberg, et al. Learning drifting concepts: Example selection vs. example weighting, 2004, Intell. Data Anal.

[31] João Gama, et al. Fading histograms in detecting distribution and concept changes, 2017, International Journal of Data Science and Analytics.

[32] Albert Bifet, et al. Data Stream Mining: A Practical Approach, 2009.

[33] Yun Sing Koh, et al. One Pass Concept Change Detection for Data Streams, 2013, PAKDD.

[34] Gillian Dobbie, et al. Drift Detection Using Stream Volatility, 2015, ECML/PKDD.

[35] Herna L. Viktor, et al. Dynamic adaptation of online ensembles for drifting data streams, 2017, Journal of Intelligent Information Systems.

[36] Ricard Gavaldà, et al. Learning from Time-Changing Data with Adaptive Windowing, 2007, SDM.

[37] Herna Viktor, et al. Reservoir of diverse adaptive learners and stacking fast hoeffding drift detection methods for evolving data streams, 2017, Machine Learning.

[38] Thomas G. Dietterich. What is machine learning?, 2020, Archives of Disease in Childhood.

[39] João Gama, et al. Ensemble learning for data stream analysis: A survey, 2017, Inf. Fusion.

[40] Marcin Budka, et al. Towards cost-sensitive adaptation: When is it worth updating your predictive model?, 2015, Neurocomputing.

[41] E. S. Page. Continuous Inspection Schemes, 1954.

[42] Roberto Souto Maior de Barros, et al. Wilcoxon Rank Sum Test Drift Detector, 2018, Neurocomputing.

[43] A. Bifet, et al. Early Drift Detection Method, 2005.

[44] Bingsheng He, et al. Revisiting the Design of Data Stream Processing Systems on Multi-Core Processors, 2017, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).

[45] Herna L. Viktor, et al. Fast Hoeffding Drift Detection Method for Evolving Data Streams, 2016, ECML/PKDD.

[46] Albert Bifet. Classifier Concept Drift Detection and the Illusion of Progress, 2017, ICAISC.

[47] Geoff Holmes, et al. MOA: Massive Online Analysis, 2010, J. Mach. Learn. Res.