Reservoir of diverse adaptive learners and stacking fast hoeffding drift detection methods for evolving data streams

The last decade has seen a surge of interest in adaptive learning algorithms for data stream classification, with applications ranging from predicting ozone level peaks, learning stock market indicators, to detecting computer security violations. In addition, a number of methods have been developed to detect concept drifts in these streams. Consider a scenario where we have a number of classifiers with diverse learning styles and different drift detectors. Intuitively, the current ‘best’ (classifier, detector) pair is application dependent and may change as a result of the stream evolution. Our research builds on this observation. We introduce the Tornado framework that implements a reservoir of diverse classifiers, together with a variety of drift detection algorithms. In our framework, all (classifier, detector) pairs proceed, in parallel, to construct models against the evolving data streams. At any point in time, we select the pair which currently yields the best performance. To this end, we introduce the CAR measure, which is employed to balance classification, adaptation and resource utilization requirements. We further incorporate two novel stacking-based drift detection methods, namely the FHDDMS and $$\hbox {FHDDMS}_{\mathrm{add}}$$FHDDMSadd approaches. The experimental evaluation confirms that the current ‘best’ (classifier, detector) pair is not only heavily dependent on the characteristics of the stream, but also that this selection evolves as the stream flows. Further, our FHDDMS variants detect concept drifts accurately in a timely fashion while outperforming the state-of-the-art.

[1]  Geoff Hulten,et al.  Mining high-speed data streams , 2000, KDD '00.

[2]  Yoav Freund,et al.  Large Margin Classification Using the Perceptron Algorithm , 1998, COLT.

[3]  Yun Sing Koh,et al.  One Pass Concept Change Detection for Data Streams , 2013, PAKDD.

[4]  Geoff Holmes,et al.  Fast Perceptron Decision Tree Learning from Evolving Data Streams , 2010, PAKDD.

[5]  Ron Kohavi,et al.  Scaling Up the Accuracy of Naive-Bayes Classifiers: A Decision-Tree Hybrid , 1996, KDD.

[6]  Albert Bifet,et al.  DATA STREAM MINING A Practical Approach , 2009 .

[7]  Gerhard Widmer,et al.  Adapting to Drift in Continuous Domains , 2007 .

[8]  Gillian Dobbie,et al.  Drift Detection Using Stream Volatility , 2015, ECML/PKDD.

[9]  João Gama,et al.  Decision trees for mining data streams , 2006, Intell. Data Anal..

[10]  Sung-Bae Cho,et al.  Activity recognition based on wearable sensors using selection/fusion hybrid ensemble , 2011, 2011 IEEE International Conference on Systems, Man, and Cybernetics.

[11]  João Gama,et al.  Ensemble learning for data stream analysis: A survey , 2017, Inf. Fusion.

[12]  Nathalie Japkowicz,et al.  Big Data Analysis: New Algorithms for a New Society , 2015 .

[13]  A. Bifet,et al.  Early Drift Detection Method , 2005 .

[14]  Antanas Verikas,et al.  Hybrid and ensemble-based soft computing techniques in bankruptcy prediction: a survey , 2010, Soft Comput..

[15]  Gerhard Widmer,et al.  Adapting to Drift in Continuous Domains (Extended Abstract) , 1995, ECML.

[16]  Herna L. Viktor,et al.  Fast Hoeffding Drift Detection Method for Evolving Data Streams , 2016, ECML/PKDD.

[17]  João Gama,et al.  A survey on concept drift adaptation , 2014, ACM Comput. Surv..

[18]  Shai Ben-David,et al.  Detecting Change in Data Streams , 2004, VLDB.

[19]  E. S. Page CONTINUOUS INSPECTION SCHEMES , 1954 .

[20]  Takaaki Ohishi,et al.  A Hybrid Ensemble Model Applied to the Short-Term Load Forecasting Problem , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[21]  Roberto Souto Maior de Barros,et al.  Wilcoxon Rank Sum Test Drift Detector , 2018, Neurocomputing.

[22]  W. Hoeffding Probability Inequalities for sums of Bounded Random Variables , 1963 .

[23]  Marcin Budka,et al.  Towards cost-sensitive adaptation: When is it worth updating your predictive model? , 2015, Neurocomputing.

[24]  Mykola Pechenizkiy,et al.  An Overview of Concept Drift Applications , 2016 .

[25]  N. Fisher,et al.  Probability Inequalities for Sums of Bounded Random Variables , 1994 .

[26]  Dimitris K. Tasoulis,et al.  Exponentially weighted moving average charts for detecting concept drift , 2012, Pattern Recognit. Lett..

[27]  A E Bostwick,et al.  THE THEORY OF PROBABILITIES. , 1896, Science.

[28]  Herna L. Viktor,et al.  Intelligent Adaptive Ensembles for Data Stream Mining: A High Return on Investment Approach , 2015, NFMCP.

[29]  Mohamed Medhat Gaber,et al.  Pocket Data Mining: Big Data on Small Devices , 2013 .

[30]  Kuo-Wei Hsu,et al.  A Theoretical Analysis of Why Hybrid Ensembles Work , 2017, Comput. Intell. Neurosci..

[31]  Denis J. Dean,et al.  Comparative accuracies of artificial neural networks and discriminant analysis in predicting forest cover types from cartographic variables , 1999 .

[32]  Jeffrey Scott Vitter,et al.  Random sampling with a reservoir , 1985, TOMS.

[33]  K TasoulisDimitris,et al.  Exponentially weighted moving average charts for detecting concept drift , 2012 .

[34]  Yun Sing Koh,et al.  Detecting concept change in dynamic data streams , 2013, Machine Learning.

[35]  Lauri Koskinen,et al.  Towards Hardware-driven Design of Low-energy Algorithms for Data Analysis , 2015, SGMD.

[36]  Koichiro Yamauchi,et al.  Detecting Concept Drift Using Statistical Testing , 2007, Discovery Science.

[37]  Ricard Gavaldà,et al.  Learning from Time-Changing Data with Adaptive Windowing , 2007, SDM.

[38]  Geoff Hulten,et al.  Mining time-changing data streams , 2001, KDD '01.

[39]  João Gama,et al.  Learning with Drift Detection , 2004, SBIA.

[40]  Ivan Bratko,et al.  Machine Learning by Function Decomposition , 1997, ICML.

[41]  Yoav Freund,et al.  Large Margin Classification Using the Perceptron Algorithm , 1998, COLT' 98.

[42]  Indre Zliobaite,et al.  How good is the Electricity benchmark for evaluating concept drift adaptation , 2013, ArXiv.

[43]  João Gama,et al.  Fading histograms in detecting distribution and concept changes , 2017, International Journal of Data Science and Analytics.

[44]  João Gama,et al.  On evaluating stream learning algorithms , 2012, Machine Learning.

[45]  Yuzuru Tanaka,et al.  New Frontiers in Mining Complex Patterns , 2017, Lecture Notes in Computer Science.

[46]  José del Campo-Ávila,et al.  Online and Non-Parametric Drift Detection Methods Based on Hoeffding’s Bounds , 2015, IEEE Transactions on Knowledge and Data Engineering.

[47]  Roberto Souto Maior de Barros,et al.  RDDM: Reactive drift detection method , 2017, Expert Syst. Appl..

[48]  Erfu Yang,et al.  A Novel Active Semisupervised Convolutional Neural Network Algorithm for SAR Image Recognition , 2017, Comput. Intell. Neurosci..

[49]  Geoff Holmes,et al.  New ensemble methods for evolving data streams , 2009, KDD.

[50]  Mohamed Medhat Gaber,et al.  Pocket Data Mining , 2014 .

[51]  Marcus A. Maloof,et al.  Paired Learners for Concept Drift , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[52]  Herna L. Viktor,et al.  A Framework for Classification in Data Streams Using Multi-strategy Learning , 2016, DS.

[53]  S. W. Roberts,et al.  Control Chart Tests Based on Geometric Moving Averages , 2000, Technometrics.