Regression trees for streaming data with local performance guarantees

Online predictive modeling of streaming data is a key task for big data analytics. In this paper, a novel approach for efficient online learning of regression trees is proposed, which continuously updates, rather than retrains, the tree as more labeled data become available. A conformal predictor outputs prediction sets instead of point predictions; which for regression translates into prediction intervals. The key property of a conformal predictor is that it is always valid, i.e., the error rate, on novel data, is bounded by a preset significance level. Here, we suggest applying Mondrian conformal prediction on top of the resulting models, in order to obtain regression trees where not only the tree, but also each and every rule, corresponding to a path from the root node to a leaf, is valid. Using Mondrian conformal prediction, it becomes possible to analyze and explore the different rules separately, knowing that their accuracy, in the long run, will not be below the preset significance level. An empirical investigation, using 17 publicly available data sets, confirms that the resulting rules are independently valid, but also shows that the prediction intervals are smaller, on average, than when only the global model is required to be valid. All-in-all, the suggested method provides a data miner or a decision maker with highly informative predictive models of streaming data.

[1]  Raul Fidalgo-Merino,et al.  Self-Adaptive Induction of Regression Trees , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Bernard Zenko,et al.  Speeding-Up Hoeffding-Based Regression Trees With Options , 2011, ICML.

[3]  Geoff Hulten,et al.  Mining high-speed data streams , 2000, KDD '00.

[4]  João Gama,et al.  A survey on concept drift adaptation , 2014, ACM Comput. Surv..

[5]  Jesús Alcalá-Fdez,et al.  KEEL Data-Mining Software Tool: Data Set Repository, Integration of Algorithms and Experimental Analysis Framework , 2011, J. Multiple Valued Log. Soft Comput..

[6]  Vladimir Vovk,et al.  Conformal predictors in early diagnostics of ovarian and breast cancers , 2012, Progress in Artificial Intelligence.

[7]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[8]  Haris Haralambous,et al.  Neural Networks Regression Inductive Conformal Predictor and Its Application to Total Electron Content Prediction , 2010, ICANN.

[9]  Haris Haralambous,et al.  Reliable prediction intervals with regression neural networks , 2011, Neural Networks.

[10]  Tuve Löfström,et al.  Signed-Error Conformal Regression , 2014, PAKDD.

[11]  A. Asuncion,et al.  UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[12]  Joachim Diederich,et al.  Survey and critique of techniques for extracting rules from trained artificial neural networks , 1995, Knowl. Based Syst..

[13]  Harris Papadopoulos,et al.  Inductive Conformal Prediction: Theory and Application to Neural Networks , 2008 .

[14]  Geoff Hulten,et al.  Mining time-changing data streams , 2001, KDD '01.

[15]  Harris Papadopoulos,et al.  Inductive Confidence Machines for Regression , 2002, ECML.

[16]  Geoff Holmes,et al.  New Options for Hoeffding Trees , 2007, Australian Conference on Artificial Intelligence.

[17]  Thomas Seidl,et al.  Adaptive Model Tree for Streaming Data , 2013, 2013 IEEE 13th International Conference on Data Mining.

[18]  Harris Papadopoulos,et al.  Reliable diagnosis of acute abdominal pain with conformal prediction , 2009 .

[19]  Mark Last,et al.  Induction of Mean Output Prediction Trees from Continuous Temporal Meteorological Data , 2009, 2009 IEEE International Conference on Data Mining Workshops.

[20]  Saso Dzeroski,et al.  Learning model trees from evolving data streams , 2010, Data Mining and Knowledge Discovery.

[21]  Alex A. Freitas,et al.  A survey of evolutionary algorithms for data mining and knowledge discovery , 2003 .

[22]  W. Gasarch,et al.  The Book Review Column 1 Coverage Untyped Systems Simple Types Recursive Types Higher-order Systems General Impression 3 Organization, and Contents of the Book , 2022 .

[23]  PietruczukLena,et al.  Decision Trees for Mining Data Streams Based on the McDiarmid's Bound , 2013 .

[24]  Henrik Boström,et al.  Regression conformal prediction with random forests , 2014, Machine Learning.

[25]  Harris Papadopoulos,et al.  Reliable Confidence Intervals for Software Effort Estimation , 2009, AIAI Workshops.

[26]  Abraham Kandel,et al.  Knowledge discovery in data streams with regression tree methods , 2012, WIREs Data Mining Knowl. Discov..

[27]  Alexander Gammerman,et al.  Hedging Predictions in Machine Learning: The Second Computer Journal Lecture , 2006, Comput. J..

[28]  Jude W. Shavlik,et al.  in Advances in Neural Information Processing , 1996 .

[29]  Piotr Duda,et al.  Decision Trees for Mining Data Streams Based on the McDiarmid's Bound , 2013, IEEE Transactions on Knowledge and Data Engineering.