evtree: Evolutionary Learning of Globally Optimal Classification and Regression Trees in R

Commonly used classification and regression tree methods like the CART algorithm are recursive partitioning methods that build the model in a forward stepwise search. Although this approach is known to be an efficient heuristic, the results of recursive tree methods are only locally optimal, as splits are chosen to maximize homogeneity at the next step only. An alternative way to search over the parameter space of trees is to use global optimization methods like evolutionary algorithms. This paper describes the evtree package, which implements an evolutionary algorithm for learning globally optimal classification and regression trees in R. Computationally intensive tasks are implemented entirely in C++, while the partykit package is leveraged for representing the resulting trees in R, providing unified infrastructure for summaries, visualizations, and predictions. evtree is compared to the open-source CART implementation rpart, conditional inference trees (ctree), and the open-source C4.5 implementation J48. In a benchmark study of predictive accuracy and complexity, evtree achieved results that were at least similar to, and in most cases better than, those of rpart, ctree, and J48. Furthermore, the usefulness of evtree in practice is illustrated in a textbook customer classification task.
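
The contrast between a forward stepwise search and a global search can be illustrated with a small, self-contained sketch. It is written in Python for illustration only and is not the evtree implementation: the checkerboard data, the depth-2 tree encoding, the single-split mutation, and all function names are assumptions made for this example. On such data every root split looks equally uninformative one step ahead, so a greedy search has no basis for choosing the right one, whereas a simple evolutionary search that evaluates whole trees can correct the root split afterwards.

```python
import random
from collections import Counter

# Toy checkerboard data (hypothetical): label is 1 exactly when one coordinate is >= 2.
POINTS = [(a, b) for a in range(4) for b in range(4)]
LABELS = [int((a >= 2) != (b >= 2)) for a, b in POINTS]
SPLITS = [(f, t) for f in (0, 1) for t in (1, 2, 3)]  # rule: x[f] < t goes left


def leaf_errors(idx):
    """Misclassified points when a leaf predicts its majority class."""
    if not idx:
        return 0
    counts = Counter(LABELS[i] for i in idx)
    return len(idx) - max(counts.values())


def partition(idx, split):
    """Divide the indices in idx into (left, right) according to one split."""
    f, t = split
    left = [i for i in idx if POINTS[i][f] < t]
    right = [i for i in idx if POINTS[i][f] >= t]
    return left, right


def tree_errors(tree):
    """Errors of a depth-2 tree given as (root, left_child, right_child) splits."""
    root, left_split, right_split = tree
    left, right = partition(range(len(POINTS)), root)
    return sum(leaf_errors(part)
               for side, split in ((left, left_split), (right, right_split))
               for part in partition(side, split))


def best_split(idx):
    """Greedy choice: the split with the fewest errors one step ahead."""
    return min(SPLITS, key=lambda s: sum(leaf_errors(p) for p in partition(idx, s)))


def greedy_tree():
    """Forward stepwise search: fix the root first, then each child separately."""
    root = best_split(range(len(POINTS)))
    left, right = partition(range(len(POINTS)), root)
    return (root, best_split(left), best_split(right))


def evolve(start, generations=300, seed=1):
    """(1+1)-style search: mutate one split, keep the child if it is no worse."""
    rng = random.Random(seed)
    best, best_err = start, tree_errors(start)
    for _ in range(generations):
        child = list(best)
        child[rng.randrange(3)] = rng.choice(SPLITS)  # replace one random split
        child = tuple(child)
        err = tree_errors(child)
        if err <= best_err:
            best, best_err = child, err
    return best, best_err


greedy = greedy_tree()
greedy_err = tree_errors(greedy)
evolved, evolved_err = evolve(greedy)
print("greedy :", greedy, "errors:", greedy_err)
print("evolved:", evolved, "errors:", evolved_err)
```

In this sketch the evolutionary search starts from the greedy tree and only accepts trees that are no worse, so its result can never have more errors than the greedy one. That starting point is a simplification chosen for the example. The fitness here is the plain misclassification count, with no complexity term.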
