Bayesian treed response surface models

Tree‐based regression and classification, popularized in the 1980s with the advent of the classification and regression trees (CART) has seen a recent resurgence in popularity alongside a boom in modern computing power. The new methodologies take advantage of simulation‐based inference, and ensemble methods, to produce higher fidelity response surfaces with competitive out‐of‐sample predictive performance while retaining many of the attractive features of classic trees: thrifty divide‐and‐conquer nonparametric inference, variable selection and sensitivity analysis, and nonstationary modeling features. In this paper, we review recent advances in Bayesian modeling for trees, from simple Bayesian CART models, treed Gaussian process, sequential inference via dynamic trees, to ensemble modeling via Bayesian additive regression trees (BART). We outline open source R packages supporting these methods and illustrate their use.

[1]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[2]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[3]  H. Chipman,et al.  BART: Bayesian Additive Regression Trees , 2008, 0806.3286.

[4]  Herbert K. H. Lee,et al.  Bayesian Guided Pattern Search for Robust Local Optimization , 2009, Technometrics.

[5]  Stefan M. Wild,et al.  Variable selection and sensitivity analysis using dynamic trees, with an application to computer code performance tuning , 2011, 1108.4739.

[6]  Robert B. Gramacy,et al.  Ja n 20 08 Bayesian Treed Gaussian Process Models with an Application to Computer Modeling , 2009 .

[7]  Nicholas G. Polson,et al.  Particle Learning and Smoothing , 2010, 1011.1098.

[8]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[9]  H. Chipman,et al.  Bayesian CART Model Search , 1998 .

[10]  Christoforos Anagnostopoulos,et al.  Dynamic trees for streaming and massive data contexts , 2012, 1201.5568.

[11]  Robert B. Gramacy,et al.  Gaussian processes and limiting linear models , 2008, Comput. Stat. Data Anal..

[12]  Sébastien Le Digabel,et al.  The mesh adaptive direct search algorithm with treed Gaussian process surrogates , 2011 .

[13]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[14]  Robert B. Gramacy,et al.  Adaptive Design and Analysis of Supercomputer Experiments , 2008, Technometrics.

[15]  Robert B. Gramacy,et al.  Dynamic Trees for Learning and Design , 2009, 0912.1586.

[16]  J. Morgan,et al.  Problems in the Analysis of Survey Data, and a Proposal , 1963 .

[17]  Edward I. George,et al.  Bayesian Treed Models , 2002, Machine Learning.

[18]  H. Chipman,et al.  Bayesian Additive Regression Trees , 2006 .