Bayesian Optimization of Hyperparameters when the Marginal Likelihood is Estimated by MCMC

Bayesian models often involve a small set of hyperparameters determined by maximizing the marginal likelihood. Bayesian optimization is a popular iterative method where a Gaussian process posterior of the underlying function is sequentially updated by new function evaluations. An acquisition strategy uses this posterior distribution to decide where to place the next function evaluation. We propose a novel Bayesian optimization framework for situations where the user controls the computational effort, and therefore the precision of the function evaluations. This is a common situation in econometrics where the marginal likelihood is often computed by Markov Chain Monte Carlo (MCMC) methods, with the precision of the marginal likelihood estimate determined by the number of MCMC draws. The proposed acquisition strategy gives the optimizer the option to explore the function with cheap noisy evaluations and therefore finds the optimum faster. Prior hyperparameter estimation in the steady-state Bayesian vector autoregressive (BVAR) model on US macroeconomic time series data is used for illustration. The proposed method is shown to find the optimum much quicker than traditional Bayesian optimization or grid search.

[1]  Mattias Villani,et al.  Bayesian Analysis of DSGE Models—Some Comments , 2007 .

[2]  Barry G. Quinn,et al.  Order Determination for a Multivariate Autoregression , 1980 .

[3]  Jasper Snoek,et al.  Practical Bayesian Optimization of Machine Learning Algorithms , 2012, NIPS.

[4]  Sveriges Riksbank STEADY STATE PRIORS FOR VECTOR AUTOREGRESSIONS , 2008 .

[5]  Alistair Dieppe,et al.  The BEAR Toolbox , 2016, SSRN Electronic Journal.

[6]  J. Geweke,et al.  Smoothly mixing regressions , 2007 .

[7]  Pär Österholm,et al.  The limited usefulness of macroeconomic Bayesian VARs when estimating the probability of a US recession , 2012 .

[8]  Michele Lenza,et al.  Prior Selection for Vector Autoregressions , 2012, Review of Economics and Statistics.

[9]  W. Newey,et al.  Hypothesis Testing with Efficient Method of Moments Estimation , 1987 .

[10]  S. Chib,et al.  Marginal Likelihood From the Metropolis–Hastings Output , 2001 .

[11]  Dimitrios P. Louzis Steady‐state modeling and macroeconomic forecasting quality , 2018, Journal of Applied Econometrics.

[12]  David J. Nott,et al.  Generalized smooth finite mixtures , 2012 .

[13]  G. Kapetanios,et al.  Forecasting Government Bond Yields with Large Bayesian Vars , 2010 .

[14]  D. Giannone,et al.  Large Bayesian vector auto regressions , 2010 .

[15]  Peter Gustafsson,et al.  Macroeconomic effects of a decline in housing prices in Sweden , 2016 .

[16]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[17]  John Geweke,et al.  Federal Reserve Bank of Minneapolis Research Department Staff Report 249 Using Simulation Methods for Bayesian Econometric Models: Inference, Development, and Communication , 2022 .

[18]  Pär Österholm,et al.  The Impact of US Uncertainty Shocks on Small Open Economies , 2017 .

[19]  S. Chib Marginal Likelihood from the Gibbs Output , 1995 .

[20]  Frank Schorfheide,et al.  Bayesian Analysis of DSGE Models , 2005 .

[21]  Nando de Freitas,et al.  A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning , 2010, ArXiv.

[22]  Robert B. Litterman,et al.  Forecasting and Conditional Projection Using Realistic Prior Distributions , 1983 .

[23]  B. G. Quinn,et al.  The determination of the order of an autoregression , 1979 .

[24]  Sune Karlsson Forecasting with Bayesian Vector Autoregression , 2013 .

[25]  Hugh A. Chipman,et al.  GPfit: An R Package for Fitting a Gaussian Process Model to Deterministic Simulator Outputs , 2013, 1305.0759.