Learning Search Space Partition for Black-box Optimization using Monte Carlo Tree Search

High-dimensional black-box optimization has broad applications but remains a challenging problem. Given a set of samples $\{\mathbf{x}_i, y_i\}$, building a global model (as in Bayesian Optimization (BO)) suffers from the curse of dimensionality in high-dimensional search spaces, while a greedy search may lead to sub-optimality. By recursively splitting the search space into regions with high/low function values, recent works like LaNAS show good performance in Neural Architecture Search (NAS), empirically reducing the sample complexity. In this paper, we introduce LA-MCTS, which extends LaNAS to other domains. Unlike previous approaches, LA-MCTS learns the partition of the search space in an online fashion, using a few samples and their function values. Whereas LaNAS uses a linear partition and performs uniform sampling in each region, LA-MCTS adopts a nonlinear decision boundary and learns a local model to pick good candidates. If the nonlinear partition function and the local model fit the ground-truth black-box function well, then good partitions and candidates can be reached with far fewer samples. LA-MCTS serves as a \emph{meta-algorithm} by using existing black-box optimizers (e.g., BO, TuRBO) as its local models, achieving strong performance on general black-box optimization and reinforcement learning benchmarks, in particular for high-dimensional problems.
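To make the partition step concrete, below is a minimal sketch of how the samples at a tree node might be split into high-value and low-value regions and turned into a nonlinear decision boundary. This is an illustrative reconstruction under assumptions, not the authors' released implementation: clustering with k-means over concatenated features and values, the RBF kernel, and the function name `learn_partition` are all choices made here for exposition.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.svm import SVC

def learn_partition(X, y):
    """Split the samples at a tree node into a good and a bad region.

    X: (n, d) array of sampled points; y: (n,) array of function
    values (higher is better). Returns the fitted SVM that acts as
    the region boundary, plus a boolean mask of the samples that
    fall in the good (higher-mean-value) region.
    """
    # Cluster jointly on features and function values so that points
    # with similar (location, value) profiles land in the same region.
    feats = np.hstack([X, y.reshape(-1, 1)])
    labels = KMeans(n_clusters=2, n_init=10).fit_predict(feats)

    # Relabel so that cluster 1 is the "good" side (higher mean value).
    if y[labels == 0].mean() > y[labels == 1].mean():
        labels = 1 - labels

    # A kernel SVM provides the nonlinear boundary used to route new
    # candidate points to one of the two child regions.
    svm = SVC(kernel="rbf", gamma="auto").fit(X, labels)
    return svm, labels == 1

# Toy usage: partition 100 random points scored by a quadratic bowl.
rng = np.random.default_rng(0)
X = rng.uniform(-1.0, 1.0, size=(100, 5))
y = -np.sum(X**2, axis=1)          # maximization target, best at origin
svm, good = learn_partition(X, y)
print(f"good region holds {good.sum()} of {len(X)} samples")
```

In the full algorithm, such splits would be applied recursively to grow the tree, MCTS would select a leaf region via a UCB-style criterion, and a local optimizer such as BO or TuRBO would propose new candidates inside the selected region.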
