Efficient Benchmarking of Algorithm Configuration Procedures via Model-Based Surrogates

The optimization of algorithm (hyper-)parameters is crucial for achieving peak performance in domains ranging from deep neural networks to solvers for hard combinatorial problems. The resulting algorithm configuration (AC) problem has attracted much attention from the machine learning community. However, the proper evaluation of new AC procedures is hindered by two key hurdles. First, AC benchmarks are hard to set up. Second, and even more significantly, they are computationally expensive: a single run of an AC procedure involves many costly runs of the target algorithm whose performance is to be optimized in a given AC benchmark scenario. One common workaround is to optimize cheap-to-evaluate artificial benchmark functions (e.g., Branin) instead of actual algorithms; however, these have different properties than realistic AC problems. Here, we propose an alternative benchmarking approach that is similarly cheap to evaluate but much closer to the original AC problem: replacing expensive AC benchmarks with surrogate benchmarks constructed from them. These surrogate benchmarks use a regression model to approximate the response surface of true target algorithm performance, and the original and surrogate benchmarks share the same (hyper-)parameter space. In our experiments, we construct and evaluate surrogate benchmarks for hyperparameter optimization as well as for AC problems that involve optimizing the performance of solvers for hard combinatorial problems, drawing training data from the runs of existing AC procedures. We show that our surrogate benchmarks capture important overall characteristics of the AC scenarios from which they were derived, such as high- and low-performing regions, while being much easier to use and orders of magnitude cheaper to evaluate.
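
To illustrate the core mechanism, the sketch below builds a toy surrogate benchmark: a regression model (here a scikit-learn random forest, one plausible model family for this task) is fit to (configuration, performance) pairs and then exposed as a cheap objective over the same parameter space. The synthetic training data and the `surrogate_benchmark` helper are illustrative assumptions, not the paper's actual pipeline.

```python
# A minimal sketch of the surrogate-benchmark idea, under stated assumptions.
# All data here is synthetic: in practice, the training pairs would come from
# (configuration, performance) observations logged during real AC runs.
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)

# Pretend these are 500 sampled configurations (points in a 4-dimensional
# parameter space) and the measured target-algorithm performance for each,
# here encoded as fake log-runtimes.
X_train = rng.uniform(0.0, 1.0, size=(500, 4))
y_train = np.log10(1.0 + 100.0 * (X_train ** 2).sum(axis=1))

# Fit a regression model of the response surface; the surrogate benchmark
# plays exactly this role (a random forest is one reasonable choice).
surrogate = RandomForestRegressor(n_estimators=100, random_state=0)
surrogate.fit(X_train, y_train)

def surrogate_benchmark(config: np.ndarray) -> float:
    """Cheap stand-in for an expensive target-algorithm run: predicts
    performance for `config` instead of actually executing the solver."""
    return float(surrogate.predict(config.reshape(1, -1))[0])

# An AC procedure can now be evaluated against `surrogate_benchmark` over the
# same 4-dimensional parameter space, at a negligible fraction of the cost.
print(surrogate_benchmark(rng.uniform(0.0, 1.0, size=4)))
```

Because model predictions take microseconds while real target-algorithm runs can take minutes or hours, repeated benchmarking of AC procedures against such a surrogate is orders of magnitude cheaper than against the original scenario.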
