Weighted Sampling for Combined Model Selection and Hyperparameter Tuning

The combined algorithm selection and hyperparameter tuning (CASH) problem is characterized by large hierarchical hyperparameter spaces. Model-free hyperparameter tuning methods can explore such large spaces efficiently because they are highly parallelizable across multiple machines. When no prior knowledge or meta-data is available to boost their performance, these methods commonly sample random configurations from a uniform distribution. In this work, we propose a novel sampling distribution as an alternative to uniform sampling and prove theoretically that it has a better chance of finding the best configuration in a worst-case setting. To compare competing methods rigorously in an experimental setting, one must perform statistical hypothesis testing. We show that there is little to no agreement in the automated machine learning literature regarding which statistical tests should be used, contrast this disparity with the methods recommended by the broader statistics literature, and identify a suitable approach. We then select three popular model-free solutions to CASH and evaluate their performance, both under uniform sampling and under the proposed sampling scheme, across 67 datasets from the OpenML platform. We investigate the trade-off between exploration and exploitation across the three algorithms, and verify empirically that the proposed sampling distribution improves performance in all cases.
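For illustration, the sketch below contrasts uniform sampling over a hierarchical CASH space (the baseline used by standard random search) with a weighted top-level distribution over models. The two-level search space, the candidate models, their hyperparameter ranges, and the weights are hypothetical placeholders; the paper's actual sampling distribution and its worst-case guarantee are not reproduced here.

```python
import random

# Hypothetical two-level CASH search space: first choose a model (algorithm),
# then sample that model's own hyperparameters. The models, ranges, and
# weights below are illustrative placeholders, not the paper's actual space.
SEARCH_SPACE = {
    "random_forest": {
        "n_estimators": lambda: random.randint(10, 500),
        "max_depth": lambda: random.randint(2, 20),
    },
    "svm": {
        "C": lambda: 10 ** random.uniform(-3, 3),
        "gamma": lambda: 10 ** random.uniform(-4, 1),
    },
    "knn": {
        "n_neighbors": lambda: random.randint(1, 50),
    },
}

def sample_configuration(model_weights=None):
    """Draw one configuration from the hierarchical space.

    With model_weights=None the top level is uniform over models; otherwise
    models are drawn in proportion to the given (non-uniform) weights,
    mimicking the idea of biasing the search over the hierarchical space.
    """
    models = list(SEARCH_SPACE)
    if model_weights is None:
        model = random.choice(models)  # uniform baseline
    else:
        weights = [model_weights[m] for m in models]
        model = random.choices(models, weights=weights, k=1)[0]
    params = {name: draw() for name, draw in SEARCH_SPACE[model].items()}
    return model, params

# Uniform sampling, as in standard random search:
print(sample_configuration())
# Weighted sampling with hypothetical, non-uniform model weights:
print(sample_configuration({"random_forest": 1.0, "svm": 2.0, "knn": 3.0}))
```

Each draw returns a (model, hyperparameters) pair; a model-free tuner would evaluate many such configurations in parallel and keep the best one found.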
