Toward Optimal Run Racing: Application to Deep Learning Calibration

This paper addresses one-shot learning of deep neural nets in a highly parallel setting, tackling the algorithm calibration problem: selecting the neural architecture and learning hyper-parameter values best suited to the dataset at hand. The notoriously expensive calibration cost is optimally reduced by detecting and early stopping non-optimal runs. The theoretical contribution establishes optimality guarantees within the multiple hypothesis testing framework. Experiments on the CIFAR-10, PTB and Wiki benchmarks demonstrate the relevance of the approach, with a principled and consistent improvement on the state of the art and no extra hyper-parameter.
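To make the racing idea concrete, below is a minimal, illustrative sketch of a generic run-racing loop that early-stops configurations whose validation performance is significantly worse than the current best. The test (a Hoeffding-style margin), the Bonferroni split of the significance level, the update schedule, and the helper names (`race`, `train_one_epoch`) are assumptions for illustration only, not the exact procedure or guarantees derived in the paper.

```python
# Illustrative sketch: race several hyper-parameter configurations and
# early-stop those that are significantly behind the current best.
import math

def significantly_worse(best_scores, cand_scores, alpha):
    """Hypothetical test proxy: compare recent mean validation scores with a
    Hoeffding-style margin at level alpha (scores assumed to lie in [0, 1])."""
    n = min(len(best_scores), len(cand_scores))
    if n == 0:
        return False
    margin = math.sqrt(math.log(2.0 / alpha) / (2.0 * n))
    best_mean = sum(best_scores[-n:]) / n
    cand_mean = sum(cand_scores[-n:]) / n
    return cand_mean + margin < best_mean - margin

def race(configs, train_one_epoch, n_epochs, alpha=0.05):
    """Train all configurations (conceptually in parallel) and prune losers.
    alpha is split across candidates (Bonferroni) so that the overall risk of
    discarding the truly best run stays bounded."""
    alive = {c: [] for c in configs}            # config -> validation scores
    per_test_alpha = alpha / max(1, len(configs))
    for epoch in range(n_epochs):
        for cfg in list(alive):
            # train_one_epoch is user-supplied; it returns a validation score.
            alive[cfg].append(train_one_epoch(cfg, epoch))
        best = max(alive, key=lambda c: alive[c][-1])
        for cfg in list(alive):
            if cfg is not best and significantly_worse(alive[best], alive[cfg], per_test_alpha):
                del alive[cfg]                  # early-stop a losing run
    return max(alive, key=lambda c: alive[c][-1])
```

The Bonferroni split of `alpha` stands in for the multiple hypothesis testing correction mentioned in the abstract; the paper's contribution is a principled choice of this correction with optimality guarantees, which the sketch does not reproduce.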
