Comprehensive analysis of gradient-based hyperparameter optimization algorithms