Continuous optimization of hyper-parameters
暂无分享,去创建一个
[1] H. Akaike. A new look at the statistical model identification , 1974 .
[2] G. Wahba. Smoothing noisy data with spline functions , 1975 .
[3] A. N. Tikhonov,et al. Solutions of ill-posed problems , 1977 .
[4] Peter Craven,et al. Smoothing noisy data with spline functions , 1978 .
[5] Yann LeCun,et al. Improving the convergence of back-propagation learning with second-order methods , 1989 .
[6] Yann LeCun,et al. Optimal Brain Damage , 1989, NIPS.
[7] Chris Bishop,et al. Exact Calculation of the Hessian Matrix for the Multilayer Perceptron , 1992, Neural Computation.
[8] S. P. Smith. Differentiation of the Cholesky Algorithm , 1995 .
[9] Jorma Rissanen,et al. Stochastic Complexity in Statistical Inquiry , 1989, World Scientific Series in Computer Science.
[10] David J. C. MacKay,et al. Bayesian methods for supervised neural networks , 1998 .
[11] Dirk Husmeier. Automatic Relevance Determination (ARD) , 1999 .
[12] Yoshua Bengio,et al. Learning Simple Non Stationarities with Hyper Parameters , 1999 .