Gaussian process dynamic programming
