Variational Bayesian learning of nonlinear hidden state-space models for model predictive control

[1]  Carl E. Rasmussen,et al.  Probabilistic Inference for Fast Learning in Control , 2008, EWRL.

[2]  Stefan Schaal,et al.  Learning to Control in Operational Space , 2008, Int. J. Robotics Res.

[3]  Derong Liu,et al.  Adaptive approximation based control: Unifying neural, fuzzy and traditional adaptive approximation approaches. Jay A. Farrell and Marios M. Polycarpou, Wiley, New York, 2006. No. of pages: 440. ISBN 978-0-471-72788-0 , 2008 .

[4]  Stefan Schaal,et al.  Policy Learning for Motor Skills , 2007, ICONIP.

[5]  Nando de Freitas,et al.  Bayesian Policy Learning with Trans-Dimensional MCMC , 2007, NIPS.

[6]  Marc Toussaint,et al.  Bayesian inference for motion control and planning , 2007 .

[7]  Jin Yu,et al.  Natural Actor-Critic for Road Traffic Optimisation , 2006, NIPS.

[8]  Marc Toussaint,et al.  Probabilistic inference for solving discrete and continuous state Markov Decision Processes , 2006, ICML.

[9]  J. Farrell,et al.  Adaptive Approximation Based Control: Unifying Neural, Fuzzy and Traditional Adaptive Approximation Approaches (Adaptive and Learning Systems for Signal Processing, Communications and Control Series) , 2006 .

[10]  Juha Karhunen,et al.  State Inference in Variational Bayesian Nonlinear State-Space Models , 2006, ICA.

[11]  T. Raiko,et al.  Learning nonlinear state-space models for control , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005.

[12]  F. Rosenqvist,et al.  Realisation and estimation of piecewise-linear output-error models , 2005, Autom.

[13]  H. Kappen.  Linear theory for control of nonlinear stochastic systems , 2004, Physical Review Letters.

[14]  Richard S. Sutton,et al.  Learning to predict by the methods of temporal differences , 1988, Machine Learning.

[15]  Antti Honkela,et al.  Unsupervised Variational Bayesian Learning of Nonlinear Models , 2004, NIPS.

[16]  J. Kocijan,et al.  Gaussian process model based predictive control , 2004, Proceedings of the 2004 American Control Conference.

[17]  A. Pacut,et al.  Model-free off-policy reinforcement learning in continuous environment , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[18]  J. Sjöberg.  Neural networks for modelling and control of dynamic systems: M. Nørgaard, O. Ravn, N. K. Poulsen and L. K. Hansen. Springer-Verlag, London Berlin Heidelberg, 2000, pp. xiv+246 , 2004 .

[19]  Stefan Schaal,et al.  Scalable Techniques from Nonparametric Statistics for Real Time Robot Learning , 2002, Applied Intelligence.

[20]  Andrew W. Moore,et al.  Locally Weighted Learning for Control , 1997, Artificial Intelligence Review.

[21]  J. Kocijan,et al.  Predictive control with Gaussian process models , 2003, The IEEE Region 8 EUROCON 2003. Computer as a Tool.

[22]  Hagai Attias,et al.  Planning by Probabilistic Inference , 2003, AISTATS.

[23]  Juha Karhunen,et al.  An Unsupervised Ensemble Learning Method for Nonlinear Dynamic State-Space Models , 2002, Neural Computation.

[24]  Stefan Schaal,et al.  Statistical Learning for Humanoid Robots , 2002, Auton. Robots.

[25]  Stephen P. Boyd,et al.  Future directions in control in an information-rich world , 2003 .

[26]  D K Smith,et al.  Numerical Optimization , 2001, J. Oper. Res. Soc.

[27]  Karl Johan Åström,et al.  Control of complex systems , 2001 .

[28]  Alberto Bemporad,et al.  Observability and controllability of piecewise affine and hybrid systems , 2000, IEEE Trans. Autom. Control.

[29]  David Q. Mayne,et al.  Constrained model predictive control: Stability and optimality , 2000, Autom.

[30]  Niels Kjølstad Poulsen,et al.  Neural Networks for Modelling and Control of Dynamic Systems: A Practitioner’s Handbook , 2000 .

[31]  Shigenobu Kobayashi,et al.  Efficient Non-Linear Control by Combining Q-learning with Local Linear Controllers , 1999, ICML.

[32]  Geoffrey E. Hinton,et al.  Using Expectation-Maximization for Reinforcement Learning , 1997, Neural Computation.

[33]  Petros G. Voulgaris,et al.  On optimal ℓ∞ to ℓ∞ filtering , 1995, Autom.

[34]  S. Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1994 .

[35]  Christopher G. Atkeson,et al.  Using Local Trajectory Optimizers to Speed Up Global Optimization in Dynamic Programming , 1993, NIPS.

[36]  Donald A. Sofge,et al.  Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches , 1992 .

[37]  Sebastian Thrun,et al.  The role of exploration in learning control , 1992 .

[38]  M. Mariton,et al.  Control of complex systems , 1991 .

[39]  Andrew W. Moore,et al.  Acquisition of Dynamic Control Knowledge for a Robotic Manipulator , 1990, ML.

[40]  Karl Johan Åström,et al.  Adaptive Control , 1989, Embedded Digital Control with Microcontrollers.

[41]  Y. Bar-Shalom.  Stochastic dynamic programming: Caution and probing , 1981 .