Paper information - Variational Bayesian learning of nonlinear hidden state-space models for model predictive control - 字舞流文

Variational Bayesian learning of nonlinear hidden state-space models for model predictive control

Tapani Raiko | Matti Tornio

[1] Carl E. Rasmussen,et al. Probabilistic Inference for Fast Learning in Control , 2008, EWRL.

[2] Stefan Schaal,et al. Learning to Control in Operational Space , 2008, Int. J. Robotics Res.

[3] Derong Liu,et al. Adaptive approximation based control: Unifying neural, fuzzy and traditional adaptive approximation approaches. Jay A. Farrell and Marios M. Polycarpou, Wiley, New York, 2006. No of pages: 440. ISBN 978-0-471-72788-0 , 2008 .

[4] Stefan Schaal,et al. Policy Learning for Motor Skills , 2007, ICONIP.

[5] Nando de Freitas,et al. Bayesian Policy Learning with Trans-Dimensional MCMC , 2007, NIPS.

[6] Marc Toussaint,et al. Bayesian inference for motion control and planning , 2007 .

[7] Jin Yu,et al. Natural Actor-Critic for Road Traffic Optimisation , 2006, NIPS.

[8] Marc Toussaint,et al. Probabilistic inference for solving discrete and continuous state Markov Decision Processes , 2006, ICML.

[9] J. Farrell,et al. Adaptive Approximation Based Control: Unifying Neural, Fuzzy and Traditional Adaptive Approximation Approaches (Adaptive and Learning Systems for Signal Processing, Communications and Control Series) , 2006 .

[10] Juha Karhunen,et al. State Inference in Variational Bayesian Nonlinear State-Space Models , 2006, ICA.

[11] T. Raiko,et al. Learning nonlinear state-space models for control , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005.

[12] F. Rosenqvist,et al. Realisation and estimation of piecewise-linear output-error models , 2005, Autom.

[13] H. Kappen. Linear theory for control of nonlinear stochastic systems. , 2004, Physical review letters.

[14] Richard S. Sutton,et al. Learning to predict by the methods of temporal differences , 1988, Machine Learning.

[15] Antti Honkela,et al. Unsupervised Variational Bayesian Learning of Nonlinear Models , 2004, NIPS.

[16] J. Kocijan,et al. Gaussian process model based predictive control , 2004, Proceedings of the 2004 American Control Conference.

[17] A. Pacut,et al. Model-free off-policy reinforcement learning in continuous environment , 2004, 2004 IEEE International Joint Conference on Neural Networks.

[18] J. Sjöberg. Neural networks for modelling and control of dynamic systems: M. Nørgaard, O. Ravn, N. K. Poulsen and L. K. Hansen. Springer-Verlag, London Berlin Heidelberg, 2000, pp. xiv+246 , 2004 .

[19] Stefan Schaal,et al. Scalable Techniques from Nonparametric Statistics for Real Time Robot Learning , 2002, Applied Intelligence.

[20] Andrew W. Moore,et al. Locally Weighted Learning for Control , 1997, Artificial Intelligence Review.

[21] J. Kocijan,et al. Predictive control with Gaussian process models , 2003, The IEEE Region 8 EUROCON 2003. Computer as a Tool.

[22] Hagai Attias,et al. Planning by Probabilistic Inference , 2003, AISTATS.

[23] Juha Karhunen,et al. An Unsupervised Ensemble Learning Method for Nonlinear Dynamic State-Space Models , 2002, Neural Computation.

[24] Stefan Schaal,et al. Statistical Learning for Humanoid Robots , 2002, Auton. Robots.

[25] Stephen P. Boyd,et al. Future directions in control in an information-rich world , 2003 .

[26] D K Smith,et al. Numerical Optimization , 2001, J. Oper. Res. Soc.

[27] Karl Johan Åström,et al. Control of complex systems , 2001 .

[28] Alberto Bemporad,et al. Observability and controllability of piecewise affine and hybrid systems , 2000, IEEE Trans. Autom. Control.

[29] David Q. Mayne,et al. Constrained model predictive control: Stability and optimality , 2000, Autom.

[30] Niels Kjølstad Poulsen,et al. Neural Networks for Modelling and Control of Dynamic Systems: A Practitioner’s Handbook , 2000 .

[31] Shigenobu Kobayashi,et al. Efficient Non-Linear Control by Combining Q-learning with Local Linear Controllers , 1999, ICML.

[32] Geoffrey E. Hinton,et al. Using Expectation-Maximization for Reinforcement Learning , 1997, Neural Computation.

[33] Petros G. Voulgaris,et al. On optimal ℓ∞ to ℓ∞ filtering , 1995, Autom.

[34] S. Haykin,et al. Neural Networks: A Comprehensive Foundation , 1994 .

[35] Christopher G. Atkeson,et al. Using Local Trajectory Optimizers to Speed Up Global Optimization in Dynamic Programming , 1993, NIPS.

[36] Donald A. Sofge,et al. Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches , 1992 .

[37] Sebastian Thrun,et al. The role of exploration in learning control , 1992 .

[38] M. Mariton,et al. Control of complex systems , 1991 .

[39] Andrew W. Moore,et al. Acquisition of Dynamic Control Knowledge for a Robotic Manipulator , 1990, ML.

[40] Karl Johan Åström,et al. Adaptive Control , 1989, Embedded Digital Control with Microcontrollers.

[41] Y. Bar-Shalom. Stochastic dynamic programming: Caution and probing , 1981 .