Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images
暂无分享,去创建一个
Martin A. Riedmiller | Jost Tobias Springenberg | Joschka Boedecker | Manuel Watter | Manuel Watter | J. Boedecker | J. T. Springenberg
[1] David Q. Mayne,et al. Differential dynamic programming , 1972, The Mathematical Gazette.
[2] Robert F. Stengel,et al. Optimal Control and Estimation , 1994 .
[3] Kazuo Tanaka,et al. An approach to fuzzy control of nonlinear systems: stability and design issues , 1996, IEEE Trans. Fuzzy Syst..
[4] Michael A. West,et al. Bayesian Forecasting and Dynamic Models (2nd edn) , 1997, J. Oper. Res. Soc..
[5] Jun S. Liu,et al. Mixture Kalman filters , 2000 .
[6] Michail G. Lagoudakis,et al. Least-Squares Policy Iteration , 2003, J. Mach. Learn. Res..
[7] Emanuel Todorov,et al. Iterative Linear Quadratic Regulator Design for Nonlinear Biological Movement Systems , 2004, ICINCO.
[8] Michael I. Jordan,et al. An Introduction to Variational Methods for Graphical Models , 1999, Machine Learning.
[9] E. Todorov,et al. A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems , 2005, Proceedings of the 2005, American Control Conference, 2005..
[10] William D. Smart,et al. Receding Horizon Differential Dynamic Programming , 2007, NIPS.
[11] Marc Toussaint,et al. Robot trajectory optimization using approximate inference , 2009, ICML '09.
[12] John Langford,et al. Learning nonlinear dynamic models , 2009, ICML '09.
[13] David J. Fleet,et al. Dynamical binary latent variable models for 3D human pose tracking , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[14] Martin A. Riedmiller,et al. Deep auto-encoder neural networks in reinforcement learning , 2010, The 2010 International Joint Conference on Neural Networks (IJCNN).
[15] Graham W. Taylor,et al. Deconvolutional networks , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[16] Robert A. Legenstein,et al. Reinforcement Learning on Slow Features of High-Dimensional Input Streams , 2010, PLoS Comput. Biol..
[17] Geoffrey E. Hinton,et al. Transforming Auto-Encoders , 2011, ICANN.
[18] Yoshua Bengio,et al. Deep Sparse Rectifier Neural Networks , 2011, AISTATS.
[19] Eduardo F. Morales,et al. An Introduction to Reinforcement Learning , 2011 .
[20] Will Y. Zou. Unsupervised learning of visual invariance with temporal coherence , 2011 .
[21] Sergey Levine,et al. Variational Policy Search via Trajectory Optimization , 2013, NIPS.
[22] Roland Memisevic,et al. Learning to Relate Images , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[23] Christian Osendorfer,et al. Image Super-Resolution with Fast Approximate Convolutional Sparse Coding , 2014, ICONIP.
[24] Daan Wierstra,et al. Stochastic Backpropagation and Approximate Inference in Deep Generative Models , 2014, ICML.
[25] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.
[26] Daan Wierstra,et al. Deep AutoRegressive Networks , 2013, ICML.
[27] Yunpeng Pan,et al. Probabilistic Differential Dynamic Programming , 2014, NIPS.
[28] Oliver Brock,et al. State Representation Learning in Robotics: Using Prior Knowledge about Physical Interaction , 2014, Robotics: Science and Systems.
[29] Surya Ganguli,et al. Exact solutions to the nonlinear dynamics of learning in deep linear neural networks , 2013, ICLR.
[30] Christian Osendorfer,et al. Learning Stochastic Recurrent Networks , 2014, NIPS 2014.
[31] Takamitsu Matsubara,et al. Latent Kullback Leibler Control for Continuous-State Systems using Probabilistic Graphical Models , 2014, UAI.
[32] Martin A. Riedmiller,et al. Approximate real-time optimal control based on sparse Gaussian process models , 2014, 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL).
[33] Jan Peters,et al. Learning of Non-Parametric Control Policies with High-Dimensional State Features , 2015, AISTATS.
[34] Max Welling,et al. Transformation Properties of Learned Visual Representations , 2014, ICLR.
[35] Thomas B. Schön,et al. From Pixels to Torques: Policy Learning with Deep Dynamical Models , 2015, ICML 2015.
[36] Joshua B. Tenenbaum,et al. Deep Convolutional Inverse Graphics Network , 2015, NIPS.
[37] Klaus Obermayer,et al. Autonomous Learning of State Representations for Control: An Emerging Field Aims to Autonomously Learn State Representations for Reinforcement Learning Agents from Their Real-World Sensor Observations , 2015, KI - Künstliche Intelligenz.
[38] Alex Graves,et al. DRAW: A Recurrent Neural Network For Image Generation , 2015, ICML.
[39] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[40] Thomas Brox,et al. Learning to generate chairs with convolutional neural networks , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[41] Yoshua Bengio,et al. NICE: Non-linear Independent Components Estimation , 2014, ICLR.
[42] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[43] Karin Ackermann,et al. Bayesian Forecasting And Dynamic Models Springer Series In Statistics , 2016 .
[44] Sergey Levine,et al. End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..
[45] DarrellTrevor,et al. End-to-end training of deep visuomotor policies , 2016 .