Bayesian decomposition of multi-modal dynamical systems for reinforcement learning
暂无分享,去创建一个
Thomas A. Runkler | Carl Henrik Ek | Clemens Otte | Markus Kaiser | T. Runkler | C. Ek | Markus Kaiser | Clemens Otte
[1] Martin A. Riedmiller,et al. Batch Reinforcement Learning , 2012, Reinforcement Learning.
[2] Yee Whye Teh,et al. The Concrete Distribution: A Continuous Relaxation of Discrete Random Variables , 2016, ICLR.
[3] Shai Ben-David,et al. Understanding Machine Learning: From Theory to Algorithms , 2014 .
[4] Finale Doshi-Velez,et al. Decomposition of Uncertainty in Bayesian Deep Learning for Efficient and Risk-sensitive Learning , 2017, ICML.
[5] Neil D. Lawrence,et al. Overlapping Mixtures of Gaussian Processes for the Data Association Problem , 2011, Pattern Recognit..
[6] Carl Henrik Ek,et al. Data Association with Gaussian Processes , 2018, ECML/PKDD.
[7] C. Bishop. Mixture density networks , 1994 .
[8] Martin A. Riedmiller. Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method , 2005, ECML.
[9] Neil D. Lawrence,et al. Deep Gaussian Processes , 2012, AISTATS.
[10] Carl E. Rasmussen,et al. PILCO: A Model-Based and Data-Efficient Approach to Policy Search , 2011, ICML.
[11] Steffen Udluft,et al. Efficient Uncertainty Propagation for Reinforcement Learning with Limited Data , 2009, ICANN.
[12] Finale Doshi-Velez,et al. Learning and Policy Search in Stochastic Dynamical Systems with Bayesian Neural Networks , 2016, ICLR.
[13] Tom Minka,et al. Expectation Propagation for approximate Bayesian inference , 2001, UAI.
[14] James Hensman,et al. Scalable Variational Gaussian Process Classification , 2014, AISTATS.
[15] Y. Bar-Shalom. Tracking and data association , 1988 .
[16] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[17] Carl Henrik Ek,et al. Latent Gaussian Process Regression , 2017, ArXiv.
[18] Ingemar J. Cox,et al. A review of statistical data association techniques for motion correspondence , 1993, International Journal of Computer Vision.