The Markov Decision Process Extraction Network
暂无分享,去创建一个
[1] Geoffrey E. Hinton. Reducing the Dimensionality of Data with Neural , 2008 .
[2] Thomas Martinetz,et al. Neural Rewards Regression for near-optimal policy identification in Markovian and partial observable environments , 2007, ESANN.
[3] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[4] Steffen Udluft,et al. Solving Partially Observable Reinforcement Learning Problems with Recurrent Neural Networks , 2012, Neural Networks: Tricks of the Trade.
[5] Martin A. Riedmiller. Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method , 2005, ECML.
[6] Steffen Udluft,et al. The Recurrent Control Neural Network , 2007, ESANN.
[7] Geoffrey E. Hinton,et al. Reducing the Dimensionality of Data with Neural Networks , 2006, Science.
[8] Steffen Udluft,et al. A Neural Reinforcement Learning Approach to Gas Turbine Control , 2007, 2007 International Joint Conference on Neural Networks.
[9] Daniel Schneegaß,et al. Steigerung der Informationseffizienz im Reinforcement-Learning , 2008 .