Adaptive history compression for learning to divide and conquer
暂无分享,去创建一个
[1] J. Urgen Schmidhuber,et al. Neural sequence chunkers , 1991, Forschungsberichte, TU Munich.
[2] Jing Peng,et al. An Efficient Gradient-Based Algorithm for On-Line Training of Recurrent Network Trajectories , 1990, Neural Computation.
[3] Jürgen Schmidhuber,et al. Learning to generate sub-goals for action sequences , 1991 .
[4] Ronald J. Williams,et al. Experimental Analysis of the Real-time Recurrent Learning Algorithm , 1989 .
[5] J. Urgen Schmidhuber. Adaptive Decomposition Of Time , 1991 .
[6] Jürgen Schmidhuber,et al. Reinforcement Learning in Markovian and Non-Markovian Environments , 1990, NIPS.