暂无分享,去创建一个
Richard S. Sutton | Huizhen Yu | Sina Ghiassian | Banafsheh Rafiee | R. Sutton | Huizhen Yu | Sina Ghiassian | Banafsheh Rafiee
[1] S. N. Balakrishnan,et al. Neurocontrol: A literature survey , 1996 .
[2] James S. Albus,et al. Data Storage in the Cerebellar Model Articulation Controller (CMAC) , 1975 .
[3] Mahesan Niranjan,et al. On-line Q-learning using connectionist systems , 1994 .
[4] Richard S. Sutton,et al. Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding , 1995, NIPS.
[5] Richard S. Sutton,et al. Learning to predict by the methods of temporal differences , 1988, Machine Learning.
[6] Long-Ji Lin,et al. Reinforcement learning for robots using neural networks , 1992 .
[7] Tsuyoshi Murata,et al. {m , 1934, ACML.
[8] Peter Stone,et al. Reinforcement learning , 2019, Scholarpedia.
[9] Wojciech Zaremba,et al. OpenAI Gym , 2016, ArXiv.
[10] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[11] Gerald Tesauro,et al. Temporal Difference Learning and TD-Gammon , 1995, J. Int. Comput. Games Assoc..
[12] Eduardo D. Sontag,et al. Neural Networks for Control , 1993 .
[13] R. French,et al. Catastrophic Forgetting in Connectionist Networks: Causes, Consequences and Solutions , 1994 .
[14] Luca Antiga,et al. Automatic differentiation in PyTorch , 2017 .
[15] Geoffrey E. Hinton,et al. Distributed Representations , 1986, The Philosophy of Artificial Intelligence.
[16] 丸山 徹. Convex Analysisの二,三の進展について , 1977 .
[17] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[18] W. T. Miller,et al. CMAC: an associative neural network alternative to backpropagation , 1990, Proc. IEEE.
[19] Chen-Khong Tham,et al. Modular on-line function approximation for scaling up reinforcement learning , 1994 .
[20] G. G. Stokes. "J." , 1890, The New Yale Book of Quotations.
[21] Hyongsuk Kim,et al. CMAC-based adaptive critic self-learning control , 1991, IEEE Trans. Neural Networks.
[22] R.M. Dunn,et al. Brains, behavior, and robotics , 1983, Proceedings of the IEEE.
[23] Byoung-Tak Zhang,et al. Overcoming Catastrophic Forgetting by Incremental Moment Matching , 2017, NIPS.
[24] Nitakshi Goyal,et al. General Topology-I , 2017 .
[25] P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .
[26] George Cybenko,et al. Approximation by superpositions of a sigmoidal function , 1989, Math. Control. Signals Syst..
[27] Michael McCloskey,et al. Catastrophic Interference in Connectionist Networks: The Sequential Learning Problem , 1989 .
[28] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.