[1] T. Bliss, et al. Long‐lasting potentiation of synaptic transmission in the dentate area of the anaesthetized rabbit following stimulation of the perforant path, 1973, The Journal of physiology.
[2] Richard S. Sutton, et al. Neuronlike adaptive elements that can solve difficult learning control problems, 1983, IEEE Transactions on Systems, Man, and Cybernetics.
[3] Geoffrey E. Hinton, et al. Learning representations by back-propagating errors, 1986, Nature.
[4] Terrence J. Sejnowski, et al. Analysis of hidden units in a layered network trained to classify sonar targets, 1988, Neural Networks.
[5] T. Sejnowski, et al. Associative long-term depression in the hippocampus induced by hebbian covariance, 1989, Nature.
[6] M. Poo, et al. Activity-dependent synaptic competition in vitro: heterosynaptic suppression of developing synapses, 1991, Science.
[7] E. Capaldi, et al. The organization of behavior, 1992, Journal of applied behavior analysis.
[8] R. J. Williams, et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning, 2004, Machine Learning.
[9] G. A. Clark, et al. Induction of long-term facilitation in Aplysia sensory neurons by local application of serotonin to remote synapses, 1993, Proceedings of the National Academy of Sciences of the United States of America.
[10] B. Sakmann, et al. Active propagation of somatic action potentials into neocortical pyramidal cell dendrites, 1994, Nature.
[11] Leslie G. Valiant, et al. Circuits of the mind, 1994.
[12] Shigenobu Kobayashi, et al. Reinforcement Learning in POMDPs with Function Approximation, 1997, ICML.
[13] D. Johnston, et al. A Synaptically Controlled, Associative Signal for Hebbian Plasticity in Hippocampal Neurons, 1997, Science.
[14] D. Johnston, et al. Regulation of Synaptic Efficacy by Coincidence of Postsynaptic APs and EPSPs, 1997.
[15] Xi-Ren Cao, et al. Perturbation realization, potentials, and sensitivity analysis of Markov processes, 1997, IEEE Trans. Autom. Control.
[16] Xi-Ren Cao, et al. Algorithms for sensitivity analysis of Markov systems through potentials and perturbation realization, 1998, IEEE Trans. Control. Syst. Technol.
[17] Catherine Blake, et al. UCI Repository of machine learning databases, 1998.
[18] G. Bi, et al. Synaptic Modifications in Cultured Hippocampal Neurons: Dependence on Spike Timing, Synaptic Strength, and Postsynaptic Cell Type, 1998, The Journal of Neuroscience.
[19] John N. Tsitsiklis, et al. Simulation-based optimization of Markov reward processes, 1998, Proceedings of the 37th IEEE Conference on Decision and Control (Cat. No.98CH36171).
[20] Terrence L. Fine, et al. Feedforward Neural Network Methodology, 1999, Information Science and Statistics.
[21] M. Mauk, et al. Simulations of Cerebellar Motor Learning: Computational Analysis of Plasticity at the Mossy Fiber to Deep Nucleus Synapse, 1999, The Journal of Neuroscience.
[22] Peter L. Bartlett, et al. Reinforcement Learning in POMDP's via Direct Gradient Ascent, 2000, ICML.
[23] Peter L. Bartlett, et al. Infinite-Horizon Policy-Gradient Estimation, 2001, J. Artif. Intell. Res.
[24] Peter L. Bartlett, et al. Experiments with Infinite-Horizon, Policy-Gradient Estimation, 2001, J. Artif. Intell. Res.
[25] Richard S. Sutton, et al. Associative search network: A reinforcement learning associative memory, 1981, Biological Cybernetics.
[26] G. Tesauro, et al. Simple neural models of classical conditioning, 1986, Biological Cybernetics.
[27] R. Sutton, et al. Synthesis of nonlinear control surfaces by a layered associative search network, 2004, Biological Cybernetics.
[28] Richard S. Sutton, et al. Landmark learning: An illustration of associative search, 1981, Biological Cybernetics.