Multi-Dimensional Deep Memory Go-Player for Parameter Exploring Policy Gradients
暂无分享,去创建一个
[1] Hans-Paul Schwefel,et al. Evolution and optimum seeking , 1995, Sixth-generation computer technology series.
[2] Nikolaus Hansen,et al. Completely Derandomized Self-Adaptation in Evolution Strategies , 2001, Evolutionary Computation.
[3] Nir Oren,et al. Evolving Neural Networks for the Capture Game , 2002 .
[4] Risto Miikkulainen,et al. Evolving a Roving Eye for Go , 2004, GECCO.
[5] Holger Ulmer,et al. JavaEvA : a Java based framework for Evolutionary Algorithms , 2005 .
[6] Bruno Bouzy,et al. Monte-Carlo Go Reinforcement Learning Experiments , 2006, 2006 IEEE Symposium on Computational Intelligence and Games.
[7] Lin Wu,et al. A Scalable Machine Learning Approach to Go , 2006, NIPS.
[8] David Silver,et al. Combining online and offline knowledge in UCT , 2007, ICML '07.
[9] Marcus Liwicki,et al. A novel approach to on-line handwriting recognition based on bidirectional long short-term memory networks , 2007 .
[10] Jürgen Schmidhuber,et al. Multi-dimensional Recurrent Neural Networks , 2007, ICANN.
[11] Tom Schaul,et al. A scalable neural network architecture for board games , 2008, 2008 IEEE Symposium On Computational Intelligence and Games.
[12] Frank Sehnke,et al. Policy Gradients with Parameter-Based Exploration for Control , 2008, ICANN.
[13] Alex Graves,et al. Supervised Sequence Labelling with Recurrent Neural Networks , 2012, Studies in Computational Intelligence.
[14] Tom Schaul,et al. Scalable Neural Networks for Board Games , 2009, ICANN.
[15] Tom Schaul,et al. Exploring parameter space in reinforcement learning , 2010, Paladyn J. Behav. Robotics.
[16] Frank Sehnke,et al. Parameter-exploring policy gradients , 2010, Neural Networks.