论文信息 - Scalable Neural Networks for Board Games

Scalable Neural Networks for Board Games

Learning to solve small instances of a problem should help in solving large instances. Unfortunately, most neural network architectures do not exhibit this form of scalability. Our Multi-Dimensional Recurrent LSTM Networks, however, show a high degree of scalability, as we empirically show in the domain of flexible-size board games. This allows them to be trained from scratch up to the level of human beginners, without using domain knowledge.

Tom Schaul | Jürgen Schmidhuber

[1] Lin Wu,et al. A Scalable Machine Learning Approach to Go , 2006, NIPS.

[2] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[3] Yoshua Bengio,et al. Convolutional networks for images, speech, and time series , 1998 .

[4] H. Jaap van den Herik,et al. Solving Go on Small Boards , 2003, J. Int. Comput. Games Assoc..

[5] Andrew G. Barto,et al. Reinforcement learning , 1998 .

[6] Nikolaus Hansen,et al. Completely Derandomized Self-Adaptation in Evolution Strategies , 2001, Evolutionary Computation.

[7] Risto Miikkulainen,et al. Evolving Neural Networks to Play Go , 2004, Applied Intelligence.

[8] Alex Graves,et al. Supervised Sequence Labelling with Recurrent Neural Networks , 2012, Studies in Computational Intelligence.

[9] Riccardo Poli,et al. Genetic and Evolutionary Computation – GECCO 2004 , 2004, Lecture Notes in Computer Science.

[10] Tom Schaul,et al. A scalable neural network architecture for board games , 2008, 2008 IEEE Symposium On Computational Intelligence and Games.

[11] Kenneth O. Stanley,et al. Generating large-scale neural networks through discovering geometric regularities , 2007, GECCO '07.

[12] X. Pang,et al. Neural network design for J function approximation in dynamic programming , 1998, adap-org/9806001.

[13] Simon M. Lucas,et al. Coevolution versus self-play temporal difference learning for acquiring position evaluation in small-board go , 2005, IEEE Transactions on Evolutionary Computation.

[14] Terrence J. Sejnowski,et al. Temporal Difference Learning of Position Evaluation in the Game of Go , 1993, NIPS.

[15] Richard S. Sutton,et al. Reinforcement Learning of Local Shape in the Game of Go , 2007, IJCAI.

[16] James Foulds. Learning to Play the Game of Go , 2006 .

[17] Jürgen Schmidhuber,et al. Multidimensional Recurrent Neural Networks , 2007 .

[18] T. Munich,et al. Offline Handwriting Recognition with Multidimensional Recurrent Neural Networks , 2008, NIPS.

[19] Risto Miikkulainen,et al. Evolving a Roving Eye for Go , 2004, GECCO.

[20] Jürgen Schmidhuber,et al. Multi-dimensional Recurrent Neural Networks , 2007, ICANN.

[21] Risto Miikkulainen,et al. Incremental Evolution of Complex General Behavior , 1997, Adapt. Behav..

[22] Kuldip K. Paliwal,et al. Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..

[23] Pierre Baldi,et al. The Principled Design of Large-Scale Recursive Neural Network Architectures--DAG-RNNs and the Protein Structure Prediction Problem , 2003, J. Mach. Learn. Res..