Learning Finite State Representations of Recurrent Policy Networks

Recurrent neural networks (RNNs) are an effective representation of control policies for a wide range of reinforcement and imitation learning problems. RNN policies, however, are particularly difficult to explain, understand, and analyze due to their use of continuous-valued memory vectors and observation features. In this paper, we introduce a new technique, Quantized Bottleneck Insertion, to learn finite representations of these vectors and features. The result is a quantized representation of the RNN that can be analyzed to improve our understanding of memory use and general behavior. We present results of this approach on synthetic environments and six Atari games. The resulting finite representations are surprisingly small in some cases, using as few as 3 discrete memory states and 10 observations for a perfect Pong policy. We also show that these finite policy representations lead to improved interpretability.
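
To make the bottleneck idea concrete, below is a minimal PyTorch-style sketch of a quantized bottleneck: an autoencoder whose latent units are rounded to three levels {-1, 0, 1}, with gradients passed straight through the rounding step so it remains trainable. This is an illustration under assumptions, not the authors' exact architecture; the names ThreeLevelQuantize and QuantizedBottleneck are hypothetical.

    import torch
    import torch.nn as nn

    class ThreeLevelQuantize(torch.autograd.Function):
        """Round tanh-activated units to {-1, 0, 1}; use a
        straight-through estimator so gradients flow in training."""
        @staticmethod
        def forward(ctx, x):
            return torch.round(x)  # x in [-1, 1] maps to {-1, 0, 1}

        @staticmethod
        def backward(ctx, grad_output):
            return grad_output  # pass gradient through unchanged

    class QuantizedBottleneck(nn.Module):
        """Autoencoder with a discrete latent code: each distinct code
        vector identifies one finite state of the wrapped vector."""
        def __init__(self, input_dim, latent_dim):
            super().__init__()
            self.encoder = nn.Sequential(
                nn.Linear(input_dim, latent_dim),
                nn.Tanh())  # squash into [-1, 1] before quantizing
            self.decoder = nn.Linear(latent_dim, input_dim)

        def forward(self, x):
            code = ThreeLevelQuantize.apply(self.encoder(x))
            return self.decoder(code), code

In this sketch, the bottleneck would first be trained to reconstruct the RNN's continuous memory (or observation) vectors, then spliced into the policy so that the discrete code replaces the continuous vector at each step; enumerating the distinct codes that occur yields the finite set of memory states and observations.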
