Distilling Deep Reinforcement Learning Policies in Soft Decision Trees
暂无分享,去创建一个
Youri Coppens | Kyriakos Efthymiadis | Tom Lenaerts | Ann Nowe | A. Nowé | T. Lenaerts | Youri Coppens | K. Efthymiadis | Tom Lenaerts
[1] B. K. Panigrahi,et al. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE , 2010 .
[2] Thomas A. Runkler,et al. Particle swarm optimization for generating interpretable fuzzy reinforcement learning policies , 2016, Eng. Appl. Artif. Intell..
[3] Graham Kendall,et al. Editorial: IEEE Transactions on Computational Intelligence and AI in Games , 2015, IEEE Trans. Comput. Intell. AI Games.
[4] Geoffrey E. Hinton,et al. Distilling the Knowledge in a Neural Network , 2015, ArXiv.
[5] Abhinav Verma,et al. Programmatically Interpretable Reinforcement Learning , 2018, ICML.
[6] Antonio Criminisi,et al. Adaptive Neural Trees , 2018, ICML.
[7] Oliver Schulte,et al. Toward Interpretable Deep Reinforcement Learning with Linear Model U-Trees , 2018, ECML/PKDD.
[8] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.
[9] Julian Togelius,et al. The Mario AI Benchmark and Competitions , 2012, IEEE Transactions on Computational Intelligence and AI in Games.
[10] Geoffrey E. Hinton,et al. Distilling a Neural Network Into a Soft Decision Tree , 2017, CEx@AI*IA.
[11] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[12] Thomas A. Runkler,et al. Interpretable Policies for Reinforcement Learning by Genetic Programming , 2017, Eng. Appl. Artif. Intell..
[13] M. V. Rossum,et al. In Neural Computation , 2022 .
[14] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[15] Thomas G. Dietterich. What is machine learning? , 2020, Archives of Disease in Childhood.
[16] Razvan Pascanu,et al. Policy Distillation , 2015, ICLR.
[17] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[18] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.