L G ] 1 0 A pr 2 01 9 Programmatically Interpretable Reinforcement Learning
暂无分享,去创建一个
[1] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[2] Jürgen Schmidhuber,et al. Evolving large-scale neural networks for vision-based TORCS , 2013, FDG.
[3] Jasper Snoek,et al. Practical Bayesian Optimization of Machine Learning Algorithms , 2012, NIPS.
[4] Quoc V. Le,et al. Neural Program Synthesis with Priority Queue Training , 2018, ArXiv.
[5] Murray Shanahan,et al. Towards Deep Symbolic Reinforcement Learning , 2016, ArXiv.
[6] Pushmeet Kohli,et al. RobustFill: Neural Program Learning under Noisy I/O , 2017, ICML.
[7] Tore Hägglund,et al. Automatic tuning of simple regulators with specifications on phase and amplitude margins , 1984, Autom..
[8] Tom Schaul,et al. Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.
[9] Karl Johan Åström,et al. PID Controllers: Theory, Design, and Tuning , 1995 .
[10] Joshua B. Tenenbaum,et al. Building machines that learn and think like people , 2016, Behavioral and Brain Sciences.
[11] Daniele Loiacono,et al. Learning to overtake in TORCS using simple reinforcement learning , 2010, IEEE Congress on Evolutionary Computation.
[12] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[13] Richard S. Sutton,et al. Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.
[14] Isil Dillig,et al. Synthesizing data structure transformations from input-output examples , 2015, PLDI.
[15] Wojciech Samek,et al. Methods for interpreting and understanding deep neural networks , 2017, Digit. Signal Process..
[16] James C. King,et al. Symbolic execution and program testing , 1976, CACM.
[17] Juan Julián Merelo Guervós,et al. Driving in TORCS Using Modular Fuzzy Controllers , 2017, EvoApplications.
[18] Mykel J. Kochenderfer,et al. Reluplex: An Efficient SMT Solver for Verifying Deep Neural Networks , 2017, CAV.
[19] Swarat Chaudhuri,et al. Neural Sketch Learning for Conditional Program Generation , 2017, ICLR.
[20] Zachary Chase Lipton. The mythos of model interpretability , 2016, ACM Queue.
[21] Matthew J. Hausknecht,et al. Leveraging Grammar and Reinforcement Learning for Neural Program Synthesis , 2018, ICLR.
[22] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[23] Murray Shanahan,et al. Perception as Abduction: Turning Sensor Data Into Meaningful Representation , 2005, Cogn. Sci..
[24] Koushik Sen,et al. Symbolic execution for software testing: three decades later , 2013, CACM.
[25] Stefan Schaal,et al. Is imitation learning the route to humanoid robots? , 1999, Trends in Cognitive Sciences.
[26] Alborz Geramifard,et al. RLPy: a value-function-based reinforcement learning framework for education and research , 2015, J. Mach. Learn. Res..