Memory Augmented Policy Optimization for Program Synthesis and Semantic Parsing
暂无分享,去创建一个
Chen Liang | Quoc V. Le | Ni Lao | Jonathan Berant | Mohammad Norouzi | Mohammad Norouzi | Chen Liang | Jonathan Berant | N. Lao
[1] Martín Abadi,et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.
[2] Ming Zhou,et al. Semantic Parsing with Syntax- and Table-Aware SQL Generation , 2018, ACL.
[3] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[4] Radu Soricut,et al. Cold-Start Reinforcement Learning with Softmax Policy Gradient , 2017, NIPS.
[5] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[6] Jianfeng Gao,et al. Deep Reinforcement Learning for Dialogue Generation , 2016, EMNLP.
[7] Dan Klein,et al. Learning Dependency-Based Compositional Semantics , 2011, CL.
[8] Wojciech Zaremba,et al. Reinforcement Learning Neural Turing Machines , 2015, ArXiv.
[9] Filip De Turck,et al. #Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning , 2016, NIPS.
[10] Samy Bengio,et al. Neural Combinatorial Optimization with Reinforcement Learning , 2016, ICLR.
[11] Wojciech Zaremba,et al. Reinforcement Learning Neural Turing Machines - Revised , 2015 .
[12] Chen Liang,et al. Neural Symbolic Machines: Learning Semantic Parsers on Freebase with Weak Supervision , 2016, ACL.
[13] Sebastian Nowozin,et al. Structured Learning and Prediction in Computer Vision , 2011, Found. Trends Comput. Graph. Vis..
[14] Percy Liang,et al. From Language to Programs: Bridging Reinforcement Learning and Maximum Marginal Likelihood , 2017, ACL.
[15] Richard Socher,et al. Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning , 2018, ArXiv.
[16] Alexandre M. Bayen,et al. Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines , 2018, ICLR.
[17] Nicolas Le Roux. Tighter bounds lead to improved classifiers , 2017, ICLR.
[18] Jayant Krishnamurthy,et al. Neural Semantic Parsing with Type Constraints for Semi-Structured Tables , 2017, EMNLP.
[19] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[20] Alexei A. Efros,et al. Curiosity-Driven Exploration by Self-Supervised Prediction , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).
[21] Dengyong Zhou,et al. Action-depedent Control Variates for Policy Optimization via Stein's Identity , 2017 .
[22] Matthew J. Hausknecht,et al. Leveraging Grammar and Reinforcement Learning for Neural Program Synthesis , 2018, ICLR.
[23] John N. Tsitsiklis,et al. Actor-Critic Algorithms , 1999, NIPS.
[24] Vijay Vasudevan,et al. Learning Transferable Architectures for Scalable Image Recognition , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[25] Geoffrey J. Gordon,et al. A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning , 2010, AISTATS.
[26] Martín Abadi,et al. Learning a Natural Language Interface with Neural Programmer , 2016, ICLR.
[27] Martha White,et al. Linear Off-Policy Actor-Critic , 2012, ICML.
[28] Marc G. Bellemare,et al. The Arcade Learning Environment: An Evaluation Platform for General Agents , 2012, J. Artif. Intell. Res..
[29] Marc'Aurelio Ranzato,et al. Sequence Level Training with Recurrent Neural Networks , 2015, ICLR.
[30] Octavian-Eugen Ganea,et al. Neural Multi-step Reasoning for Question Answering on Semi-structured Tables , 2017, ECIR.
[31] Yuchen Zhang,et al. Macro Grammars and Holistic Triggering for Efficient Semantic Parsing , 2017, EMNLP.
[32] Sham M. Kakade,et al. Towards Generalization and Simplicity in Continuous Control , 2017, NIPS.
[33] Raymond J. Mooney,et al. Learning to Parse Database Queries Using Inductive Logic Programming , 1996, AAAI/IAAI, Vol. 2.
[34] Percy Liang,et al. Compositional Semantic Parsing on Semi-Structured Tables , 2015, ACL.
[35] Tao Yu,et al. TypeSQL: Knowledge-Based Type-Aware Neural Text-to-SQL Generation , 2018, NAACL.
[36] Quoc V. Le,et al. Neural Program Synthesis with Priority Queue Training , 2018, ArXiv.
[37] Wojciech Zaremba,et al. OpenAI Gym , 2016, ArXiv.
[38] Dawn Xiaodong Song,et al. SQLNet: Generating Structured Queries From Natural Language Without Reinforcement Learning , 2017, ArXiv.
[39] Chenglong Wang,et al. Pointing Out SQL Queries From Text , 2018 .
[40] John Schulman,et al. Gotta Learn Fast: A New Benchmark for Generalization in RL , 2018, ArXiv.
[41] Stefan Schaal,et al. Policy Gradient Methods for Robotics , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[42] Mirella Lapata,et al. Coarse-to-Fine Decoding for Neural Semantic Parsing , 2018, ACL.
[43] Longxin Lin. Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching , 2004, Machine Learning.
[44] Demis Hassabis,et al. Mastering the game of Go without human knowledge , 2017, Nature.
[45] Satinder Singh,et al. Self-Imitation Learning , 2018, ICML.
[46] Dale Schuurmans,et al. Reward Augmented Maximum Likelihood for Neural Structured Prediction , 2016, NIPS.
[47] Sebastian Nowozin,et al. DeepCoder: Learning to Write Programs , 2016, ICLR.
[48] R. J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[49] Filip De Turck,et al. VIME: Variational Information Maximizing Exploration , 2016, NIPS.
[50] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.
[51] Luke S. Zettlemoyer,et al. Learning to Map Sentences to Logical Form: Structured Classification with Probabilistic Categorial Grammars , 2005, UAI.
[52] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.
[53] Ming-Wei Chang,et al. The Value of Semantic Parse Labeling for Knowledge Base Question Answering , 2016, ACL.
[54] Po-Sen Huang,et al. Natural Language to Structured Query Generation via Meta-Learning , 2018, NAACL.
[55] Stefan Lee,et al. Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[56] Marcin Andrychowicz,et al. Hindsight Experience Replay , 2017, NIPS.
[57] Shane Legg,et al. IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures , 2018, ICML.
[58] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[59] Benjamin Recht,et al. Simple random search provides a competitive approach to reinforcement learning , 2018, ArXiv.
[60] Dale Schuurmans,et al. Bridging the Gap Between Value and Policy Based Reinforcement Learning , 2017, NIPS.
[61] Quoc V. Le,et al. Neural Architecture Search with Reinforcement Learning , 2016, ICLR.
[62] George Kurian,et al. Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation , 2016, ArXiv.
[63] David Duvenaud,et al. Backpropagation through the Void: Optimizing control variates for black-box gradient estimation , 2017, ICLR.
[64] Percy Liang,et al. Inferring Logical Forms From Denotations , 2016, ACL.
[65] Nando de Freitas,et al. Sample Efficient Actor-Critic with Experience Replay , 2016, ICLR.
[66] Ankur Taly,et al. It was the training data pruning too! , 2018, ArXiv.
[67] Hao Liu,et al. Sample-efficient Policy Optimization with Stein Control Variate , 2017, ArXiv.
[68] Andrew Chou,et al. Semantic Parsing on Freebase from Question-Answer Pairs , 2013, EMNLP.
[69] Tom Schaul,et al. Prioritized Experience Replay , 2015, ICLR.