Extracting Action Sequences from Texts Based on Deep Reinforcement Learning

Extracting action sequences from natural language texts is challenging, as it requires commonsense inferences based on world knowledge. Although there has been work on extracting action scripts, instructions, navigation actions, etc., these approaches require either that the set of candidate actions be provided in advance or that action descriptions be restricted to a specific form, e.g., description templates. In this paper, we aim to extract action sequences from texts in free natural language, i.e., without any restricted templates, even when the set of candidate actions is unknown. We propose to extract action sequences from texts based on the deep reinforcement learning framework. Specifically, we view "selecting" or "eliminating" words from texts as "actions", and the texts associated with those actions as "states". We then build Q-networks to learn a policy for extracting actions and plans from labeled texts. We demonstrate the effectiveness of our approach on several datasets in comparison with state-of-the-art approaches, including online experiments in which humans interact with the system.
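To make the formulation concrete, the following is a minimal sketch (PyTorch-style, not the authors' implementation) of how a word-level Q-network could score the two actions "select" and "eliminate" for every word of a text state. The class name, encoder choice, and hyperparameters are illustrative assumptions rather than details taken from the paper.

# Minimal sketch, assuming a PyTorch environment: a Q-network that assigns
# per-word Q-values to two actions, 0 = eliminate and 1 = select.
import torch
import torch.nn as nn

class WordActionQNetwork(nn.Module):
    def __init__(self, vocab_size=10000, embed_dim=100, hidden_dim=128, num_actions=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        # A bidirectional LSTM encodes the text "state"; its per-word hidden
        # states feed a linear head that produces Q-values for the two
        # word-level actions.
        self.encoder = nn.LSTM(embed_dim, hidden_dim,
                               batch_first=True, bidirectional=True)
        self.q_head = nn.Linear(2 * hidden_dim, num_actions)

    def forward(self, token_ids):
        # token_ids: (batch, seq_len) integer word indices representing the state.
        embedded = self.embed(token_ids)
        encoded, _ = self.encoder(embedded)
        return self.q_head(encoded)  # (batch, seq_len, num_actions)

# Usage: pick the greedy action (select or eliminate) for each word.
if __name__ == "__main__":
    net = WordActionQNetwork()
    tokens = torch.randint(0, 10000, (1, 12))   # a dummy 12-word sentence
    q_values = net(tokens)
    greedy_actions = q_values.argmax(dim=-1)    # 1 marks words kept as actions
    print(greedy_actions)

In this sketch, a greedy argmax over the per-word Q-values recovers the extracted action words; in the approach described above, the network would instead be trained with a reward signal derived from the labeled texts.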
