Language-based General Action Template for Reinforcement Learning Agents

Prior knowledge plays a critical role in decision-making, and humans preserve such knowledge in the form of natural language (NL). To emulate real-world decision-making, artificial agents should incorporate such generic knowledge into their decision-making framework through NL. However, since policy learning with NL-based action representations is intractable due to NL's combinatorial complexity, previous studies have limited agents' expressive power to a single specific environment, sacrificing generalization to other environments. This paper proposes a new environment-agnostic action framework, the language-based general action template (L-GAT). We design action templates on the basis of general semantic schemes (FrameNet, VerbNet, and WordNet), helping the agent find a plausible action in a given state by using prior knowledge while covering broader types of actions in a general manner. Our experiment on 18 text-based games showed that the proposed L-GAT agent, which uses the same actions across games, achieved performance competitive with agents that rely on game-specific actions. We have published the code at https://github.com/kohilin/lgat.
