Transferring Procedural Knowledge across Commonsense Tasks

Stories about everyday situations are an essential part of human communication, motivating the need to develop AI agents that can reliably understand these stories. Despite the long list of supervised methods for story completion and procedural understanding, current AI has no mechanisms to automatically track and explain procedures in unseen stories. To bridge this gap, we study the ability of AI models to transfer procedural knowledge to novel narrative tasks in a transparent manner. We design LEAP: a comprehensive framework that integrates state-of-the-art modeling architectures, training regimes, and augmentation strategies based on both natural and synthetic stories. To address the lack of densely annotated training data, we devise a robust automatic labeler based on few-shot prompting to enhance the augmented data. Our experiments with in- and out-of-domain tasks reveal insights into the interplay of different architectures, training regimes, and augmentation strategies. LEAP's labeler has a clear positive impact on out-of-domain datasets, while the resulting dense annotation provides native explainability.
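To make the labeler idea concrete: few-shot prompting means concatenating a handful of annotated demonstrations with an unlabeled query and taking the model's completion as the label, applied here to every (sentence, entity) pair so that stories receive dense state annotations. The sketch below illustrates that general pattern only; the prompt template, demonstration triples, and all function names are illustrative assumptions, not LEAP's actual implementation, and `query_llm` stands in for whatever LLM wrapper the reader has available.

```python
"""Minimal sketch of a few-shot prompting labeler for dense entity-state
annotation. All names, the prompt format, and the demonstrations are
illustrative assumptions, not the paper's actual implementation."""

from typing import Callable, List, Tuple

# Hand-written demonstrations: (sentence, entity, state label).
DEMONSTRATIONS: List[Tuple[str, str, str]] = [
    ("Tom put the ice cream in the freezer.", "ice cream", "frozen"),
    ("Mary poured the milk into the bowl.", "milk", "in the bowl"),
    ("The sun melted the snowman.", "snowman", "melted"),
]


def build_prompt(sentence: str, entity: str) -> str:
    """Concatenate the demonstrations with the query in a fixed template."""
    lines: List[str] = []
    for demo_sentence, demo_entity, demo_state in DEMONSTRATIONS:
        lines += [f"Sentence: {demo_sentence}",
                  f"Entity: {demo_entity}",
                  f"State: {demo_state}",
                  ""]
    lines += [f"Sentence: {sentence}", f"Entity: {entity}", "State:"]
    return "\n".join(lines)


def label_story(story: List[str], entities: List[str],
                query_llm: Callable[[str], str]) -> List[dict]:
    """Densely annotate every (sentence, entity) pair in a story.

    `query_llm` is any function mapping a prompt string to the model's
    completion, e.g. a thin wrapper around a hosted LLM API.
    """
    annotations = []
    for sentence in story:
        for entity in entities:
            state = query_llm(build_prompt(sentence, entity)).strip()
            annotations.append(
                {"sentence": sentence, "entity": entity, "state": state})
    return annotations


if __name__ == "__main__":
    # Toy stand-in for an LLM so the sketch runs end to end.
    fake_llm = lambda prompt: "unknown"
    story = ["Anna put the kettle on the stove.", "The water boiled."]
    print(label_story(story, ["water", "kettle"], fake_llm))
```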
