[1] Yongdong Zhang, et al. Curriculum Learning for Natural Language Understanding, 2020, ACL.
[2] Jackie Chi Kit Cheung, et al. BanditSum: Extractive Summarization as a Contextual Bandit, 2018, EMNLP.
[3] Eve V. Clark, et al. Conversation and Language Acquisition: A Pragmatic Approach, 2018.
[4] John Batali, et al. Artificial Evolution of Syntactic Aptitude, 1994, Proceedings of the Sixteenth Annual Conference of the Cognitive Science Society.
[5] Elia Bruni, et al. The Grammar of Emergent Languages, 2020, EMNLP.
[6] SHAPELURN: An Interactive Language Learning Game with Logical Inference, 2021, INTERNLP.
[7] Max Welling, et al. Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for Sampling Sequences Without Replacement, 2019, ICML.
[8] Dejiao Zhang, et al. Lifelong Pretraining: Continually Adapting Language Models to Emerging Corpora, 2021, ArXiv.
[9] Mirella Lapata, et al. Ranking Sentences for Extractive Summarization with Reinforcement Learning, 2018, NAACL.
[10] Phil Blunsom, et al. Pitfalls of Static Language Modelling, 2021, ArXiv.
[11] Ming-Wei Chang, et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding, 2019, NAACL.
[12] Ronald J. Williams. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning, 1992, Machine Learning.
[13] Jason Weston, et al. Curriculum learning, 2009, ICML '09.
[14] Janet Wiles, et al. Learning to count without a counter: A case study of dynamics and activation landscapes in recurrent networks, 1995.
[15] Zheng Cao, et al. Reducing BERT Computation by Padding Removal and Curriculum Learning, 2021, 2021 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS).
[16] Mark Chen, et al. Language Models are Few-Shot Learners, 2020, NeurIPS.
[17] Elia Bruni, et al. Co-evolution of language and agents in referential games, 2020, EACL.
[18] Lukasz Kaiser, et al. Attention is All you Need, 2017, NIPS.
[19] Paul Rodríguez, et al. Simple Recurrent Networks Learn Context-Free and Context-Sensitive Languages by Counting, 2001, Neural Computation.
[20] Marco Baroni, et al. Generalization without Systematicity: On the Compositional Skills of Sequence-to-Sequence Recurrent Networks, 2017, ICML.
[21] Samuel R. Bowman, et al. BLiMP: A Benchmark of Linguistic Minimal Pairs for English, 2019, SCIL.
[22] Geoffrey E. Hinton, et al. Visualizing Data using t-SNE, 2008.
[23] Eyal Shnarch, et al. Active Learning for BERT: An Empirical Study, 2020, EMNLP.
[24] J. Bruner. Child's Talk: Learning to Use Language, 1985.
[25] R. French. Catastrophic forgetting in connectionist networks, 1999, Trends in Cognitive Sciences.
[26] Omer Levy, et al. RoBERTa: A Robustly Optimized BERT Pretraining Approach, 2019, ArXiv.
[27] Udo Hahn, et al. Multi-Task Active Learning for Linguistic Annotations, 2008, ACL.
[28] Michael McCloskey, et al. Catastrophic Interference in Connectionist Networks: The Sequential Learning Problem, 1989.
[29] Susan Goldin-Meadow, et al. Language input and acquisition in a Mayan village: how important is directed speech?, 2012, Developmental Science.
[30] Paul Rodríguez, et al. A Recurrent Neural Network that Learns to Count, 1999, Connect. Sci.
[31] David A. Cohn, et al. Active Learning with Statistical Models, 1996, NIPS.
[32] Mathijs Mul, et al. Compositionality Decomposed: How do Neural Networks Generalise?, 2019, J. Artif. Intell. Res.
[33] Modeling the Interaction Between Perception-Based and Production-Based Learning in Children’s Early Acquisition of Semantic Knowledge, 2021, CONLL.
[34] Eugene Kharitonov, et al. Can Transformers Jump Around Right in Natural Language? Assessing Performance Transfer from SCAN, 2021, BLACKBOXNLP.
[35] Myle Ott, et al. fairseq: A Fast, Extensible Toolkit for Sequence Modeling, 2019, NAACL.
[36] John Schulman, et al. Teacher-Student Curriculum Learning, 2017, IEEE Transactions on Neural Networks and Learning Systems.
[37] Willem H. Zuidema, et al. Visualisation and 'diagnostic classifiers' reveal how recurrent and recursive neural networks process hierarchical structure, 2017, J. Artif. Intell. Res.
[38] Jürgen Schmidhuber, et al. LSTM recurrent networks learn simple context-free and context-sensitive languages, 2001, IEEE Trans. Neural Networks.
[39] S. Gillis, et al. Kindertaalverwerving: een handboek voor het Nederlands [Child language acquisition: a handbook for Dutch], 2000.
[40] Chin-Yew Lin. ROUGE: A Package for Automatic Evaluation of Summaries, 2004, ACL 2004.
[41] Alejandrina Cristia, et al. Segmentability Differences Between Child-Directed and Adult-Directed Speech: A Systematic Test With an Ecologically Valid Corpus, 2019, Open Mind.
[42] Elia Bruni, et al. Internal and External Pressures on Language Emergence: Least Effort, Object Constancy and Frequency, 2020, FINDINGS.
[43] Pushmeet Kohli, et al. Analysing Mathematical Reasoning Abilities of Neural Models, 2019, ICLR.
[44] Alex Graves, et al. Asynchronous Methods for Deep Reinforcement Learning, 2016, ICML.