Probing Pre-trained Auto-regressive Language Models for Named Entity Typing and Recognition

Multiple works have proposed probing language models (LMs) for generalization in named entity (NE) typing (NET) and recognition (NER). However, little has been done in this direction for auto-regressive models, despite their popularity and their potential to express a wide variety of NLP tasks in the same unified format. We propose a new methodology, inspired by human linguistic behavior, to probe auto-regressive LMs for NET and NER generalization by resorting to meta-learning. We study NEs of various types individually by designing a zero-shot transfer strategy for NET. We then probe the model for NER by providing a few examples at inference time. We also introduce a novel procedure to assess the model's memorization of NEs and report its impact on the results. Our findings show that: 1) GPT2, a common pre-trained auto-regressive LM, performs both tasks fairly well without any fine-tuning for NET or NER; 2) name irregularity, when common for an NE type, can be an effective exploitable cue; 3) the model seems to rely more on NE cues than on contextual cues in few-shot NER; 4) NEs whose words were absent during LM pre-training are very challenging for both NET and NER.
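The abstract describes two probing setups: a zero-shot transfer strategy for NET and in-context examples for few-shot NER. Below is a minimal, illustrative sketch of how such probing could be implemented with GPT2 and the HuggingFace transformers library. The prompt templates and label verbalizations are assumptions made for illustration, not the paper's exact prompts.

```python
# Minimal sketch of the two probing setups described in the abstract,
# assuming the HuggingFace transformers library and GPT2. The prompt
# templates and label verbalizations below are illustrative
# assumptions, not the paper's exact prompts.
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

# --- Zero-shot NE typing: score one verbalized statement per type ---

TYPE_VERBALIZATIONS = {  # hypothetical label-to-word mapping
    "PER": "person",
    "LOC": "location",
    "ORG": "organization",
}

def sequence_log_prob(text: str) -> float:
    """Total log-probability of `text` under the LM."""
    ids = tokenizer(text, return_tensors="pt").input_ids
    with torch.no_grad():
        # With labels=input_ids the model returns the mean token-level
        # cross-entropy; negate and rescale to a summed log-probability.
        loss = model(ids, labels=ids).loss
    return -loss.item() * (ids.size(1) - 1)

def zero_shot_type(entity: str) -> str:
    """Assign the type whose verbalized statement the LM scores highest."""
    scores = {
        label: sequence_log_prob(f"{entity} is a {word}.")
        for label, word in TYPE_VERBALIZATIONS.items()
    }
    return max(scores, key=scores.get)

print(zero_shot_type("Paris"))  # expected: LOC (model-dependent)

# --- Few-shot NER: a handful of in-context examples at inference ---

prompt = (
    "Sentence: Barack Obama visited Berlin.\n"
    "Person entities: Barack Obama\n\n"
    "Sentence: Angela Merkel met Emmanuel Macron in Paris.\n"
    "Person entities: Angela Merkel, Emmanuel Macron\n\n"
    "Sentence: Tim Cook announced the new iPhone.\n"
    "Person entities:"
)
ids = tokenizer(prompt, return_tensors="pt").input_ids
out = model.generate(ids, max_new_tokens=10, do_sample=False,
                     pad_token_id=tokenizer.eos_token_id)
print(tokenizer.decode(out[0, ids.size(1):]))  # ideally " Tim Cook"
```

One caveat on the zero-shot scoring: because verbalizations such as "person" and "organization" can differ in token count, summed log-probabilities carry a length bias; per-token (length-normalized) log-probability is a common alternative when candidate statements vary in length.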
