Distinguishing Address vs. Reference Mentions of Personal Names in Text

Detecting named entities in text has long been a core NLP task. However, not much work has gone into distinguishing whether an entity mention is addressing the entity vs. referring to the entity; e.g., John, would you turn the light off? vs. John turned the light off . While this distinction is marked by a vocative case marker in some languages, many modern Indo-European languages such as English do not use such explicit vocative markers, and the distinction is left to be interpreted in context. In this paper, we present a new annotated dataset that captures the address vs. reference distinction in English, 1 an automatic tagger that performs at 85% accuracy in making this distinction, and demonstrate how this distinction is important in NLP and computational social science applications in English language.

[1]  Akshat Gupta On Building Spoken Language Understanding Systems for Low Resourced Languages , 2022, SIGMORPHON.

[2]  Walid Magdy,et al.  Overview of OSACT4 Arabic Offensive Language Detection Shared Task , 2020, OSACT.

[3]  Dongyan Zhao,et al.  Who Is Speaking to Whom? Learning to Identify Utterance Addressee in Multi-Party Conversations , 2019, EMNLP.

[4]  Margaret Mitchell,et al.  Perturbation Sensitivity Analysis to Detect Unintended Model Biases , 2019, EMNLP.

[5]  Amit Seker,et al.  What’s Wrong with Hebrew NLP? And How to Make it Right , 2019, EMNLP.

[6]  Omer Levy,et al.  What Does BERT Look at? An Analysis of BERT’s Attention , 2019, BlackboxNLP@ACL.

[7]  Steven Bethard,et al.  A Survey on Recent Advances in Named Entity Recognition from Deep Learning models , 2018, COLING.

[8]  M. Ferguson,et al.  How gender determines the way we speak about professionals , 2018, Proceedings of the National Academy of Sciences.

[9]  Yulia Tsvetkov,et al.  RtGender: A Corpus for Studying Differential Responses to Gender , 2018, LREC.

[10]  Robert Östling,et al.  Identifying Speakers and Addressees in Dialogues Extracted from Literary Fiction , 2018, LREC.

[11]  Sara Rosenthal,et al.  Detecting Influencers in Multiple Online Genres , 2017, ACM Trans. Internet Techn..

[12]  Yuta Tsuboi,et al.  Addressee and Response Selection for Multi-Party Conversation , 2016, EMNLP.

[13]  David Bamman,et al.  Annotating Character Relationships in Literary Texts , 2015, ArXiv.

[14]  Yorick Wilks,et al.  A New Dataset and Evaluation for Belief/Factuality , 2015, *SEMEVAL.

[15]  Owen Rambow,et al.  Staying on Topic: An Indicator of Power in Political Debates , 2014, EMNLP.

[16]  Owen Rambow,et al.  Predicting Power Relations between Participants in Written Dialog from a Single Thread , 2014, ACL.

[17]  Denilson Barbosa,et al.  Extracting Family Relationship Networks from Novels , 2014, ArXiv.

[18]  Dorée D. Seligmann,et al.  Who Had the Upper Hand? Ranking Participants of Interactions Based on Their Relative Power , 2013, IJCNLP.

[19]  Mónica Marrero,et al.  Named Entity Recognition: Fallacies, challenges and opportunities , 2013, Comput. Stand. Interfaces.

[20]  Mutee U. Rahman,et al.  Finite State Morphology and Sindhi Noun Inflections , 2010, PACLIC.

[21]  Rieks op den Akker,et al.  Are You Being Addressed? - Real-Time Addressee Detection to Support Remote Participants in Hybrid Meetings , 2009, SIGDIAL Conference.

[22]  Eleanor Dickey,et al.  Forms of address and terms of reference , 1997, Journal of Linguistics.

[23]  Beth M. Sundheim,et al.  Overview of Results of the MUC-6 Evaluation , 1995, MUC.

[24]  Ralf D. Brown,et al.  THE PRONOUNS OF POWER AND SOLIDARITY , 1968 .

[25]  Richard T. Brown,et al.  Address in American English. , 1961 .

[26]  Huan Liu,et al.  "Let's Eat Grandma": When Punctuation Matters in Sentence Representation for Sentiment Analysis , 2021, ArXiv.

[27]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[28]  Andrea Moro,et al.  "Notes on Vocative case: a case study in clause structure" , 2003 .

[29]  Key-Sun Choi,et al.  A Local Grammar-based Approach to Recognizing of Proper Names in Korean Texts , 1997, VLC.