The role of knowledge in determining identity of long-tail entities

Abstract Identifying entities in text is an important step of semantic analysis. Some entity mentions comprise a name or description, but many include no information that identifies them in the system’s knowledge resources, which means that their identity cannot be established through traditional disambiguation. Consequently, such NIL (not in lexicon) entities have received little attention in entity linking systems and tasks so far. However, given the non-redundancy of knowledge on NIL entities, their lack of frequency priors, their potentially extreme ambiguity, and their numerousness, they constitute an important class of long-tail entities and pose a great challenge for state-of-the-art systems. In this paper, we describe a method for imputing identifying knowledge to NILs from generalized characteristics. We enrich the locally extracted information with profile models that rely on background knowledge in Wikidata. We describe and implement two profiling machines using state-of-the-art neural models. We evaluate their intrinsic behavior and their impact on the task of determining the identity of NIL entities.

[1]  Piek T. J. M. Vossen,et al.  SemEval-2018 Task 5: Counting Events and Participants in the Long Tail , 2018, *SEMEVAL.

[2]  Chris Callison-Burch,et al.  The Gun Violence Database: A new task and data set for NLP , 2016, EMNLP.

[3]  Ian Reid,et al.  Framing and Interpretation , 1994 .

[4]  Roberto Navigli,et al.  Entity Linking meets Word Sense Disambiguation: a Unified Approach , 2014, TACL.

[5]  T István Nagy,et al.  Person attribute extraction from the textual parts of web pages , 2012 .

[6]  Graeme Hirst,et al.  Distributional Measures as Proxies for Semantic Relatedness , 2012, ArXiv.

[7]  Zhe Zhao,et al.  K-BERT: Enabling Language Representation with Knowledge Graph , 2019, AAAI.

[8]  Judita Preiss,et al.  Refining the most frequent sense baseline , 2009, SEW@NAACL-HLT.

[9]  Eduard H. Hovy,et al.  An Interpretable Knowledge Transfer Model for Knowledge Base Completion , 2017, ACL.

[10]  Jeffrey Nichols,et al.  Where Is This Tweet From? Inferring Home Locations of Twitter Users , 2012, ICWSM.

[11]  Axel-Cyrille Ngonga Ngomo,et al.  MAG: A Multilingual, Knowledge-base Agnostic and Deterministic Entity Linking Approach , 2017, K-CAP.

[12]  Adam Tauman Kalai,et al.  Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings , 2016, NIPS.

[13]  David Jurgens,et al.  That's What Friends Are For: Inferring Location in Online Social Media Platforms Based on Social Relationships , 2013, ICWSM.

[14]  Harry Zhang,et al.  The Optimality of Naive Bayes , 2004, FLAIRS.

[15]  Jin Liu,et al.  Extracting Attributes of Named Entity from Unstructured Text with Deep Belief Network , 2016 .

[16]  Jens Lehmann,et al.  Old is Gold: Linguistic Driven Approach for Entity and Relation Linking of Short Text , 2019, NAACL.

[17]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[18]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[19]  Hiroyuki Shindo,et al.  Joint Learning of the Embedding of Words and Entities for Named Entity Disambiguation , 2016, CoNLL.

[20]  Heike Adel,et al.  Comparing Convolutional Neural Networks to Traditional Models for Slot Filling , 2016, NAACL.

[21]  L. Jussim,et al.  Stereotype (In)Accuracy in Perceptions of Groups and Individuals , 2015 .

[22]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[23]  Piek T. J. M. Vossen,et al.  Hunger for Contextual Knowledge and a Road Map to Intelligent Entity Linking , 2017, LDK.

[24]  Andrew McCallum,et al.  Universal schema for entity type prediction , 2013, AKBC '13.

[25]  Wei Zhang,et al.  Knowledge vault: a web-scale approach to probabilistic knowledge fusion , 2014, KDD.

[26]  Z. Kunda,et al.  Social Cognition: Making Sense of People , 1999 .

[27]  Tariq Samad,et al.  Imputation of Missing Data in Industrial Databases , 1999, Applied Intelligence.

[28]  Cong Yu,et al.  LONLIES: Estimating Property Values for Long Tail Entities , 2016, SIGIR.

[29]  Romaric Besançon,et al.  Apprendre des représentations jointes de mots et d’entités pour la désambiguïsation d’entités (Combining Word and Entity Embeddings for Entity Linking) , 2017, JEPTALNRECITAL.

[30]  D. Kahneman Thinking, Fast and Slow , 2011 .

[31]  John Miller,et al.  Traversing Knowledge Graphs in Vector Space , 2015, EMNLP.

[32]  Jason Weston,et al.  Translating Embeddings for Modeling Multi-relational Data , 2013, NIPS.

[33]  Arpit Mittal,et al.  Demand-Weighted Completeness Prediction for a Knowledge Base , 2018, NAACL.

[34]  A. Tversky,et al.  Prospect Theory : An Analysis of Decision under Risk Author ( s ) : , 2007 .

[35]  G. Stone,et al.  Two Kinds of Accuracy in Predicting Another's Responses , 1957 .

[36]  D. Hamilton Conceptual Approaches to Stereotypes and Stereotyping , 2015 .

[37]  Ronald K. Pearson,et al.  The problem of disguised missing data , 2006, SKDD.

[38]  A. Dijker,et al.  Stereotyping and attitudinal effects under time pressure , 1996 .