Probabilistic Bag-Of-Hyperlinks Model for Entity Linking

Many fundamental problems in natural language processing rely on determining which entities appear in a given text. Commonly referred to as entity linking, this step is a core component of many NLP tasks such as text understanding, automatic summarization, semantic search, and machine translation. Name ambiguity, word polysemy, context dependencies, and a heavy-tailed distribution of entities all contribute to the complexity of this problem. We propose a probabilistic approach that uses an effective graphical model to perform collective entity disambiguation. Input mentions (i.e., linkable token spans) are disambiguated jointly across an entire document by combining a document-level prior of entity co-occurrences with local information captured from mentions and their surrounding context. The model is based on simple sufficient statistics extracted from data, so only a few parameters need to be learned. Our method requires neither extensive feature engineering nor an expensive training procedure. We use loopy belief propagation to perform approximate inference. The low complexity of our model makes this step fast enough for real-time usage. We demonstrate the accuracy of our approach on a wide range of benchmark datasets, showing that it matches, and in many cases outperforms, existing state-of-the-art methods.
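The joint disambiguation step can be illustrated with a minimal sketch. It is not the model described above: it assumes a max-product variant of loopy belief propagation in log space over a fully connected graph of mentions, and the function name `loopy_bp`, the candidate entities, and all scores are hypothetical values chosen only for illustration. Each mention carries local log-scores for its candidate entities, and a pairwise term rewards entity pairs that co-occur.

```python
import math

def loopy_bp(unary, pairwise, n_iters=10):
    """Max-product loopy BP in log space (illustrative sketch only).

    unary:    list of dicts, one per mention, {candidate entity: local log-score}
    pairwise: function (entity_a, entity_b) -> log co-occurrence compatibility
    Returns the highest-belief candidate for each mention.
    """
    n = len(unary)
    # msgs[(i, j)][e_j] is the message from mention i to mention j about e_j.
    msgs = {(i, j): {e: 0.0 for e in unary[j]}
            for i in range(n) for j in range(n) if i != j}
    for _ in range(n_iters):
        new_msgs = {}
        for (i, j) in msgs:
            out = {}
            for e_j in unary[j]:
                # Best explanation of e_j from mention i's point of view,
                # using messages from every neighbour of i except j.
                out[e_j] = max(
                    unary[i][e_i] + pairwise(e_i, e_j)
                    + sum(msgs[(k, i)][e_i] for k in range(n) if k not in (i, j))
                    for e_i in unary[i]
                )
            top = max(out.values())  # normalise to keep values bounded
            new_msgs[(i, j)] = {e: v - top for e, v in out.items()}
        msgs = new_msgs
    links = []
    for i in range(n):
        # Belief = local evidence plus all incoming messages.
        belief = {e: unary[i][e] + sum(msgs[(k, i)][e] for k in range(n) if k != i)
                  for e in unary[i]}
        links.append(max(belief, key=belief.get))
    return links

# Hypothetical toy document with two mentions: "Jordan" and "Bulls".
unary = [
    {"Jordan_(country)": math.log(0.6), "Michael_Jordan": math.log(0.4)},
    {"Chicago_Bulls": math.log(0.9), "Bull_(animal)": math.log(0.1)},
]
# Hypothetical co-occurrence scores; unseen pairs get a small default.
cooc = {frozenset(["Michael_Jordan", "Chicago_Bulls"]): math.log(0.9)}

def pairwise(a, b):
    return cooc.get(frozenset([a, b]), math.log(0.05))

links = loopy_bp(unary, pairwise)
print(links)  # ['Michael_Jordan', 'Chicago_Bulls']
```

In this toy case the local prior alone would pick `Jordan_(country)` for the first mention; the co-occurrence term with `Chicago_Bulls` flips the decision, which is the intuition behind disambiguating mentions collectively rather than one at a time.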
